Human-Agent Cooperation in Bridge Bidding

Edward Lockhart,Neil Burch,Nolan Bard,Sebastian Borgeaud,Tom Eccles,Lucas Smaira, Ray Smith

arxiv（2020）

引用 1|浏览37

暂无评分

摘要

We introduce a human-compatible reinforcement-learning approach to a cooperative game, making use of a third-party hand-coded human-compatible bot to generate initial training data and to perform initial evaluation. Our learning approach consists of imitation learning, search, and policy iteration. Our trained agents achieve a new state-of-the-art for bridge bidding in three settings: an agent playing in partnership with a copy of itself; an agent partnering a pre-existing bot; and an agent partnering a human player.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要