Learning To Coordinate In A Beauty Contest Game

2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC)(2013)

引用 3|浏览7
暂无评分
摘要
We study a dynamic game in which a group of players attempt to coordinate on a desired, but only partially known, outcome. The desired outcome is represented by an unknown state of the world. Agents' stage payoffs are represented by a quadratic utility function that captures the kind of tradeoff exemplified by the Keynesian beauty contest: each agent's stage payoff is decreasing in the distance between her action and the unknown state; it is also decreasing in the distance between her action and the average action taken by other agents. The agents thus have the incentive to correctly estimate the state while trying to coordinate with and learn from others. We show that myopic, but Bayesian, agents who repeatedly play this game and observe the actions of their neighbors in a connected network eventually succeed in coordinating on a single action. However, as we show through an example, the consensus action is not necessarily optimal given all the available information.
更多
查看译文
关键词
multi agent systems,learning artificial intelligence,game theory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要