Learning To Coordinate In A Beauty Contest Game

Pooya Molavi,Ceyhun Eksin,Alejandro Ribeiro,Ali Jadbabaie

2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC)（2013）

引用 3|浏览7

暂无评分

摘要

We study a dynamic game in which a group of players attempt to coordinate on a desired, but only partially known, outcome. The desired outcome is represented by an unknown state of the world. Agents' stage payoffs are represented by a quadratic utility function that captures the kind of tradeoff exemplified by the Keynesian beauty contest: each agent's stage payoff is decreasing in the distance between her action and the unknown state; it is also decreasing in the distance between her action and the average action taken by other agents. The agents thus have the incentive to correctly estimate the state while trying to coordinate with and learn from others. We show that myopic, but Bayesian, agents who repeatedly play this game and observe the actions of their neighbors in a connected network eventually succeed in coordinating on a single action. However, as we show through an example, the consensus action is not necessarily optimal given all the available information.

查看译文

关键词

multi agent systems,learning artificial intelligence,game theory

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要