Simple learning in weakly acyclic games and convergence to Nash equilibria.

2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON)(2015)

引用 1|浏览15
暂无评分
摘要
We study the evolution of action profiles played by players when they revise their actions using a simple better-reply update rule. We employ a continuous-time model, in which each player is allowed to update its strategy in accordance with a Poisson process. We demonstrate that when payoff information experiences no delay and players make decisions based on up-to-date information, action profiles generated by the players converge almost surely to a Nash equilibrium if and only if the game is weakly acyclic. Interestingly, we prove that, in some cases, if the payoff information is delayed, the action profiles are guaranteed to converge almost surely to a Nash equilibrium in a strictly larger class of games which includes weakly acyclic games as special cases.
更多
查看译文
关键词
payoff information,Poisson process,continuous-time model,action profiles,Nash equilibria,weakly acyclic games,simple learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要