From Poincare Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139(2021)

引用 74|浏览396
暂无评分
摘要
In this paper we investigate the Follow the Regularized Leader dynamics in sequential imperfect information games (IIG). We generalize existing results of Poincare recurrence from normal-form games to zero-sum two-player imperfect information games and other sequential game settings. We then investigate how adapting the reward (by adding a regularization term) of the game can give strong convergence guarantees in monotone games. We continue by showing how this reward adaptation technique can be leveraged to build algorithms that converge exactly to the Nash equilibrium. Finally, we show how these insights can be directly used to build state-of-the-art model-free algorithms for zero-sum two-player Imperfect Information Games (IIG).
更多
查看译文
关键词
imperfect information games,equilibrium,recurrence,convergence
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要