MoHex 2.0: A Pattern-Based MCTS Hex Player

Computers and Games, CG 2013 (2014)

Abstract
In recent years the Monte Carlo tree search (MCTS) revolution has spread from computer Go to many areas, including computer Hex. MCTS-based Hex players now outperform traditional knowledge-based alpha-beta search players, and the reigning Computer Olympiad Hex gold medallist is the MCTS player MoHex. In this paper we show how to strengthen MoHex, and observe that, as in computer Go, using learned patterns in priors and replacing a hand-crafted simulation policy by a softmax policy that uses learned patterns significantly increases playing strength. The result is MoHex 2.0, about 250 Elo points stronger than MoHex on the 11x11 board, and 300 Elo points stronger on the 13x13 board.
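To give a feel for what the reported Elo gains mean in practice, the sketch below converts an Elo difference into an expected score using the standard logistic Elo model. This conversion is an assumption added here for illustration, not a calculation taken from the paper, and the function name expected_score is hypothetical.

```python
def expected_score(elo_diff: float) -> float:
    """Expected score of the stronger player under the standard logistic
    Elo model (an illustrative assumption, not the paper's own analysis)."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


if __name__ == "__main__":
    # Elo gains reported in the abstract for the 11x11 and 13x13 boards.
    for board, gain in (("11x11", 250), ("13x13", 300)):
        print(f"{board}: +{gain} Elo -> expected score {expected_score(gain):.3f}")
```

Under this model, a 250-point gain corresponds to an expected score of roughly 0.81 against the older player, and a 300-point gain to roughly 0.85.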
Keywords
Probabilistic Simulation, Simulation Policy, Exploration Term, Virtual Connection, Visit Count