Learning to Maximize Return in a Stag Hunt Collaborative Scenario through Deep Reinforcement Learning

2017 19th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)(2017)

引用 0|浏览6
暂无评分
摘要
In this paper we present a deep reinforcement learning approach for learning to play a time extended social dilemma game in a simulated environment. Agents face different types of adversaries with different levels of commitment to a collaborative strategy. Our method builds on recent advances in policy gradient training using deep neural networks. We investigate multiple stochastic gradient algorithms such as Reinforce or Actor Critic with auxiliary tasks for faster convergence.
更多
查看译文
关键词
deep reinforcement learning,social dilemmas,policy gradient
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要