Learning to Maximize Return in a Stag Hunt Collaborative Scenario through Deep Reinforcement Learning

Andrei Cristian Nica, Tudor Berariu, Florin Gogianu, Adina Magda Florea

2017 19th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) (2017)

Abstract

In this paper, we present a deep reinforcement learning approach for learning to play a time-extended social dilemma game in a simulated environment. Agents face different types of adversaries with different levels of commitment to a collaborative strategy. Our method builds on recent advances in policy gradient training using deep neural networks. We investigate multiple stochastic gradient algorithms, such as REINFORCE and actor-critic, with auxiliary tasks for faster convergence.
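
To make the policy gradient idea concrete, the following is a minimal sketch of REINFORCE with a running-mean baseline. It is not the authors' method: it assumes a one-shot matrix version of the stag hunt rather than the time-extended game, a single-logit policy instead of a deep network, and no auxiliary tasks. The payoff values, the `partner_commitment` parameter (the probability that the partner hunts the stag), and the function name `train` are all hypothetical and chosen only for illustration.

```python
import math
import random

# Hypothetical one-shot payoffs for the learner (illustrative, not from the paper).
PAYOFF = {
    ("stag", "stag"): 4.0,  # both cooperate: highest return
    ("stag", "hare"): 0.0,  # learner cooperates alone: nothing
    ("hare", "stag"): 3.0,  # safe option, independent of the partner
    ("hare", "hare"): 3.0,
}


def train(partner_commitment, episodes=5000, lr=0.05, seed=0):
    """Run REINFORCE and return the learned probability of hunting the stag."""
    rng = random.Random(seed)
    theta = 0.0     # single policy logit: P(stag) = sigmoid(theta)
    baseline = 0.0  # running mean of rewards, used to reduce variance
    for _ in range(episodes):
        p_stag = 1.0 / (1.0 + math.exp(-theta))
        action = "stag" if rng.random() < p_stag else "hare"
        partner = "stag" if rng.random() < partner_commitment else "hare"
        reward = PAYOFF[(action, partner)]
        # Gradient of log pi(action) w.r.t. theta for a Bernoulli policy.
        grad_log_pi = (1.0 - p_stag) if action == "stag" else -p_stag
        # REINFORCE update: step along advantage * grad log pi.
        theta += lr * (reward - baseline) * grad_log_pi
        # Update the baseline only after it has been used for this step.
        baseline += 0.1 * (reward - baseline)
    return 1.0 / (1.0 + math.exp(-theta))


if __name__ == "__main__":
    for commitment in (0.0, 1.0):
        print(commitment, round(train(commitment), 3))
```

Under these assumed payoffs, hunting the stag only pays off against a committed partner, so the learned probability of choosing the stag should rise when `partner_commitment` is 1.0 and fall when it is 0.0.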

Keywords

deep reinforcement learning, social dilemmas, policy gradient
