A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Abstract:
A fundamental challenge in multiagent reinforcement learning is to learn beneficial behaviors in a shared environment with other agents that are also simultaneously learning. In particular, each agent perceives the environment as effectively non-stationary due to the changing policies of other agents. Moreover, each agent is itself cons...More
Code:
Data:
Full Text
Tags
Comments