Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning
NIPS 2020, 2020.
We argue that coordinating the agents' policies can guide their exploration and we investigate techniques to promote such an inductive bias
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies....More
PPT (Upload PPT)