Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games
Abstract:
This paper introduces an information-theoretic constraint on learned policy complexity in the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) reinforcement learning algorithm. Previous research with a related approach in continuous control experiments suggests that this method favors learning policies that are more robust to cha...More
Code:
Data:
Tags
Comments