Randomized Entity-Wise Factorization For Multi-Agent Reinforcement Learning

Shariq Iqbal,Christian Schroeder,Bei Peng,Wendelin Boehmer,Shimon Whiteson,Fei Sha

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139（2021）

引用 0|浏览342

暂无评分

摘要

Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: "What is the expected utility of each agent when only considering a randomly selected subgroup of its observed entities?" By posing this counterfactual question, we can recognize stateaction trajectories within sub-groups of entities that we may have encountered in another task and use what we learned in that task to inform our prediction in the current one. We then reconstruct a prediction of the full returns as a combination of factors considering these disjoint groups of entities and train this "randomly factorized" value function as an auxiliary objective for value-based multi-agent reinforcement learning. By doing so, our model can recognize and leverage similarities across tasks to improve learning efficiency in a multi-task setting. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REF IL ), outperforms all strong baselines by a significant margin in challenging multi-task StarCraft micromanagement settings.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要