Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

arXiv (2023)

Abstract
Humans are capable of abstracting various tasks as different combinations of multiple attributes. This perspective of compositionality is vital for rapid human learning and adaptation, since previous experience from related tasks can be combined to generalize to novel compositional settings. In this work, we aim to achieve zero-shot policy generalization of Reinforcement Learning (RL) agents by leveraging task compositionality. Our proposed method is a meta-RL algorithm with a disentangled task representation that explicitly encodes different aspects of the tasks. Policy generalization is then performed by inferring unseen compositional task representations via the obtained disentanglement, without extra exploration. The evaluation is conducted on three simulated tasks and a challenging real-world robotic insertion task. Experimental results demonstrate that the proposed method achieves policy generalization to unseen compositional tasks in a zero-shot manner.
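The abstract does not say how an unseen compositional task representation is assembled, so the following is only a minimal illustrative sketch, not the paper's method. It assumes that each task's latent code splits into one part per attribute, that a part can be estimated by averaging over seen tasks sharing that attribute value, and that parts are combined by concatenation. The attribute names (`shape`, `friction`), the latent values, and the helpers `attribute_prototypes` and `compose` are all hypothetical.

```python
from statistics import fmean

# Hypothetical latent codes inferred on *seen* tasks, already split per attribute.
# Each entry: (attribute values of the task, latent part for each attribute).
seen_tasks = [
    ({"shape": "round", "friction": "low"},
     {"shape": [1.0, 0.0], "friction": [0.2, 0.1]}),
    ({"shape": "square", "friction": "high"},
     {"shape": [0.0, 1.0], "friction": [0.9, 0.8]}),
    ({"shape": "round", "friction": "low"},
     {"shape": [0.8, 0.2], "friction": [0.4, 0.3]}),
]


def attribute_prototypes(tasks):
    """Average the latent part of every (attribute, value) pair over seen tasks."""
    buckets = {}
    for values, parts in tasks:
        for attr, value in values.items():
            buckets.setdefault((attr, value), []).append(parts[attr])
    # zip(*vecs) groups the vectors dimension by dimension before averaging.
    return {key: [fmean(dim) for dim in zip(*vecs)] for key, vecs in buckets.items()}


def compose(prototypes, values, order):
    """Concatenate per-attribute prototypes into one task representation."""
    z = []
    for attr in order:
        z.extend(prototypes[(attr, values[attr])])
    return z


prototypes = attribute_prototypes(seen_tasks)

# ("round", "high") never occurs together in seen_tasks, yet both attribute
# values were observed separately, so a representation can be composed
# without collecting any data on the new task.
z_unseen = compose(prototypes, {"shape": "round", "friction": "high"},
                   order=["shape", "friction"])
```

In a zero-shot setting, a vector such as `z_unseen` would be passed to a task-conditioned policy in place of a representation inferred from exploration on the new task. How the policy consumes it is outside the scope of this sketch.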
Keywords
compositional settings, disentangled task representation, human rapid learning, meta-reinforcement learning, meta-RL algorithm, multiple attributes, real-world robotic insertion task, reinforcement learning agents, simulated tasks, task compositionality, unseen compositional task representations, unseen compositional tasks, zero-shot manner, zero-shot policy generalization, zero-shot policy transfer