Regularized Hierarchical Policies for Compositional Transfer in Robotics.

Markus Wulfmeier,Abbas Abdolmaleki,Roland Hafner,Jost Tobias Springenberg,Michael Neunert,Tim Hertweck,Thomas Lampe,Noah Siegel,Nicolas Heess,Martin A. Riedmiller

CoRR（2019）

引用 27|浏览212

暂无评分

摘要

The successful application of flexible, general learning algorithms -- such as deep reinforcement learning -- to real-world robotics applications is often limited by their poor data-efficiency. Domains with more than a single dominant task of interest encourage algorithms that share partial solutions across tasks to limit the required experiment time. We develop and investigate simple hierarchical inductive biases -- in the form of structured policies -- as a mechanism for knowledge transfer across tasks in reinforcement learning (RL). To leverage the power of these structured policies we design an RL algorithm that enables stable and fast learning. We demonstrate the success of our method both in simulated robot environments (using locomotion and manipulation domains) as well as real robot experiments, demonstrating substantially better data-efficiency than competitive baselines.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要