Multi-Task Policy Search For Robotics

ICRA (2014)

Abstract
Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training an individual policy for every potential task is often impractical, especially when tasks vary continuously, which calls for more principled approaches to sharing and transferring knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows a single policy to be learned that generalizes across multiple known and unknown tasks. Applications of our approach to reinforcement learning and imitation learning are shown in real-robot experiments.
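To make the key idea concrete, the following is a minimal sketch of a policy that takes both the state and the task as input. It is not the paper's implementation: the radial basis function form, the function name `rbf_policy`, and all parameter values are assumptions chosen for illustration only.

```python
import math


def rbf_policy(state, task, centers, weights, length_scale=1.0):
    """Return a scalar control u = pi(state, task).

    Hypothetical sketch: a radial basis function network evaluated on
    the augmented input [state, task], so one parameter set serves
    every task.
    """
    z = list(state) + list(task)  # augmented input: state and task together
    u = 0.0
    for center, weight in zip(centers, weights):
        sq_dist = sum((zi - ci) ** 2 for zi, ci in zip(z, center))
        u += weight * math.exp(-0.5 * sq_dist / length_scale ** 2)
    return u


# Illustrative parameters (assumed): 1-D state, 1-D task such as a target position.
centers = [(0.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
weights = [0.5, -1.0, 2.0]

# Same state, two different tasks: the single policy produces different controls.
u_task_a = rbf_policy([0.0], [0.0], centers, weights)
u_task_b = rbf_policy([0.0], [1.0], centers, weights)
```

Because the task enters as an ordinary input, the same function can also be evaluated at a task value that was never used when choosing the weights, which is the sense in which such a policy may generalize to unknown tasks.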
Keywords
feedback, intelligent robots, learning (artificial intelligence), learning systems, nonlinear control systems, continuous task variations, imitation learning, individual policy training, knowledge transfer, multitask policy search, nonlinear feedback policy, reinforcement learning, robotics