Learning Options For An Mdp From Demonstrations

Marco Tamassia,Fabio Zambetta,William Raffe,Xiaodong Li

Lecture Notes in Computer Science（2015）

引用 4|浏览29

暂无评分

摘要

The options framework provides a foundation to use hierarchical actions in reinforcement learning. An agent using options, along with primitive actions, at any point in time can decide to perform a macro-action made out of many primitive actions rather than a primitive action. Such macro-actions can be hand-crafted or learned. There has been previous work on learning them by exploring the environment. Here we take a different perspective and present an approach to learn options from a set of experts demonstrations. Empirical results are also presented in a similar setting to the one used in other works in this area.

查看译文

关键词

reinforcement learning,options

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要