Imitation with Neural Density Models

Kuno Kim, Akshat Jindal, Yang Song, Jiaming Song, Yanan Sui, Stefano Ermon

Annual Conference on Neural Information Processing Systems (2021)

Abstract

We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward. Our approach maximizes a non-adversarial model-free RL objective that provably lower bounds reverse Kullback-Leibler divergence between occupancy measures of the expert and imitator. We present a practical IL algorithm, Neural Density Imitation (NDI), which obtains state-of-the-art demonstration efficiency on benchmark control tasks.
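The first stage described in the abstract, estimating the expert's density and using it as a reward, can be illustrated with a minimal sketch. This is not the authors' NDI implementation: it swaps in a simple Gaussian kernel density estimate for the neural density model, omits the Maximum Occupancy Entropy RL stage entirely, and uses hypothetical names (`fit_log_density`, `expert_sa`, `bandwidth`) and synthetic data chosen only for illustration.

```python
import numpy as np


def fit_log_density(expert_sa, bandwidth=0.5):
    """Return a function giving the log of a Gaussian KDE fit to expert samples.

    Assumption: a KDE stands in for the neural density model named in the
    abstract; `expert_sa` is an (n, d) array of expert state-action vectors.
    """
    expert_sa = np.asarray(expert_sa, dtype=float)
    n, d = expert_sa.shape
    # Log of the kernel normalizer and the 1/n mixture weight.
    log_norm = -0.5 * d * np.log(2.0 * np.pi * bandwidth**2) - np.log(n)

    def log_density(query):
        query = np.atleast_2d(np.asarray(query, dtype=float))
        # Squared distance from every query point to every expert sample.
        diff = query[:, None, :] - expert_sa[None, :, :]
        neg_sq = -np.sum(diff**2, axis=-1) / (2.0 * bandwidth**2)
        # Numerically stable log-sum-exp over the expert samples.
        m = np.max(neg_sq, axis=1, keepdims=True)
        lse = m[:, 0] + np.log(np.sum(np.exp(neg_sq - m), axis=1))
        return lse + log_norm

    return log_density


# Synthetic "expert" data (hypothetical): 500 state-action vectors in 3-D.
rng = np.random.default_rng(0)
expert_sa = rng.normal(loc=0.0, scale=1.0, size=(500, 3))

# The estimated log-density serves as a fixed reward signal for the imitator.
reward_fn = fit_log_density(expert_sa)
near = reward_fn(np.zeros(3))[0]       # point inside the expert's support
far = reward_fn(np.full(3, 6.0))[0]    # point far from the expert's support
```

In this sketch, state-action pairs close to the expert data receive a higher reward than pairs far from it. Because the reward is fit once and then held fixed, no discriminator is trained against the imitator, which is the sense in which such an objective is non-adversarial.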

Keywords

models, density
