Sample Efficient Actor-Critic with Experience Replay.

Ziyu Wang,Victor Bapst,Nicolas Heess,Volodymyr Mnih,Rémi Munos,Koray Kavukcuoglu,Nando de Freitas

international conference on learning representations（2017）

引用 28|浏览371

暂无评分

摘要

This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochastic dueling network architectures, and a new trust region policy optimization method.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要