Distributed Lifelong Reinforcement Learning With Sub-Linear Regret

Rasul Tutunov,Julia El-Zini,Haitham Bou-Ammar,Ali Jadbabaie

2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC)（2017）

引用 25|浏览80

暂无评分

摘要

In this paper we propose a distributed second-order method for lifelong reinforcement learning (LRL). Upon observing a new task, our algorithm scales state-of-the-art LRL by approximating the Newton direction up-to-any arbitrary precision epsilon > 0, while guaranteeing accurate solutions. We analyze the theoretical properties of this new method and derive, for the first time to the best of our knowledge, sub-linear regret under this setting

查看译文

关键词

Newton direction,sublinear regret,distributed lifelong reinforcement learning,distributed second-order method

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要