Distributed Lifelong Reinforcement Learning With Sub-Linear Regret

2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC)(2017)

引用 25|浏览80
暂无评分
摘要
In this paper we propose a distributed second-order method for lifelong reinforcement learning (LRL). Upon observing a new task, our algorithm scales state-of-the-art LRL by approximating the Newton direction up-to-any arbitrary precision epsilon > 0, while guaranteeing accurate solutions. We analyze the theoretical properties of this new method and derive, for the first time to the best of our knowledge, sub-linear regret under this setting
更多
查看译文
关键词
Newton direction,sublinear regret,distributed lifelong reinforcement learning,distributed second-order method
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要