Constrained episodic reinforcement learning in concave-convex and knapsack settings

Brantley Kianté,Dudik Miroslav,Lykouris Thodoris,Miryoosefi Sobhan,Simchowitz Max,Slivkins Aleksandrs,Sun Wen

NIPS 2020（2020）

引用 47|浏览659

暂无评分

摘要

We propose an algorithm for tabular episodic reinforcement learning with constraints. We provide a modular analysis with strong theoretical guarantees for settings with concave rewards and convex constraints, and for settings with hard constraints (knapsacks). Most of the previous work in constrained reinforcement learning is limited to linear constraints, and the remaining work focuses on either the feasibility question or settings with a single episode. Our experiments demonstrate that the proposed algorithm significantly outperforms these approaches in existing constrained episodic environments.

查看译文

关键词

constrained episodic reinforcement learning,concave-convex

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要