CARL: Control-Augmented Reinforcement Learning for Decision-Making with Stochastic Factors

Sinong Geng,Houssam Nassif,Zhaobin Kuang,Max Reppen,Ronnie Sircar

semanticscholar（2022）

引用 0|浏览3

暂无评分

摘要

We study finite-horizon dynamic decision-making in the presence of uncontrolled system variables, referred to as stochastic factors. Existing solutions fall into two distinctive categories: (i) Reinforcement learning employs end-to-end policy learning with flexible factor representation, but does not precisely model the dynamics of the problem; (ii) control-based methods, in contrast, take advantage of explicitly modeled dynamics via model fitting (calibration) but is constrained on factor representation. We propose CARL (controlaugmented RL), a decision-making framework that interpolates between the two extremes by simultaneously conducting policy learning and model fitting. We prove the convergence of CARL and provide performance guarantees via a finitesample error bound. On three diverse tasks ranging from linear-quadratic control, portfolio optimization, and epidemic spread management, we observe that CARL can outperform four leading methods for these tasks.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要