谷歌浏览器插件
订阅小程序
在清言上使用

Optimal Markov Policies for Finite-Horizon Constrained MDPs with Combined Additive and Multiplicative Utilities.

IEEE Control Systems Letters(2023)

引用 0|浏览14
暂无评分
摘要
This letter considers the problem of optimizing a finite-horizon constrained Markov decision process (CMDP) where the objective and constraints are sums of additive and multiplicative utilities. To solve this, we construct another CMDP with only additive utilities whose optimal value over a restricted set of policies is equal to that of the original CMDP. Further, we provide a finite-dimensional bilinear program (BLP) whose value equals the CMDP value and whose solution provides the optimal policy. We also suggest an algorithm to solve the proposed BLP.
更多
查看译文
关键词
Bilinear program,Markov decision processes,Markov policies,Optimal control,utilities
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要