Sparse Reinforcement Learning via Convex Optimization.

Zhiwei Qin,Weichang Li,Firdaus Janoos

ICML'14: Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32（2014）

引用 33|浏览34

暂无评分

摘要

We propose two new algorithms for the sparse reinforcement learning problem based on different formulations. The first algorithm is an off-line method based on the alternating direction method of multipliers for solving a constrained formulation that explicitly controls the projected Bellman residual. The second algorithm is an online stochastic approximation algorithm that employs the regularized dual averaging technique, using the Lagrangian formulation. The convergence of both algorithms are established. We demonstrate the performance of these algorithms through several classical examples.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要