Proximal Gradient Temporal Difference Learning Algorithms
IJCAI, pp. 4195-4199, 2016.
In this paper, we describe proximal gradient temporal difference learning, which provides a principled way for designing and analyzing true stochastic gradient temporal difference learning algorithms. We show how gradient TD (GTD) reinforcement learning methods can be formally derived, not with respect to their original objective function...