Proximal Gradient Temporal Difference Learning Algorithms

IJCAI, pp. 4195-4199, 2016.

Cited by: 11|Views17
EI

Abstract:

In this paper, we describe proximal gradient temporal difference learning, which provides a principled way for designing and analyzing true stochastic gradient temporal difference learning algorithms. We show how gradient TD (GTD) reinforcement learning methods can be formally derived, not with respect to their original objective function...More

Code:

Data:

Your rating :
0

 

Tags
Comments