Deterministic Value-Policy Gradients


Abstract:

Reinforcement learning algorithms such as the deep deterministic policy gradient algorithm (DDPG) have been widely used in continuous control tasks. However, the model-free DDPG algorithm suffers from high sample complexity. In this paper we consider deterministic value gradients to improve the sample efficiency of deep reinforcement...
