Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Vitchyr Pong
Vitchyr Pong
Murtaza Dalal
Murtaza Dalal

ICLR, Volume abs/1802.09081, 2018.

Cited by: 70|Bibtex|Views48
EI
Other Links: dblp.uni-trier.de|academic.microsoft.com|arxiv.org

Abstract:

Model-free reinforcement learning (RL) has been proven to be a powerful, general tool for learning complex behaviors. However, its sample efficiency is often impractically large for solving challenging real-world problems, even for off-policy algorithms such as Q-learning. A limiting factor in classic model-free RL is that the learning si...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments