Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles

Tim Seyde
Tim Seyde
Wilko Schwarting
Wilko Schwarting
Cited by: 0|Bibtex|Views14
Other Links: arxiv.org

Abstract:

Learning complex behaviors through interaction requires coordinated long-term planning. Random exploration and novelty search lack task-centric guidance and waste effort on non-informative interactions. Instead, decision making should target samples with the potential to optimize performance far into the future, while only reducing unce...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments