Bayesian optimistic Kullback–Leibler exploration

Machine Learning, pp. 1-19, 2018.

Abstract:

We consider a Bayesian approach to model-based reinforcement learning, where the agent uses a distribution of environment models to find the action that optimally trades off exploration and exploitation. Unfortunately, it is intractable to find the Bayes-optimal solution to the problem except for restricted cases. In this paper, we presen...
