Bayesian optimistic Kullback–Leibler exploration
Machine Learning, pp. 1-19, 2018.
We consider a Bayesian approach to model-based reinforcement learning, where the agent uses a distribution of environment models to find the action that optimally trades off exploration and exploitation. Unfortunately, it is intractable to find the Bayes-optimal solution to the problem except for restricted cases. In this paper, we presen...