Safe Exploration in Finite Markov Decision Processes with Gaussian Processes

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), pp. 4312-4320, 2016.

Cited by: 78|Bibtex|Views25
EI
Other Links: dblp.uni-trier.de|academic.microsoft.com|arxiv.org

Abstract:

In classical reinforcement learning agents accept arbitrary short term loss for long term gain when exploring their environment. This is infeasible for safety critical applications such as robotics, where even a single unsafe action may cause system failure or harm the environment. In this paper, we address the problem of safely exploring...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments