Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts

2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2021)

Cited 6 | Views 112
Keywords
Bayesian reinforcement learning, clairvoyant experts, decision making, Markov decision processes, Bayes-optimality, state space, action space, MDP, baseline policy, policy gradient methods, task-specific expert skills, Bayesian residual policy optimization, BRPO, robots