Model-Based Reinforcement Learning via Meta-Policy Optimization
CoRL, pp. 617-629, 2018.
Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic performance as model-free methods. We propose Model-Based Meta-Policy-Optimization (MB-MPO), an approa...