Policy Transfer with Strategy Optimization
international conference on learning representations, 2019.
Computer simulation provides an automatic and safe way for training robotic control policies to achieve complex tasks such as locomotion. However, a policy trained in simulation usually does not transfer directly to the real hardware due to the differences between the two environments. Transfer learning using domain randomization is a pro...More
PPT (Upload PPT)