Automated curriculum generation for Policy Gradients from Demonstrations
Abstract:
In this paper, we present a technique that improves the training of a reinforcement learning (RL) agent for instruction following. We develop a training curriculum that uses a nominal number of expert demonstrations and trains the agent in a manner that parallels one of the ways in which humans learn to perform complex tasks, i.e., by ...