Automated curriculum generation for Policy Gradients from Demonstrations

Chevalier-Boisvert Maxime
Chevalier-Boisvert Maxime
Cited by: 0|Bibtex|Views74
Other Links: arxiv.org

Abstract:

In this paper, we present a technique that improves the process of training an agent (using RL) for instruction following. We develop a training curriculum that uses a nominal number of expert demonstrations and trains the agent in a manner that draws parallels from one of the ways in which humans learn to perform complex tasks, i.e by ...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments