SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies

Seyed Kamyar Seyed Ghasemipour
Seyed Kamyar Seyed Ghasemipour

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), pp. 7879-7889, 2019.

Cited by: 2|Bibtex|Views70
EI
Other Links: academic.microsoft.com|dblp.uni-trier.de

Abstract:

Imitation Learning (IL) has been successfully applied to complex sequential decision-making problems where standard Reinforcement Learning (RL) algorithms fail. A number of recent methods extend IL to few-shot learning scenarios, where a meta-trained policy learns to quickly master new tasks using limited demonstrations. However, although...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments