Combating False Negatives in Adversarial Imitation Learning
Abstract:
In adversarial imitation learning, a discriminator is trained to differentiate agent episodes from expert demonstrations representing the desired behavior. However, as the trained policy learns to be more successful, the negative examples (the ones produced by the agent) become increasingly similar to expert ones. Despite the fact that ...More
Code:
Data:
Full Text
Tags
Comments