Fighting Copycat Agents in Behavioral Cloning from Observation Histories
NeurIPS, 2020.
EI
Abstract:
Imitation learning trains policies to map from input observations to the actions that an expert would choose. In this setting, distribution shift frequently exacerbates the effect of misattributing expert actions to nuisance correlates among the observed variables. We observe that a common instance of this causal confusion occurs in par...More
Code:
Data:
Full Text
Tags
Comments