Learning to Understand Goal Specifications by Modelling Reward
arXiv: Artificial Intelligence, 2018.
Abstract:
Recent work has shown that deep reinforcement-learning agents can learn to follow language-like instructions from infrequent environment rewards. However, this places on environment designers the onus of designing language-conditional reward functions which may not be easily or tractably implemented as the complexity of the environment an...More
Code:
Data:
Full Text
Tags
Comments