Learning to Follow Language Instructions with Adversarial Reward Induction
arXiv: Artificial Intelligence, Volume abs/1806.01946, 2018.
EI
Abstract:
Recent work has shown that deep reinforcement-learning agents can learn to follow language-like instructions from infrequent environment rewards. However, for many real-world natural language commands that involve a degree of underspecification or ambiguity, such as tidy the room, it would be challenging or impossible to program an approp...More
Code:
Data:
Tags
Comments