Learning to Follow Language Instructions with Adversarial Reward Induction
arXiv: Artificial Intelligence, Volume abs/1806.01946, 2018.
Recent work has shown that deep reinforcement-learning agents can learn to follow language-like instructions from infrequent environment rewards. However, for many real-world natural language commands that involve a degree of underspecification or ambiguity, such as "tidy the room", it would be challenging or impossible to program an approp...