Object-Focused Advice In Reinforcement Learning

AAMAS '16: Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems(2016)

引用 2|浏览99
暂无评分
摘要
In order for robots and intelligent agents to interact with and learn from people with no machine-learning expertise, robots should be able to learn from natural human instruction. Many human explanations consist of simple sentences without state information, yet most machine learning techniques that incorporate human guidance cannot use nonspecific explanations. This work aims to learn policies from a few sentences that aren't state specific. The proposed Object-focused advice links an object to an action, and allows a person to generalize over an object's state space. To evaluate this technique, agents were trained using Objectfocused advice collected from participants in an experiment in the Mario Bros. domain. The results show that Objectfocused advice performs better than when no advice is given, the agent can learn where to apply the advice in the state space, and the agent can recover from adversarial advice. Also, including warnings of what not do to in addition to advice of what actions to take improves performance.
更多
查看译文
关键词
Advice,Reinforcement Learning,Human Teachers
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要