Blending Autonomous Exploration and Apprenticeship Learning.

NIPS (2011)

Abstract
We present theoretical and empirical results for a framework that combines the benefits of apprenticeship and autonomous reinforcement learning. Our approach modifies an existing apprenticeship learning framework that relies on teacher demonstrations and does not necessarily explore the environment. The first change is replacing previously used Mistake Bound model learners with a recently proposed framework that melds the KWIK and Mistake Bound supervised learning protocols. The second change is introducing a communication of expected utility from the student to the teacher. The resulting system only uses teacher traces when the agent needs to learn concepts it cannot efficiently learn on its own.
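The two supervised learning protocols named in the abstract can be contrasted with a small sketch. This is not the paper's algorithm. It is a toy illustration, assuming a deterministic target over a small finite input set. It relies on the standard definitions: a KWIK learner must either predict correctly or admit "I don't know", while a Mistake Bound learner always predicts and is charged for errors. The class names, the `run` helper, and the parity target are all hypothetical.

```python
from typing import Dict, Hashable, Iterable, Optional, Tuple


class MemorizingKWIKLearner:
    """Toy KWIK-style learner: predicts only for inputs it has already seen."""

    def __init__(self) -> None:
        self.table: Dict[Hashable, int] = {}
        self.dont_know_count = 0

    def predict(self, x: Hashable) -> Optional[int]:
        if x in self.table:
            return self.table[x]
        # Unseen input: admit ignorance (None stands in for "I don't know").
        self.dont_know_count += 1
        return None

    def observe(self, x: Hashable, y: int) -> None:
        self.table[x] = y


class DefaultMistakeBoundLearner:
    """Toy Mistake Bound-style learner: always predicts, counts its errors."""

    def __init__(self, default: int = 0) -> None:
        self.table: Dict[Hashable, int] = {}
        self.default = default
        self.mistake_count = 0

    def predict(self, x: Hashable) -> int:
        return self.table.get(x, self.default)

    def observe(self, x: Hashable, y: int) -> None:
        # Charge a mistake if the current prediction disagrees with the label.
        if self.predict(x) != y:
            self.mistake_count += 1
        self.table[x] = y


def run(learner, stream: Iterable[Tuple[Hashable, int]]):
    """Feed (input, label) pairs to a learner: predict first, then observe."""
    for x, y in stream:
        learner.predict(x)
        learner.observe(x, y)
    return learner


# Hypothetical target: parity of x, with two repeated inputs at the end.
stream = [(0, 0), (1, 1), (2, 0), (3, 1), (1, 1), (0, 0)]
kwik = run(MemorizingKWIKLearner(), stream)
mb = run(DefaultMistakeBoundLearner(default=0), stream)
print(kwik.dont_know_count, mb.mistake_count)
```

On this stream the KWIK learner never errs but asks for a label on every new input, while the Mistake Bound learner never asks but errs whenever its default guess is wrong. A hybrid protocol of the kind the abstract mentions permits a bounded number of both outcomes.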