Curiosity And Boredom Based On Prediction Error As Novel Internal Rewards

BRAIN-INSPIRED INFORMATION TECHNOLOGY(2010)

引用 5|浏览4
暂无评分
摘要
In this paper, the use of two internal reward models, curiosity and boredom, is proposed. Experiments on a maze navigation task demonstrated that appropriate values of parameters simultaneously improved the performance of the predictor of the environment and increase the external rewards compared with the conventional reinforcement learning. In conclusions, the relation between the proposed method and active learning, diversive curiosity, and specific curiosity is also discussed.
更多
查看译文
关键词
active learning,reinforcement learning,prediction error
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要