Curiosity And Boredom Based On Prediction Error As Novel Internal Rewards

BRAIN-INSPIRED INFORMATION TECHNOLOGY（2010）

引用 5|浏览4

暂无评分

摘要

In this paper, the use of two internal reward models, curiosity and boredom, is proposed. Experiments on a maze navigation task demonstrated that appropriate values of parameters simultaneously improved the performance of the predictor of the environment and increase the external rewards compared with the conventional reinforcement learning. In conclusions, the relation between the proposed method and active learning, diversive curiosity, and specific curiosity is also discussed.

查看译文

关键词

active learning,reinforcement learning,prediction error

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要