Simplified Online Q-Learning For Lego Ev3 Robot

PROCEEDINGS 5TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2015)(2015)

引用 4|浏览2
暂无评分
摘要
Q-learning is a kind of model-free reinforcement learning algorithm which is effective in Robot's navigation applications. Unfortunately, Lego Mindstorms EV3 robot's file writing speed is sometimes too slow to implement Q-learning algorithm. In this paper, an approach is proposed to simplify Q-learning discrete value table into a new version that stores only one optimum action and its Q-value instead of storing every action's Q-value in each state. Exploration and contrast experiments show that our algorithm learns much faster than the original Q-learning without losing the ability to find a better policy in navigation task.
更多
查看译文
关键词
simplified online Q-learning,LEGO EV3 robot,model-free reinforcement learning algorithm,robot navigation applications,Lego Mindstorms EV3 robot,file writing speed,Q-learning discrete value table,Q-value,navigation task
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要