TDNCIB: Target-Driven Navigation with Convolutional Information Branch Based on Deep Reinforcement Learning

Xinzhou Zhao,Tian Wang,Kexin Liu,Mengyi Zhang,Peng Shi,Hichem Snoussi

2021 China Automation Congress (CAC)（2021）

引用 0|浏览0

暂无评分

摘要

Target-driven navigation has been a significant application field of reinforcement learning. However, sparse reward and lack of useful environment information are still existing problems. Therefore, in this paper, we propose a novel design to the target-driven visual navigation based on deep reinforcement learning: Target-Driven Navigation with Convolutional Information Branch (TDNCIB), which enriches the high-dimensional information learned by the network from the environment. It is an improvement of A2C algorithm. CIB can help make full use of environmental information and serve as a complement to the baseline. Besides, a novel reward design using similarity is introduced to relieve sparse reward. The ablation study is implemented in 4 different scenes of AI2-THOR framework comparing the baseline and our TDNCIB. According to our evaluation, TDNCIB converges faster than the baseline and achieves better performance in 5 common metrics with shorter trajectory length, less collision and higher success rate.

查看译文

关键词

deep reinforcement learning,sparse reward,target-driven visual navigation,TDNCIB,high-dimensional information,environmental information,convolutional information branch,A2C algorithm,AI2-THOR framework

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要