Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation.
arXiv (Cornell University)(2021)
Key words
deep learning,complex problems,visual sensory inputs,natural language instructions,navigation graph,discrete action space,complex VLN setting,continuous 3D reconstructed environments,world navigation,Robo-VLN tasks,longer trajectory lengths,continuous action,state-of-the-art works,discrete VLN,hierarchical cross-modal agent,robotics vision-and-language navigation,HCM
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined