OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene Grounding
arXiv (Cornell University)(2021)
Key words
Visual Question Answering,Continuous Recognition,Gesture Recognition,Image Captioning
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined