A large-scale hierarchical multi-view RGB-D object dataset

Robotics and Automation (2011)

Abstract
Over the last decade, the availability of public image repositories and recognition benchmarks has enabled rapid progress in visual object category and instance detection. Today we are witnessing the birth of a new generation of sensing technologies capable of providing high quality synchronized videos of both color and depth, the RGB-D (Kinect-style) camera. With its advanced sensing capabilities and the potential for mass adoption, this technology represents an opportunity to dramatically increase robotic object recognition, manipulation, navigation, and interaction capabilities. In this paper, we introduce a large-scale, hierarchical multi-view object dataset collected using an RGB-D camera. The dataset contains 300 objects organized into 51 categories and has been made publicly available to the research community so as to enable rapid progress based on this promising technology. This paper describes the dataset collection procedure and introduces techniques for RGB-D based object recognition and detection, demonstrating that combining color and depth information substantially improves quality of results.
Keywords
image colour analysis, image sensors, object recognition, robot vision, video signal processing, RGB-D camera, dataset collection procedure, instance detection, large-scale hierarchical multiview RGB-D object dataset, public image recognition, public image repositories, robotic object recognition, visual object category