Unsupervised Object Discovery And Localization In Images And Videos

Minsu Cho,Suha Kwak,Ivan Laptev,Cordelia Schmid,Jean Ponce

2015 12th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI)（2015）

引用 6|浏览119

暂无评分

摘要

This paper addresses unsupervised discovery and localization of dominant objects from a noisy collection of images or videos. The setting of this problem is fully unsupervised, without even class labels or any assumption of a single dominant class, and thus far more general than those of typical colocalization or weakly-supervised localization tasks. Interestingly, our approach also discovers the topology of images/frames associated with instances of the same object class, a role normally left to supervisory information in the form of class labels in conventional image and video understanding methods.We tackle the discovery and localization problem using a part-based region matching approach: Off-the-shelf region proposals are extracted to form a set of candidate bounding boxes for objects and object parts, and these regions are effectively matched across images/frames. For each image/frame, a dominant object is localized by comparing the scores of candidate regions and selecting those that stand out over other regions containing them. Given a video collection, we also associate similar object regions along consecutive frames within the same video, thus achieving unsupervised tracking. Extensive experimental evaluations on standard benchmarks demonstrate that the proposed approach substantially outperforms the current state of the art in colocalization, and achieves robust object discovery in challenging mixed-class datasets.

查看译文

关键词

unsupervised learning,object discovery,object localization,image matching,object tracking

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要