Unsupervised Object Discovery and Tracking in Video Collections

Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid

2015 IEEE International Conference on Computer Vision (ICCV), 2015

Abstract

This paper addresses the problem of automatically localizing dominant objects as spatio-temporal tubes in a noisy collection of videos with minimal or even no supervision. We formulate the problem as a combination of two complementary processes: discovery and tracking. The first one establishes correspondences between prominent regions across videos, and the second one associates successive similar object regions within the same video. Interestingly, our algorithm also discovers the implicit topology of frames associated with instances of the same object class across different videos, a role normally left to supervisory information in the form of class labels in conventional image and video understanding methods. Indeed, as demonstrated by our experiments, our method can handle video collections featuring multiple object classes, and substantially outperforms the state of the art in colocalization, even though it tackles a broader problem with much less supervision.
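To make the tracking half of this description concrete, here is a minimal sketch of linking per-frame object regions into a spatio-temporal tube. It is not the paper's algorithm: it assumes candidate boxes are already given for each frame, uses plain intersection-over-union (IoU) as the similarity between successive regions, and links greedily. The names `iou` and `link_tube` and the `(x1, y1, x2, y2)` box format are hypothetical choices made for illustration.

```python
from typing import List, Tuple

# Axis-aligned box as (x1, y1, x2, y2); this format is an assumption.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tube(frames: List[List[Box]], start_index: int = 0) -> List[Box]:
    """Greedily link one box per frame into a tube.

    Starts from frames[0][start_index] and, in each following frame,
    picks the candidate with the highest IoU against the previous box.
    Stops early if a frame has no candidates.
    """
    if not frames or not frames[0]:
        return []
    tube = [frames[0][start_index]]
    for candidates in frames[1:]:
        if not candidates:
            break
        tube.append(max(candidates, key=lambda c: iou(tube[-1], c)))
    return tube


# Toy input: one object drifting right and down, plus a distractor box.
frames = [
    [(0.0, 0.0, 10.0, 10.0)],
    [(1.0, 1.0, 11.0, 11.0), (50.0, 50.0, 60.0, 60.0)],
    [(2.0, 2.0, 12.0, 12.0), (48.0, 48.0, 58.0, 58.0)],
]
tube = link_tube(frames)
```

On this toy input the tube follows the drifting box and ignores the distractor, since the distractor never overlaps the previous region. A real system would also need appearance and motion cues, and the discovery step that matches regions across videos is not covered here.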

Keywords

unsupervised object discovery, object tracking, automatic dominant objects localization, spatio-temporal tubes, video noisy collection, similar object regions, implicit topology, object class, class labels, image understanding methods, video understanding methods
