Reducing the Annotation Effort for Video Object Segmentation Datasets
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3060-3069, 2020.
For further progress in video object segmentation (VOS), larger, more diverse, and more challenging datasets will be necessary. However, densely labeling every frame with pixel masks does not scale to large datasets. We use a deep convolutional network to automatically create pseudo-labels on a pixel level from much cheaper bounding box...More
PPT (Upload PPT)