End-to-end weakly-supervised semantic alignment

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition(2018)

引用 184|浏览129
暂无评分
摘要
We tackle the task of semantic alignment where the goal is to compute dense semantic correspondence aligning two images depicting objects of the same category. This is a challenging task due to large intra-class variation, changes in viewpoint and background clutter. We present the following three principal contributions. First, we develop a convolutional neural network architecture for semantic alignment that is trainable in an end-to-end manner from weak image-level supervision in the form of matching image pairs. The outcome is that parameters are learnt from rich appearance variation present in different but semantically related images without the need for tedious manual annotation of correspondences at training time. Second, the main component of this architecture is a differentiable soft inlier scoring module, inspired by the RANSAC inlier scoring procedure, that computes the quality of the alignment based on only geometrically consistent correspondences thereby reducing the effect of background clutter. Third, we demonstrate that the proposed approach achieves state-of-the-art performance on multiple standard benchmarks for semantic alignment.
更多
查看译文
关键词
background clutter,convolutional neural network architecture,weak image-level supervision,intra-class variation,end-to-end weakly-supervised semantic alignment,matching image pairs,RANSAC inlier scoring procedure,differentiable soft inlier scoring module
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要