Image Segmentation and Labeling Using Free-Form Semantic Annotation

Pattern Recognition(2014)

引用 5|浏览21
暂无评分
摘要
In this paper we investigate the problem of segmenting images using the information in text annotations. In contrast to the general image understanding problem, this type of annotation guided segmentation is less ill-posed in the sense that for the output there is higher consensus among human annotations. In the paper we present a system based on a combined visual and semantic pipeline. In the visual pipeline, a list of tentative figure-ground segmentations is first proposed. Each such segmentation is classified into a set of visual categories. In the natural language processing pipeline, the text is parsed and chunked into objects. Each chunk is then compared with the visual categories and the relative distance is computed using the word-net structure. The final choice of segments and their correspondence to the chunked objects are then obtained using combinatorial optimization. The output is compared to manually annotated ground-truth images. The results are promising and there are several interesting avenues for continued research.
更多
查看译文
关键词
combinatorial mathematics,image segmentation,natural language processing,optimisation,text analysis,annotation guided segmentation,combinatorial optimization,free-form semantic annotation,human annotations,image labeling,image understanding problem,manually annotated ground-truth images,natural language processing pipeline,semantic pipeline,tentative figure-ground segmentations,text annotations,visual categories,visual pipeline,word-net structure
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要