CVC-UAB's Participation in the Flowchart Recognition Task of CLEF-IP 2012.

Marçal Rusiñol,Lluís-Pere de las Heras,Joan Mas,Oriol Ramos Terrades,Dimosthenis Karatzas,Anjan Dutta,Gemma Sánchez,Josep Lladós

CLEF (Online Working Notes/Labs/Workshop)（2012）

引用 26|浏览18

暂无评分

摘要

The aim of this document is to describe the methods we used in the owchart recognition task of the CLEF-IP 2012 track. The owchart recognition task consisted in interpreting owchart line- drawing images. The participants are asked to extract as much as struc- tural information in these images as possible and return it in a pre- dened textual format for further processing for the purpose of patent search. The Document Analysis Group from the Computer Vision Cen- ter (CVC-UAB) has been actively working on Graphics Recognition for over a decade. Our main aim in participating in the CLEF-IP owchart recognition task is to test our graphics recognition architectures on this type of graphics understanding problem. Our recognition system comprises a modular architecture where mod- ules tackle dierent steps of the owchart understanding problem. A text/graphic separation technique is applied to separate the textual el- ements from the graphical ones. An OCR engine is applied on the text layer while on the graphical layer identify with nodes and edges as well as their relationships. We have proposed two dierent families of node and edge segmentation modules. One dealing with the raw pixel data and another working in the vectorial domain. The locations of nodes identied are fed to the recognizer module which is in charge of catego- rizing the nodeu0027s type. We have proposed two dierent node descriptors for the recognizer module. The module analyzing the edges is analysing the connections between nodes and categorizes the edge style. Finally, a post-processing module is applied in order to correct some syntactic errors. We have submitted four dierent runs by combining the two variants of the segmentation module together with the two variants of the recogni- tion module.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要