Segmentation-Free Alignment of Arbitrary Symbol Transcripts to Images.

Pau Torras,Mohamed Ali Souibgui,Jialuo Chen,Sanket Biswas,Alicia Fornés

ICDAR Workshops (1)（2023）

引用 0|浏览5

暂无评分

摘要

Developing arbitrary symbol recognition systems is a challenging endeavour. Even using content-agnostic architectures such as few-shot models, performance can be substantially improved by providing a number of well-annotated examples into training. In some contexts, transcripts of the symbols are available without any position information associated to them, which enables using line-level recognition architectures. A way of providing this position information to detection-based architectures is finding systems that can align the input symbols with the transcription. In this paper we discuss some symbol alignment techniques that are suitable for low-data scenarios and provide an insight on their perceived strengths and weaknesses. In particular, we study the usage of Connectionist Temporal Classification models, Attention-Based Sequence to Sequence models and we compare them with the results obtained on a few-shot recognition system.

查看译文

关键词

arbitrary symbol transcripts,images,segmentation-free

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要