Deep Learning And Recurrent Connectionist-Based Approaches For Arabic Text Recognition In Videos

2015 13th International Conference on Document Analysis and Recognition (ICDAR)(2015)

引用 28|浏览50
暂无评分
摘要
This paper focuses on recognizing Arabic embedded text in videos. The proposed methods proceed without applying any prior pre-processing operations or character segmentation. Difficulties related to the video or text properties are faced using a learned robust representation of the input text image. This is performed using Convolutional Neural Networks and Deep Auto-Encoders. Features are computed using a multi-scale sliding window scheme. A connectionist recurrent approach is then used. It is trained to predict correct transcriptions of the input image from the associated sequence of features. Proposed methods are extensively evaluated on a large video database recorded from several Arabic TV channels.
更多
查看译文
关键词
recurrent connectionist,deep learning,video Arabic text recognition,Arabic embedded text,character segmentation,input text image,convolutional neural networks,deep autoencoders,multiscale sliding window scheme,associated sequence,video database,Arabic TV channels
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要