Video Text Detection With Text Edges And Convolutional Neural Network

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)（2015）

引用 4|浏览46

暂无评分

摘要

Text and captions in videos provide useful information for content analysis and understanding. In this paper, we present an approach to detecting video text in a coarse-to-fine strategy. In the coarse phase we propose an efficient method to detect multi-scale candidate text regions with high recall. Then the candidate text regions are segmented and sent to the fine phase where a convolutional neural network(CNN) is applied to generate a confidence map for each candidate text region. Finally, the candidate text regions are further refined and partitioned into text lines by projection analysis. The CNN classifier in the fine phase enables feature sharing and robustly identifies text regions. The coarse phase sharply reduce the number of windows needed to be scanned by the CNN. The combination endows the proposed method with both efficiency and robustness when detecting video text. It was verified by experiment results on two publicly testing datasets and a dataset created by us.

查看译文

关键词

content analysis,coarse-to-fine strategy,multiscale candidate text region,projection analysis,feature sharing,CNN classifier,convolutional neural network,text edge,video text detection

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要