EK-Net:Real-time Scene Text Detection with Expand Kernel Distance
CoRR(2024)
摘要
Recently, scene text detection has received significant attention due to its
wide application. However, accurate detection in complex scenes of multiple
scales, orientations, and curvature remains a challenge. Numerous detection
methods adopt the Vatti clipping (VC) algorithm for multiple-instance training
to address the issue of arbitrary-shaped text. Yet we identify several bias
results from these approaches called the "shrinked kernel". Specifically, it
refers to a decrease in accuracy resulting from an output that overly favors
the text kernel. In this paper, we propose a new approach named Expand Kernel
Network (EK-Net) with expand kernel distance to compensate for the previous
deficiency, which includes three-stages regression to complete instance
detection. Moreover, EK-Net not only realize the precise positioning of
arbitrary-shaped text, but also achieve a trade-off between performance and
speed. Evaluation results demonstrate that EK-Net achieves state-of-the-art or
competitive performance compared to other advanced methods, e.g., F-measure of
85.72
更多查看译文
关键词
Scene Text Detection,Arbitrary Shapes,Real-Time,Three-stages Regression,Expand Kernel Distance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要