Page Object Detection in Vietnamese Document Images with Novel Approach

2022 9th NAFOSTED Conference on Information and Computer Science (NICS)(2022)

引用 0|浏览3
暂无评分
摘要
We witnessed the rising popularity of Vietnamese documents on online platforms. Digitized Vietnamese documents (e.g., administrative text, scientific papers, textbooks, etc.) are available online. As a result, we need algorithms that can understand documents. Vietnamese is one of the most difficult languages with the Latin alphabet with additional accent symbols and derivative characters. Moreover, we still struggle with challenges arising from external and internal factors. This requires a good enough detector model as the foundation for extracting information tasks. In this research, we address page object detection in Vietnamese document images. We explore the performance of the UIT-DODV-Ext dataset, the largest Vietnamese document image dataset that includes scientific papers and textbooks. Additionally, we leverage the state-of-the-art object detector and then propose CasGRoIENet to improve the performance of the UIT-DODV-Ext dataset. CasGRoIENet achieves 75.9% mAP which is 2.3% higher than state-of-the-art results.
更多
查看译文
关键词
Deep Learning,Document Image Analysis,Image Processing,Page Object Detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要