Character Recognition in Japanese Historical Documents via Adaptive Multi-Region Model

Yueyu Wang, Sei-ichiro Kamata

2018 Joint 7th International Conference on Informatics, Electronics & Vision (ICIEV) and 2018 2nd International Conference on Imaging, Vision & Pattern Recognition (icIVPR) (2018)

Abstract

In this work, we introduce a novel model with an adaptive multi-region extraction network that captures multiple aspects of discriminative features, because the features inside a bounding box alone are insufficient for classification and conventional models are sensitive to inaccuracies in predicted bounding boxes. We use the new model to recognize Japanese characters in historical documents. The model can be trained end-to-end without any extra supervision. The resulting CNN-based representation is rich in features, combining contextual information with center-region information. These features are crucial for classification. Based on this model, we also propose a data augmentation method that uses both local and global data distortion to generate diverse samples and thereby address the problem of data imbalance. Experiments show that our model achieves better results on an ancient Japanese dataset.
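As a rough illustration of what combining center-region and contextual information around a predicted bounding box can look like, the sketch below crops several regions at different scales about the same box center. It is not the authors' network: the abstract describes regions that are adaptive and learned end-to-end, whereas the fixed scales here, the `(x0, y0, x1, y1)` box convention, and the helper names `scale_box` and `multi_region_crops` are all assumptions made for illustration.

```python
import numpy as np


def scale_box(box, scale, img_h, img_w):
    """Scale an (x0, y0, x1, y1) box about its center and clip it to the image.

    The box convention and this helper are illustrative assumptions,
    not taken from the paper.
    """
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2.0, (y0 + y1) / 2.0
    half_w = (x1 - x0) * scale / 2.0
    half_h = (y1 - y0) * scale / 2.0
    nx0 = int(max(0, np.floor(cx - half_w)))
    ny0 = int(max(0, np.floor(cy - half_h)))
    nx1 = int(min(img_w, np.ceil(cx + half_w)))
    ny1 = int(min(img_h, np.ceil(cy + half_h)))
    return nx0, ny0, nx1, ny1


def multi_region_crops(image, box, scales=(0.5, 1.0, 1.5)):
    """Return one crop per scale: center part (<1), the box itself (1), context (>1).

    Fixed scales stand in for the adaptive, learned regions described
    in the abstract.
    """
    img_h, img_w = image.shape[:2]
    crops = {}
    for s in scales:
        x0, y0, x1, y1 = scale_box(box, s, img_h, img_w)
        crops[s] = image[y0:y1, x0:x1]
    return crops


# Toy usage: a 100x100 grayscale page with a 20x20 character box.
page = np.zeros((100, 100), dtype=np.uint8)
regions = multi_region_crops(page, (40, 40, 60, 60))
```

In a full pipeline, each crop would typically be resized to a common size and passed through a CNN, with the per-region features merged before classification. Using a context crop larger than the box is one way to make the classifier less dependent on an exactly placed bounding box.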

Keywords

Feature extraction, Adaptive systems, Adaptation models, Character recognition, Distortion, Proposals, Data mining
