Classification of Photographed Document Images Based on Deep-Learning Features

Guoqiang Zhong,Hui Yao,Yutong Liu, Chen Hong,Tuan Pham

Proceedings of SPIE（2017）

引用 2|浏览0

暂无评分

摘要

In this paper, we propose two new problems related to classification of photographed document images, and based on deep learning methods, present the baseline solutions for these two problems. The first problem is that, for some photographed document images, which book do they belong to? The second one is, for some photographed document images, what is the type of the book they belong to? To address these two problems, we apply "AexNet" to the collected document images. Using the pre-trained "AlexNet" on the ImageNet data set directly, we obtain 92.57% accuracy for the book-name classification and 93.33% accuracy for the book-type one. After fine-tuning on the training set of the photographed document images, the accuracy of the book-name classification increases to 95.54% and that of the book-type one to 95.42%. To our best knowledge, although there exist many image classification algorithm, no previous work has targeted to these two challenging problems. In addition, the experiments demonstrate that deep-learning features outperform features extracted with traditional image descriptors on these two problems.

查看译文

关键词

Photographed document images,classification,image features,Alexnet,SVMs

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要