Multimodal Document Image Classification
ICDAR, pp. 71-77, 2019.
State-of-the-art methods for document image classification rely on visual features extracted by deep convolutional neural networks (CNNs). These methods do not utilize rich semantic information present in the text of the document, which can be extracted using Optical Character Recognition (OCR). We first study the performance of state-of-...More
Full Text (Upload PDF)
PPT (Upload PPT)