Processing structured documents using convolutional neural networks
user-5f8cf7e04c775ec6fa691c92(2019)
Abstract
Structured documents are processed using convolutional neural networks. One of the methods includes receiving a rendered form of a structured document; mapping a grid of cells to the rendered form; assigning a respective numeric embedding to each cell in the grid, comprising, for each cell: identifying content in the structured document that corresponds to a portion of the rendered form that is mapped to the cell, mapping the identified content to a numeric embedding for the identified content, and assigning the numeric embedding for the identified content to the cell; generating a matrix representation of the structured document from the numeric embeddings assigned to the cells of the grid; and generating neural network features of the structured document by processing the matrix representation of the structured document through a subnetwork comprising one or more convolutional neural network layers.
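The steps in the abstract can be illustrated with a minimal NumPy sketch. Everything concrete here is an assumption made for illustration and is not taken from the source: the rendered form is modeled as a list of labeled bounding boxes (`ELEMENTS`), a cell is matched to content by testing its center point, embeddings are random vectors keyed by content label, and the convolutional subnetwork is a single untrained valid-padding layer with ReLU. The helper names (`build_embedding_table`, `document_to_matrix`, `conv2d_relu`) are hypothetical.

```python
import numpy as np

# Hypothetical rendered form: each element carries a content label and a
# bounding box (x0, y0, x1, y1) in page pixels. This layout is an assumption.
ELEMENTS = [
    {"content": "heading", "box": (0, 0, 400, 100)},
    {"content": "text", "box": (0, 100, 400, 300)},
    {"content": "image", "box": (0, 300, 200, 400)},
]


def build_embedding_table(labels, dim, rng):
    """Assign a random embedding to each label; 'empty' maps to zeros."""
    table = {label: rng.normal(size=dim) for label in labels}
    table["empty"] = np.zeros(dim)
    return table


def document_to_matrix(elements, page_w, page_h, rows, cols, table):
    """Map a rows x cols grid onto the page and fill each cell with the
    embedding of the first element whose box contains the cell center."""
    dim = len(table["empty"])
    grid = np.zeros((rows, cols, dim))
    cell_w, cell_h = page_w / cols, page_h / rows
    for r in range(rows):
        for c in range(cols):
            cx, cy = (c + 0.5) * cell_w, (r + 0.5) * cell_h
            label = "empty"
            for el in elements:
                x0, y0, x1, y1 = el["box"]
                if x0 <= cx < x1 and y0 <= cy < y1:
                    label = el["content"]
                    break
            grid[r, c] = table[label]
    return grid


def conv2d_relu(x, kernels):
    """One convolutional layer (stride 1, valid padding) followed by ReLU.
    x has shape (H, W, D); kernels has shape (kh, kw, D, F)."""
    kh, kw, _, num_filters = kernels.shape
    h, w, _ = x.shape
    out = np.zeros((h - kh + 1, w - kw + 1, num_filters))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = x[i:i + kh, j:j + kw, :]
            out[i, j] = np.tensordot(patch, kernels, axes=([0, 1, 2], [0, 1, 2]))
    return np.maximum(out, 0.0)


rng = np.random.default_rng(0)
table = build_embedding_table(["heading", "text", "image"], dim=8, rng=rng)
matrix = document_to_matrix(ELEMENTS, 400, 400, rows=4, cols=4, table=table)
features = conv2d_relu(matrix, rng.normal(size=(3, 3, 8, 5)))
print(matrix.shape, features.shape)
```

With a 4 x 4 grid and 8-dimensional embeddings, the matrix representation has shape (4, 4, 8), and the 3 x 3 convolution with 5 filters yields features of shape (2, 2, 5). In practice the embeddings and kernels would be learned, and the grid would be much finer than this toy example.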
Keywords
Structured document, Convolutional neural network, Artificial neural network, Subnetwork, Matrix representation, Embedding, Grid, Pattern recognition, Computer science, Artificial intelligence