Table Localization and Field Value Extraction in Piping and Instrumentation Diagram Images

2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)(2019)

引用 10|浏览151
暂无评分
摘要
Piping and Instrumentation Diagrams (P&IDs) are graph-based engineering drawings utilised in process engineering. These documents also contain additional information in tabular form. In this paper, the localisation and extraction of information of these tables are investigated. Documents used in this context are scanned raster version of P&IDs with tabular data inside a frame. The objective is to extract fields information from these tabular structures. This process is mainly divided into table localisation and then table field extraction from the segmented tables. The table localization task is achieved primarily with contour detection methods of computer vision. For the field-value extraction, a combination of rule-based keywords and navigation approach is used, utilising an Optical Character Recognition (OCR) for text extraction and regular expression for string comparison. This paper describes application of this extendable approach to the P&ID domain, where it achieved a promising result on a private dataset.
更多
查看译文
关键词
Table Localization, Information Extraction, Piping and Instrumentation Diagrams
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要