谷歌浏览器插件
订阅小程序
在清言上使用

A Novel Data Cleaning Framework Based on Knowledge Graph

Yuanfeng Song, Danni Zhang, Xiaodong Li,Kunming Luo,Jianming Liao

2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM(2022)

引用 0|浏览5
暂无评分
摘要
Inreal-world applications, data cleaning has long been a challenge across both academia and industry. Unsuccessful cleaning of data may lead to inaccurate analysis and untrustworthy decision-making. This paper proposes a novel knowledge graph-based data cleaning framework. The framework performs pattern repair and inference repair on dirty data based on the obtained implicit and explicit relationships by establishing the knowledge graph and the relationship patterns in the data. The pattern repair includes both explicit and implicit relationship matching, while the inference repair includes both attribution inference repair and rule inference repair. The experimental results show that the higher the number of association relations among data tables, the greater the improvement in cleaning efficiency; moreover, the more association knowledge is contained in the knowledge graph, the more obvious the improvement of cleaning efficiency.
更多
查看译文
关键词
data cleaning,knowledge graph,error repair,knowledge inference
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要