Provenance For Entity Resolution

PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018(2018)

引用 2|浏览22
暂无评分
摘要
Data provenance can support the understanding and debugging of complex data processing pipelines, which are for instance common in data integration scenarios. One task in data integration is entity resolution (ER), i.e., the identification of multiple representations of a same real world entity. This paper focuses of provenance modeling and capture for typical ER tasks. While our definition of ER provenance is independent of the actual language or technology used to define an ER task, the method we implement as a proof of concept instruments ER rules specified in HIL, a high-level data integration language.
更多
查看译文
关键词
Data provenance, Entity resolution, Data integration
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要