Tripleprov: Efficient Processing Of Lineage Queries In A Native Rdf Store

Marcin Wylot,Philippe Cudre-Mauroux,Paul Groth

WWW '14: 23rd International World Wide Web Conference Seoul Korea April, 2014（2014）

引用 66|浏览120

暂无评分

摘要

Given the heterogeneity of the data one can find on the Linked Data cloud, being able to trace back the provenance of query results is rapidly becoming a must-have feature of RDF systems. While provenance models have been extensively discussed in recent years, little attention has been given to the efficient implementation of provenance-enabled queries inside data stores. This paper introduces TripleProv: a new system extending a native RDF store to efficiently handle such queries. TripleProv implements two different storage models to physically co-locate lineage and instance data, and for each of them implements algorithms for tracing provenance at two granularity levels. In the following, we present the overall architecture of our system, its different lineage storage models, and the various query execution strategies we have implemented to efficiently answer provenance-enabled queries. In addition, we present the results of a comprehensive empirical evaluation of our system over two different datasets and workloads.

查看译文

关键词

Provenance Queries,Provenance Polynomials,RDF,Linked Open Data

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要