gMark: Schema-Driven Generation of Graphs and Queries.

IEEE Trans. Knowl. Data Eng.(2017)

引用 84|浏览73
暂无评分
摘要
Massive graph data sets are pervasive in contemporary application domains. Hence, graph database systems are becoming increasingly important. In the experimental study of these systems, it is vital that the research community has shared solutions for the generation of database instances and query workloads having predictable and controllable properties. In this paper, we present the design and engineering principles of $\\mathsf {gMark}$ , a domain- and query language-independent graph instance and query workload generator. A core contribution of $\\mathsf {gMark}$ is its ability to target and control the diversity of properties of both the generated instances and the generated workloads coupled to these instances. Further novelties include support for regular path queries, a fundamental graph query paradigm, and schema-driven selectivity estimation of queries, a key feature in controlling workload chokepoints. We illustrate the flexibility and practical usability of $\\mathsf {gMark}$ by showcasing the framework's capabilities in generating high quality graphs and workloads, and its ability to encode user-defined schemas across a variety of application domains.
更多
查看译文
关键词
Benchmark testing,Query processing,Database languages,Estimation,Generators
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要