Path Pattern Query Processing on Large Graphs

Big Data and Cloud Computing(2014)

引用 5|浏览0
暂无评分
摘要
There are plentiful and diverse applications of graph data management and mining in the real-world scientific research and business activities. As one of the most basic operations, uniform path pattern query processing on graph data faces three big challenges. In this paper, we deal with these challenges by the following points. Firstly, a new query language on graph, called G-Path, is presented, which focuses on complex path pattern query processing on a very large graph. Also, the design of a system called HDGL is proposed, which is based on a BSP-like model as well as MapReduce model, and can effectively handle distributed graph data operations and queries. Secondly, the implementation of HDGL on the de facto cloud platform - Hadoop - is brought forward. Based on the concept of distributed state machine, the query processing of a G-Path statement in HDGL is detailed. In addition, as the query optimization of G-Path queries, several tricks are utilized to improve dramatically the performance of query execution. Finally, extensive experiments on several graph data sets are conducted to show the usability of G-Path query language and the effectiveness of HDGL.
更多
查看译文
关键词
parallel processing,mapreduce model,path pattern query,g-path,uniform path pattern query processing,de facto cloud platform,graph data mining,graph data management,g-path query language,hdgl,large graphs,data mining,graph theory,hadoop,query processing,indexes,data models,database languages,pattern matching,writing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要