RaSQL: Greater Power and Performance for Big Data Analytics with Recursive-aggregate-SQL on Spark

Proceedings of the 2019 International Conference on Management of Data (2019)

Abstract
Thanks to a simple SQL extension, Recursive-aggregate-SQL (RaSQL) can express very powerful queries and declarative algorithms, such as classical graph algorithms and data mining algorithms. A novel compiler implementation allows RaSQL to map declarative queries into one basic fixpoint operator that supports aggregates in recursive queries. A fully optimized implementation of this fixpoint operator leads to superior performance, scalability, and portability. Thus, our RaSQL system, which extends Spark SQL with the aforementioned constructs and implementation techniques, matches and often surpasses the performance of other systems, including Apache Giraph, GraphX, and Myria.
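To make the idea of a fixpoint operator with an aggregate inside the recursion more concrete, the following is a minimal single-machine sketch in Python. It is not RaSQL syntax and not the paper's implementation. It assumes a single-source shortest-path query over non-negative edge costs, evaluated with a generic semi-naive loop in which a `min` aggregate is applied at every iteration. The function name `sssp_fixpoint` and the `(src, dst, cost)` edge format are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def sssp_fixpoint(edges, source):
    """Semi-naive fixpoint with a min aggregate applied inside the recursion.

    edges: iterable of (src, dst, cost) tuples (hypothetical input format).
    Assumes non-negative costs so that the iteration terminates.
    """
    adjacency = defaultdict(list)
    for src, dst, cost in edges:
        adjacency[src].append((dst, cost))

    best = {source: 0}   # aggregated result so far: node -> minimum cost
    delta = {source: 0}  # facts that were new or improved in the last iteration

    while delta:
        # Recursive step: extend only the facts that changed last round.
        candidates = {}
        for node, dist in delta.items():
            for dst, cost in adjacency[node]:
                new_dist = dist + cost
                # min aggregate over the candidates produced in this iteration
                if new_dist < candidates.get(dst, float("inf")):
                    candidates[dst] = new_dist

        # Keep only candidates that improve on the aggregated result.
        delta = {
            node: dist
            for node, dist in candidates.items()
            if dist < best.get(node, float("inf"))
        }
        best.update(delta)

    # Fixpoint reached: an iteration produced no improvement.
    return best
```

Applying the aggregate inside the loop, rather than after enumerating all paths, means that only improved distances are propagated, which is what lets the iteration terminate on cyclic graphs under the non-negative cost assumption.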
Keywords
aggregates in recursion, declarative algorithms in SQL, distributed computation, recursive query