
Representing MapReduce Optimisations in the Nested Relational Calculus

British National Conference on Databases (2013)

Abstract
The MapReduce (MR) programming model has recently attracted a lot of attention from both academic and business researchers. Systems based on this model hide communication and synchronization issues from the user and allow high volumes of data to be processed on thousands of commodity computers. In this paper we are interested in applying MR to the processing of hierarchical data with nested collections, such as data stored in JSON or XML formats, but with restricted nesting depth, as is usual in the nested relational model. Current data analytics systems often propose ad hoc formalisms to represent query evaluation plans and to optimize their execution. We argue that the Nested Relational Calculus (NRC) provides a general, elegant and effective way to describe and investigate these optimizations. It allows one to describe and combine both classical optimizations and MapReduce-specific optimizations. We demonstrate this by showing that MapReduce programs can be expressed and represented straightforwardly in NRC by adding syntactic shorthands. In addition, we show that optimizations in existing systems can be readily represented in this extended formalism.
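To make the abstract's central claim more concrete, the following is a minimal sketch of how a MapReduce job can be read as a composition of collection operations in the spirit of NRC. It is not the paper's notation or its shorthands: the helper names `flat_map`, `nest` and `map_reduce`, the word-count job, and the sample documents are all assumptions introduced here for illustration. The last lines check one classical rewrite, the fusion of two consecutive flat-maps into one, of the kind such a calculus lets one state as an equation.

```python
from collections import defaultdict

def flat_map(f, xs):
    """NRC-style 'union of f(x) for x in xs': apply f and flatten one level."""
    return [y for x in xs for y in f(x)]

def nest(pairs):
    """Group (key, value) pairs into (key, [values]) pairs.

    Plays the role of the shuffle phase. Keys are sorted only to make the
    output deterministic, so they are assumed to be orderable.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())

def map_reduce(mapper, reducer, records):
    """Compose map, shuffle (nest) and reduce as three collection operations."""
    shuffled = nest(flat_map(mapper, records))
    return flat_map(lambda kv: reducer(kv[0], kv[1]), shuffled)

# Hypothetical nested input: each document holds a nested collection of words.
docs = [
    {"id": 1, "words": ["map", "reduce", "map"]},
    {"id": 2, "words": ["reduce", "nest"]},
]

def word_mapper(doc):
    """Emit one (word, 1) pair per word in the nested collection."""
    return [(word, 1) for word in doc["words"]]

def sum_reducer(key, values):
    """Emit a single (word, total) pair for each group."""
    return [(key, sum(values))]

word_counts = map_reduce(word_mapper, sum_reducer, docs)

# Classical fusion rewrite: two passes over the data equal one fused pass.
def upper(pair):
    return [(pair[0].upper(), pair[1])]

two_passes = flat_map(upper, flat_map(word_mapper, docs))
one_pass = flat_map(lambda doc: flat_map(upper, word_mapper(doc)), docs)
fusion_holds = two_passes == one_pass
```

In this sketch the shuffle is an ordinary nesting operation on a flat collection of pairs, so restricting nesting depth in the data model does not get in the way. The fused form avoids materializing the intermediate list, which is the kind of saving a plan optimizer would look for.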
Keywords
MapReduce-specific optimizations, classical optimizations, MapReduce programming model, current data, hierarchical data, nested relational model, MapReduce program, nested collection, Nested Relation Calculus, XML format, mapreduce optimisations, nested relational calculus