NativeTask: A Hadoop compatible framework for high performance

BigData Conference(2013)

引用 12|浏览143
暂无评分
摘要
Although Hadoop MapReduce provides good programming abstractions and horizontal scalability, it is often blamed for its poor single node performance. In the meantime, MapReduce has already achieved a large install base, thus any performance improvement should keep the compatibility. In this paper, we address the challenges via several approaches guided by low-level performance analysis. And we materialize the approaches via NativeTask, a high-performance, fully compatible MapReduce execution engine. We evaluate its performance with representative HiBench workloads. The results show that the speedup NativeTask achieves ranges from 10% to 160%, and it paves the way for a better MapReduce that excels on both single node performance and scalability. In the future, hardware acceleration can also be applied to further improve the system's efficiency.
更多
查看译文
关键词
nativetask,high performance,c++ implementation,low-level performance analysis,cache-oblivious sort,hardware acceleration,programming abstractions,representative hibench workloads,cpu-bound application,system efficiency,hadoop compatible framework,software performance evaluation,compatibility,hadoop,horizontal scalability,hadoop mapreduce,distributed processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要