Detecting bottlenecks in parallel DAG-based data flow programs

MTAGS@SC(2010)

引用 24|浏览14
暂无评分
摘要
In recent years, several frameworks have been introduced to facilitate massively-parallel data processing on shared-nothing architectures like compute clouds. While these frameworks generally offer good support in terms of task deployment and fault tolerance, they only provide poor assistance in finding reasonable degrees of parallelization for the tasks to be executed. However, as billing models of clouds enable the utilization of many resources for a short period of time for the same cost as utilizing few resources for a long time, proper levels of parallelization are crucial to achieve short processing times while maintaining good resource utilization and therefore good cost efficiency. In this paper, we present and evaluate a solution for detecting CPU and I/O bottlenecks in parallel DAG-based data flow programs assuming capacity constrained communication channels. The detection of bottlenecks represents an important foundation for manually or automatically scaling out and tuning parallel data flow programs in order to increase performance and cost efficiency.
更多
查看译文
关键词
cloud computing,data flow computing,data flow graphs,fault tolerance,multiprocessing systems,parallel architectures,resource allocation,task analysis,cpu bottleneck detection,directed acyclic graph,parallel dag-based data flow program,parallel data processing,resource utilization,shared-nothing architecture,task deployment,classification algorithms,data flow,data processing,communication channels,cost efficiency,instruction sets,fault tolerant,shared nothing architecture
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要