Analyzing the impact of programming models for efficient communication overlap in high-speed networks

High Performance Computing & Simulation(2014)

引用 3|浏览11
暂无评分
摘要
Exascale applications for civil engineering, simulations and other fields related with current research make intensive use of large sparse matrices. A characteristic of these matrices is the difficulty of balancing communication and computation, so that even when these two phases are overlapped the application does not achieve a good overall scalability, but instead suffers from a loss of performance. Some proposals have been presented in order to diminish this drawback, based on the hybrid use of programming models, using MPI as the communication basis and threads for computation -mainly OpenMP, but also Cilk, CUDA or OpenCL, to adapt to new heterogeneous platforms. In this work, we evaluate the impact of providing task-based parallelism instead of fork-join parallelism. As regards communication, the appearance of faster networks with specific optimizations and internal protocol characteristics makes it appealing to analyze and evaluate the influence of these networks on performance execution. We evaluate our results on two different communication networks: 10Gigabit Ethernet and Infiniband. For our evaluations we run the miniFE miniapplication of the Mantevo suite benchmark, in a homogeneous supercomputer platform based on Intel SandyBridge processors. Experimental results show how the network behavior can affect performance and how it can be managed via task-based models: from a hybrid MPI/OpenMP version that overlaps communication and computation, our task-based proposal MPI/OmpSs obtains up to 60% improvement.
更多
查看译文
关键词
local area networks,message passing,parallel programming,CUDA,Cilk,Ethernet,Infiniband,Intel SandyBridge processors,Mantevo suite benchmark,OpenCL,OpenMP,communication overlap,compute unified device architecture,exascale applications,fork-join parallelism,high-speed networks,homogeneous supercomputer platform,miniFE miniapplication,programming models,sparse matrix,task-based parallelism,Message passing,high performance interconnection networks,libraries and programming environments,scalable computing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要