RDMA-Based Library for Collective Operations in MPI

2019 IEEE/ACM Workshop on Exascale MPI (ExaMPI) (2019)

Abstract
In most MPI implementations, abstraction layers separate the collective operation algorithms from the communication primitives, which hinders optimizing those algorithms with network acceleration technologies such as RDMA. Open UCX is an RDMA-based point-to-point communication library that can reduce the latency between processes in MPI applications, particularly in large-scale systems. This paper presents the design and implementation of a library for MPI collective operations that extends Open UCX. Our approach is transparent to MPI applications and can reduce the latency of repeated calls to such operations by an average of 8% for relatively small message sizes and by as much as 90% for larger messages.
Keywords
MPI collective operations, Open UCX, abstraction layers, collective operation algorithms, network acceleration technologies, RDMA-based point-to-point communication library
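
To make the layering described in the abstract concrete, the following minimal sketch shows how a collective operation (an allreduce) can be composed from pairwise exchanges. It is an in-process simulation only: it does not call MPI or Open UCX, it is not the algorithm used in the paper, and the function name `simulated_allreduce` is hypothetical. It uses the well-known recursive-doubling pattern and assumes a power-of-two number of ranks and a commutative, associative reduction operator.

```python
from typing import Callable, List


def simulated_allreduce(
    values: List[float],
    op: Callable[[float, float], float] = lambda a, b: a + b,
) -> List[float]:
    """Recursive-doubling allreduce over a simulated set of ranks.

    values[i] is the local contribution of rank i. This is an illustrative
    sketch: a list lookup stands in for a point-to-point send/receive pair.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("number of ranks must be a power of two")

    current = list(values)
    distance = 1
    while distance < n:
        # Each rank exchanges its partial result with the partner whose rank
        # differs in exactly one bit (rank XOR distance).
        received = [current[rank ^ distance] for rank in range(n)]
        # Each rank combines its own partial result with the one it received.
        current = [op(current[rank], received[rank]) for rank in range(n)]
        distance *= 2
    return current


if __name__ == "__main__":
    # Four simulated ranks contribute 1, 2, 3, 4; every rank ends with the sum.
    print(simulated_allreduce([1, 2, 3, 4]))
```

In this pattern every rank holds the full result after log2(n) exchange rounds. In a real implementation each exchange would be a network operation, which is where a point-to-point layer such as RDMA would come into play.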