Efficient Breadth First Search on Multi-GPU Systems Using GPU-Centric OpenSHMEM.

Lecture Notes in Computer Science(2018)

引用 1|浏览6
暂无评分
摘要
NVSHMEM is an implementation of OpenSHMEM for NVIDIA GPUs which allows communication to be issued from inside CUDA kernels. In this work, we present an implementation of Breadth First Search for multi-GPU systems using NVSHMEM. We analyze the benefits and bottlenecks of moving fine-grained communication into CUDA kernels. Using our implementation of BFS, we achieve up to 75% improvement in performance compared to a CUDA-aware MPI-based implementation, in the best case.
更多
查看译文
关键词
efficient breadth first search,multi-gpu,gpu-centric
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要