Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU

Jou-An Chen,Hsin-Hsuan Sung,Xipeng Shen,Nathan Tallent,Kevin Barker,Ang Li

2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)（2022）

引用 1|浏览3

暂无评分

摘要

In a general graph data structure like an adjacency matrix, when edges are homogeneous, the connectivity of two nodes can be sufficiently represented using a single bit. This insight has, however, not yet been adequately exploited by the existing matrix-centric graph processing frameworks. This work fills the void by systematically exploring the bit-level representation of graphs and the corresponding optimizations to the graph operations. It proposes a two-level representation named Bit-Block Compressed Sparse Row (B2SR) and presents a series of optimizations to the graph operations on B2SR by leveraging the intrinsics of modern GPUs. Evaluations on NVIDIA Pascal and Volta GPUs show that the optimizations bring up to $40\times$ and $6555\times$ for essential GraphBLAS kernels SpMV and SpGEMM, respectively, making GraphBLAS-based BFS accelerate up to $433\times$, SSSP, PR, and CC up to $35\times$, and TC up to $52\times$.

查看译文

关键词

Bit-GraphBLAS,bit-level optimizations,general graph data structure,adjacency matrix,single bit,existing matrix-centric graph processing frameworks,bit-level representation,corresponding optimizations,graph operations,two-level representation,Bit-Block Compressed Sparse Row,B2SR,GraphBLAS-based BFS

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要