Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs

Euro-Par 2020: Parallel Processing Workshops (2021)

Abstract
We contribute to the optimization of the sparse matrix-vector product on graphics processing units by introducing a variant of the coordinate sparse matrix layout that compresses the integer representation of the matrix indices. In addition, we employ a look-ahead table to avoid the storage of repeated numerical values in the sparse matrix, yielding a more compact data representation that is easier to maintain in the cache. Our evaluation on the two most recent generations of NVIDIA GPUs, the V100 and the A100 architectures, shows considerable performance improvements over the kernels for the sparse matrix-vector product in cuSPARSE (CUDA 11.0.167).
Keywords
Sparse matrix-vector product, Sparse matrix data layouts, Sparse linear algebra, High performance computing, GPUs
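
The two ideas named in the abstract, narrower integer storage for the coordinate indices and a table of unique values that replaces repeated entries, can be illustrated with a small CPU-side sketch. This is not the authors' layout or their GPU kernel: the helper names (`smallest_uint_array`, `compress_coo`, `spmv`) are hypothetical, the index compression shown here simply picks the narrowest unsigned type that fits, and the load balancing implied by the title is not modeled.

```python
from array import array


def smallest_uint_array(values):
    """Store non-negative integers in the narrowest unsigned array type that fits.

    Hypothetical helper: a stand-in for index compression, not the paper's scheme.
    """
    values = list(values)
    largest = max(values, default=0)
    for code in "BHIQ":  # unsigned char, short, int, long long
        if largest < 1 << (8 * array(code).itemsize):
            return array(code, values)
    raise ValueError("index does not fit in 64 bits")


def compress_coo(rows, cols, vals):
    """Build a compact COO variant: narrow indices plus a table of unique values."""
    table = []    # each distinct numerical value is stored once
    position = {} # value -> its slot in the table
    val_idx = []
    for v in vals:
        if v not in position:
            position[v] = len(table)
            table.append(v)
        val_idx.append(position[v])
    return (
        smallest_uint_array(rows),
        smallest_uint_array(cols),
        smallest_uint_array(val_idx),
        table,
    )


def spmv(compressed, x, n_rows):
    """Compute y = A @ x from the compact representation (sequential reference)."""
    rows, cols, val_idx, table = compressed
    y = [0.0] * n_rows
    for r, c, k in zip(rows, cols, val_idx):
        y[r] += table[k] * x[c]  # the value is fetched through the table
    return y
```

In this sketch a matrix with few distinct values stores one small index per nonzero instead of a full floating-point number, which is the kind of footprint reduction the abstract attributes to the table of values.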