High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

Thomas Grützmacher,Hartwig Anzt,Florian Scheidegger,Enrique S. Quintana-Orti

2018 IEEE/ACM 8th Workshop on Irregular Applications: Architectures and Algorithms (IA3)（2018）

引用 5|浏览13

暂无评分

摘要

We address the acceleration of the PageRank al- gorithm for web information retrieval on graphics processing units (GPUs) via a modular precision framework that adapts the data format in memory to the numerical requirements as the iteration converges. In detail, we abandon the IEEE 754 single- and double-precision number representation formats, employed in the standard implementation of PageRank, to instead store the data in memory in some specialized formats. Furthermore, we avoid the data duplication by leveraging a data layout based on mantissa segmentation. Our evaluation on a V100 graphics card from NVIDIA shows acceleration factors of up to 30% with respect to the standard algorithm operating in double-precision.

查看译文

关键词

Sparse matrices,Graphics processing units,Web pages,Memory management,Layout,Kernel

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要