High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

2018 IEEE/ACM 8th Workshop on Irregular Applications: Architectures and Algorithms (IA3)(2018)

引用 5|浏览13
暂无评分
摘要
We address the acceleration of the PageRank al- gorithm for web information retrieval on graphics processing units (GPUs) via a modular precision framework that adapts the data format in memory to the numerical requirements as the iteration converges. In detail, we abandon the IEEE 754 single- and double-precision number representation formats, employed in the standard implementation of PageRank, to instead store the data in memory in some specialized formats. Furthermore, we avoid the data duplication by leveraging a data layout based on mantissa segmentation. Our evaluation on a V100 graphics card from NVIDIA shows acceleration factors of up to 30% with respect to the standard algorithm operating in double-precision.
更多
查看译文
关键词
Sparse matrices,Graphics processing units,Web pages,Memory management,Layout,Kernel
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要