LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs

2021 IEEE International Conference on Networking, Architecture and Storage (NAS), 2021

Abstract
Exploiting data locality in GPGPUs is critical for efficiently using the smaller data caches and handling the memory bottleneck problem. This paper proposes a thread block-centric locality analysis, which identifies the locality among the thread blocks (TBs) in terms of a number of common data references. In LocalityGuru, we seek to employ a detailed just-in-time (JIT) compilation analysis of the ...
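To make the notion of thread block-level locality concrete, the following is a minimal sketch of counting common data references between pairs of TBs. It is not the paper's PTX-based analysis: the 3-point stencil access pattern and the function names (`tb_references`, `stencil_access`, `pairwise_locality`) are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def tb_references(tb_id, threads_per_tb, access_fn):
    """Collect the set of data indices touched by all threads of one TB."""
    refs = set()
    for tid in range(threads_per_tb):
        refs.update(access_fn(tb_id, tid, threads_per_tb))
    return refs


def stencil_access(tb_id, tid, threads_per_tb):
    """Hypothetical access pattern: thread i reads elements i-1, i, i+1."""
    i = tb_id * threads_per_tb + tid  # global thread index
    return (i - 1, i, i + 1)


def pairwise_locality(num_tbs, threads_per_tb, access_fn):
    """Map each TB pair (a, b), a < b, to its number of common references."""
    refs = [tb_references(tb, threads_per_tb, access_fn) for tb in range(num_tbs)]
    return {
        (a, b): len(refs[a] & refs[b])
        for a, b in combinations(range(num_tbs), 2)
    }


locality = pairwise_locality(num_tbs=4, threads_per_tb=8, access_fn=stencil_access)
for pair, shared in sorted(locality.items()):
    print(pair, shared)
```

Under this assumed stencil pattern, only neighbouring TBs share data (the halo elements at their boundary), so a locality-aware policy would have reason to treat adjacent TBs as related.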
Keywords
Instruction sets, Prefetching, Conferences, Memory management, Graphics processing units, Syntactics, Timing