LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs
2021 IEEE International Conference on Networking, Architecture and Storage (NAS)(2021)
摘要
Exploiting data locality in GPGPUs is critical for efficiently using the smaller data caches and handling the memory bottleneck problem. This paper proposes a thread block-centric locality analysis, which identifies the locality among the thread blocks (TBs) in terms of a number of common data references. In LocalityGuru, we seek to employ a detailed just-in-time (JIT) compilation analysis of the ...
更多查看译文
关键词
Instruction sets,Prefetching,Conferences,Memory management,Graphics processing units,Syntactics,Timing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要