Exploiting Direct Memory Operands in GPU Instructions

Ali Mohammadpur-Fard,Sina Darabi,Hajar Falahati, Negin Mahani,Hamid Sarbazi-Azad

IEEE Computer Architecture Letters(2024)

引用 0|浏览2
暂无评分
摘要
Modern GPUs are widely used for diverse applications, particularly data-parallel tasks like machine learning and scientific computing. However, their efficiency is hindered by architectural limitations, inherited from historical RISC processors, in handling memory loads causing high register file contention. We observe that a significant number (around 26%) of values present in the register file are typically used only once, contributing to more than 25% of the total register file bank conflicts, on average. This paper addresses the challenge of single-use memory values in the GPU register file (i.e. data values used only once) which wastes space and increases latency. To this end, we introduce a novel mechanism inspired by CISC architectures. It replaces single-use loads with direct memory operands in arithmetic operations. Our approach improves performance by 20% and reduces energy consumption by 18%, on average, with negligible (<1%) hardware overhead.
更多
查看译文
关键词
CISC,GPGPU,RISC,register file
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要