Intra-Cluster Coalescing and Distributed-Block Scheduling to Reduce GPU NoC Pressure

IEEE Transactions on Computers (2019)

Abstract
GPUs continue to boost the number of streaming multiprocessors (SMs) to provide increasingly higher compute capabilities. To construct a scalable crossbar network-on-chip (NoC) that connects the SMs to the memory controllers, a cluster structure is introduced in modern GPUs in which several SMs are grouped together to share a network port. Because of network port sharing, clustered GPUs face sever...
Keywords
Graphics processing units, Scheduling, Bandwidth, Processor scheduling, Registers, Instruction sets, Kernel