谷歌浏览器插件
订阅小程序
在清言上使用

Overcoming Limitations of GPGPU-Computing in Scientific Applications.

IEEE Conference on High Performance Extreme Computing(2019)

引用 1|浏览3
暂无评分
摘要
The performance of discrete general purpose graphics processing units (GPGPUs) has been improving at a rapid pace. The PCIe interconnect that controls the communication of data between the system host memory and the GPU has not improved as quickly, leaving a gap in performance due to GPU downtime while waiting for PCIe data transfer. In this article, we explore two alternatives to the limited PCIe bandwidth, NVIDIA NVLink interconnect, and zero-copy algorithms for shared memory Heterogeneous System Architecture (HSA) devices. The OpenCL SHOC benchmark suite is used to measure the performance of each device on various scientific application kernels.
更多
查看译文
关键词
Scientific Computing,Embedded Devices,Accelerators,Parallel Computing,Supercomputing,GPU,GPGPU,OpenCL,SHOC,Physics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要