Chrome Extension
WeChat Mini Program
Use on ChatGLM

Assessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC Systems

Finn Wilkinson, Alex Cockrean,Wei-Chen Lin,Simon McIntosh-Smith,Tom Deakin

SC-W '24 Proceedings of the SC '24 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis(2025)

Cited 0|Views1
Key words
BLAS,Performance,Heterogeneous,High-Performance Computing,Nvidia Grace-Hopper,AMD MI250X,Intel Ponte Vecchio
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined