Assessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC Systems
SC-W '24 Proceedings of the SC '24 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis(2025)
Key words
BLAS,Performance,Heterogeneous,High-Performance Computing,Nvidia Grace-Hopper,AMD MI250X,Intel Ponte Vecchio
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined