RAJA: Portable Performance for Large-Scale Scientific Applications

David Beckingsale,Thomas R. W. Scogland,Jason Burmark,Rich Hornung,Holger Jones,William Killian,Adam J. Kunen,Olga Pearce,Peter Robinson,Brian S. Ryujin

2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)（2019）

引用 210|浏览23

暂无评分

摘要

Modern high-performance computing systems are diverse, with hardware designs ranging from homogeneous multi- core CPUs to GPU or FPGA accelerated systems. Achieving desir- able application performance often requires choosing a program- ming model best suited to a particular platform. For large codes used daily in production that are under continual development, architecture-specific ports are untenable. Maintainability re- quires single-source application code that is performance portable across a range of architectures and programming models. In this paper we describe RAJA, a portability layer that enables C++ applications to leverage various programming models, and thus architectures, with a single-source codebase. We describe preliminary results using RAJA in three large production codes at Lawrence Livermore National Laboratory, observing 17×, 13× and 12× speedup on GPU-only over CPU- only nodes with single-source application code in each case.

查看译文

关键词

C++ applications,single-source codebase,RAJA,production codes,high-performance computing systems,hardware designs,programming model,architecture-specific ports,homogeneous multicore CPU,Lawrence Livermore National Laboratory

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要