RAJA: Portable Performance for Large-Scale Scientific Applications

2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)(2019)

引用 210|浏览23
暂无评分
摘要
Modern high-performance computing systems are diverse, with hardware designs ranging from homogeneous multi- core CPUs to GPU or FPGA accelerated systems. Achieving desir- able application performance often requires choosing a program- ming model best suited to a particular platform. For large codes used daily in production that are under continual development, architecture-specific ports are untenable. Maintainability re- quires single-source application code that is performance portable across a range of architectures and programming models. In this paper we describe RAJA, a portability layer that enables C++ applications to leverage various programming models, and thus architectures, with a single-source codebase. We describe preliminary results using RAJA in three large production codes at Lawrence Livermore National Laboratory, observing 17×, 13× and 12× speedup on GPU-only over CPU- only nodes with single-source application code in each case.
更多
查看译文
关键词
C++ applications,single-source codebase,RAJA,production codes,high-performance computing systems,hardware designs,programming model,architecture-specific ports,homogeneous multicore CPU,Lawrence Livermore National Laboratory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要