A benchmark-based performance model for memory-bound HPC applications

Bertrand Putigny,Brice Goglin,Denis Barthou

High Performance Computing & Simulation（2014）

引用 10|浏览7

暂无评分

摘要

The increasing computation capability of servers comes with a dramatic increase of their complexity through many cores, multiple levels of caches and NUMA architectures. Exploiting the computing power is increasingly harder and programmers need ways to understand the performance behavior. We present an innovative approach for predicting the performance of memory-bound multi-threaded applications. It relies on micro-benchmarks and a compositional model, combining measures of micro-benchmarks in order to model larger codes. Our memory model takes into account cache sizes and cache coherence protocols, having a large impact on performance of multi-threaded codes. Applying this model to real world HPC kernels shows that it can predict their performance with good accuracy, helping taking optimization decisions to increase application's performance.

查看译文

关键词

cache storage,multi-threading,multiprocessing systems,HPC kernels,NUMA architectures,benchmark-based performance model,cache coherence protocols,cache level,cache size,high performance computing,memory-bound HPC applications,memory-bound multithreaded applications,microbenchmarks,caches,memory model,microbenchmarks,multicore,timing prediction

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要