A Multi-Level Optimization Strategy To Improve The Performance Of Stencil Computation

INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017)(2017)

引用 3|浏览13
暂无评分
摘要
Stencil computation represents an important numerical kernel in scientific computing. Lever-aging multi-core or many-core parallelism to optimize such operations represents a major challenge due to both the bandwidth demand and the low arithmetic intensity. The situation is worsened by the complexity of current architectures and the potential impact of various mechanisms (cache memory, vectorization, compilation). In this paper, we describe a multi-level optimization strategy that combines manual vectorization, space tiling and stencil composition. A major effort of this study is to compare our results with the Pochoir framework. We evaluate our methodology with a set of three different compilers (Intel, Clang and GCC) on two recent generations of Intel multi-core platforms. Our results show a good match with the theoretical performance models (i.e. roofline models). We also outperform Pochoir performance by a factor of x2.5 in the best case. (C) 2017 The Authors. Published by Elsevier B.V.
更多
查看译文
关键词
Stencil computation,Vectorization,Performance model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要