Applying Loop Tiling And Unrolling To A Sparse Kernel Code

COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS(2004)

引用 1|浏览10
暂无评分
摘要
Code transformations to optimize the performance work well where a very precise data dependence analysis can be done at compile time. However, current compilers usually do not optimize irregular codes, because they contain input dependent and/or dynamic memory access patterns. This paper presents how we can adapt two representative loop transformations, tiling and unrolling, to codes with irregular computations, obtaining a significant performance improvement over the original non-transformed code. Experiments of our proposals are conducted on three different hardware platforms. A very known sparse kernel code is used as an example code to show performance improvements.
更多
查看译文
关键词
loop tiling,transform coding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要