Cache topology aware computation mapping for multicores

Sigplan Notices(2010)

引用 46|浏览49
暂无评分
摘要
The main contribution of this paper is a compiler based, cache topology aware code optimization scheme for emerging multicore systems. This scheme distributes the iterations of a loop to be executed in parallel across the cores of a target multicore machine and schedules the iterations assigned to each core. Our goal is to improve the utilization of the on-chip multi-layer cache hierarchy and to maximize overall application performance. We evaluate our cache topology aware approach using a set of twelve applications and three different commercial multicore machines. In addition, to study some of our experimental parameters in detail and to explore future multicore machines (with higher core counts and deeper on-chip cache hierarchies), we also conduct a simulation based study. The results collected from our experiments with three Intel multicore machines show that the proposed compiler-based approach is very effective in enhancing performance. In addition, our simulation results indicate that optimizing for the on-chip cache hierarchy will be even more important in future multicores with increasing numbers of cores and cache levels.
更多
查看译文
关键词
intel multicore machine,multicore,cache level,cache,multi-level,topology-aware,cache topology aware approach,deeper on-chip cache hierarchy,different commercial multicore machine,cache topology aware code,future multicore machine,on-chip cache hierarchy,compiler,on-chip multi-layer cache hierarchy,multicore system,cache topology aware computation,code optimization,chip
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要