Compiler managed micro-cache bypassing for high performance EPIC processors

Youfeng Wu,Ryan Rakvic,Li-Ling Chen,Chyi-Chang Miao,George Chrysos,Jesse Fang

MICRO（2002）

引用 52|浏览111

暂无评分

摘要

Advanced microprocessors have been increasing clock rates, well beyond the Gigahertz boundary. For such high performance microprocessors, a small and fast data micro cache (ucache) is important to overall performance, and proper management of it via load bypassing has a significant performance impact. In this paper, we propose and evaluate a hardware-software collaborative technique to manage ucache bypassing for EPIC processors. The hardware supports the ucache bypassing with a flag in the load instruction format, and the compiler employs static analysis and profiling to identify loads that should bypass the ucache. The collaborative method achieves a significant improvement in performance for the SpecInt2000 benchmarks. On average, about 40%, 30%, 24%, and 22% of load references are identified to bypass 256B, 1K, 4K, and 8K sized ucaches, respectively. This reduces the ucache miss rates by 39%, 32%, 28%, and 26%. The number of pipeline stalls from loads to their uses is reduced by 13%, 9%, 6%, and 5%. Meanwhile, the L1 and L2 cache misses remain largely unchanged. For the 256B ucache, bypassing improves overall performance on average by 5%.

查看译文

关键词

significant performance impact,collaborative method,load reference,load instruction format,ucache bypassing,overall performance,high performance microprocessors,high performance epic processor,advanced microprocessors,micro-cache bypassing,l2 cache,load bypassing,dynamic scheduling,compiler,static analysis,collaboration,hardware,computer architecture,performance,pipelines

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要