Summarizing multiprocessor program execution with versatile, microarchitecture-independent snapshots

Summarizing multiprocessor program execution with versatile, microarchitecture-independent snapshots(2006)

引用 28|浏览13
暂无评分
摘要
Computer architects rely heavily on software simulation to evaluate, refine, and validate new designs before they are implemented. However, simulation time continues to increase as computers become more complex and multicore designs become more common. This thesis investigates software structures and algorithms for quickly simulating modern cache-coherent multiprocessors by amortizing the time spent to simulate the memory system and branch predictors. The Memory Timestamp Record (MTR) summarizes the directory and cache state of a multiprocessor system in a compact data structure. A single MTR snapshot is versatile enough to reconstruct the microarchitectural state resulting from various coherence protocols and cache organizations. The MTR may be quickly updated by each simulated processor during a fast-forwarding phase and optionally stored offline for reuse. To fill large branch prediction tables, we introduce Branch Predictor-based Compression (BPC) which compactly stores a branch trace so that it may be used to fill in any branch predictor structure. An entire BPC trace requires less space than single discrete predictor snapshots, and it may be decompressed 3-6 times faster than performing functional simulation.
更多
查看译文
关键词
multiprocessor program execution,branch predictor structure,simulation time,large branch prediction table,branch predictor,software simulation,functional simulation,branch trace,cache state,cache organization,microarchitecture-independent snapshot,single MTR snapshot
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要