CAeSaR: unified cluster-assignment scheduling and communication reuse for clustered VLIW processors

CASES(2013)

引用 4|浏览28
暂无评分
摘要
Clustered architectures have been proposed as a solution to the scalability problem of wide ILP processors. VLIW architectures, being wide-issue by design, benefit significantly from clustering. Such architectures, being both statically scheduled and clustered, require specialized code generation techniques, as they require explicit Inter-Cluster Copy instructions (ICCs) be scheduled in the code stream. In this work we propose CAeSaR, a novel instruction scheduling algorithm that improves code generation for such architectures. It combines cluster assignment, instruction scheduling and inter-cluster communication reuse all in one single unified algorithm. The proposed algorithm improves performance by any phase-ordering issues among these three code generation and optimization steps. We evaluate CAeSaR on the MediabenchII and SPEC CINT2000 benchmarks and compare it against the state-of-the-art instruction scheduling algorithm. Our results show an improvement in execution time of up to 20.3%, and 13.8% on average, over the current state-of-the-art across the benchmarks.
更多
查看译文
关键词
instruction sets,parallel algorithms,parallel architectures,pattern clustering,processor scheduling,program compilers,CAeSaR,ICCs,ILP processors,MediabenchII,SPEC CINT2000 benchmarks,clustered VLIW processors,clustered architectures,code generation techniques,explicit inter-cluster copy instructions,instruction scheduling algorithm,intercluster communication reuse,unified cluster-assignment scheduling and communication reuse,Cluster Assignment,Clustered VLIW,Instruction Scheduling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要