Tradeoffs In Designing Accelerator Architectures For Visual Computing

Aqeel Mahesri,Daniel Johnson,Neal Crago,Sanjay J. Patel

MICRO（2008）

引用 56|浏览50

暂无评分

摘要

Visualization, interaction, and simulation (VIS) constitute a class of applications that is growing in importance. This class includes applications such as graphics rendering, video encoding, simulation, and computer vision. These applications are ideally suited for accelerators because of their parallelizability and demand for high throughput. We compile a benchmark suite, VISBench, to sense as a proxy for this application class.We use VISBench to examine some important high level decisions for all accelerator architecture. We propose a highly parallel base architecture. We examine the need for synchronization and data communication. We also examine GPU-style SIMD execution and find that a MIMD architecture usually performs better.Given these high level choices, we use VISBench to explore the microarchitectural design space. We analyze area versus performance tradeoffs in designing individual cores and the memory hierarchy. We find that a design made of small, sample cores achieves much higher throughput than a general purpose uniprocessor. Further we find that a limited amount of support for ILP within each core aids overall performance. We find that fine-grained multithreading improves performance, but only up to a point. We find that word-level (SSE-style) SIMD provides a poor performance to area ratio. Finally, we find that sufficient memory and cache band-width is essential to performance.

查看译文

关键词

cache storage,data visualisation,multi-threading,parallel architectures,synchronisation,GPU-style SIMD execution,MIMD architecture,accelerator architecture design,cache bandwidth,data communication,fine-grained multithreading,memory bandwidth,parallel base architecture,synchronization,uniprocessor,visual computing,visualization-interaction-simulation,

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要