An Evaluation of Vectorizing Compilers

Saeed Maleki,Yaoqing Gao,Maria J. Garzarán,Tommy Wong,David A. Padua

PACT（2011）

引用 310|浏览362

暂无评分

摘要

Most of today's processors include vector units that have been designed to speedup single threaded programs. Although vector instructions can deliver high performance, writing vector code in assembly language or using intrinsics in high level languages is a time consuming and error-prone task. The alternative is to automate the process of vectorization by using vectorizing compilers. This paper evaluates how well compilers vectorize a synthetic benchmark consisting of 151 loops, two application from Petascale Application Collaboration Teams (PACT), and eight applications from Media Bench II. We evaluated three compilers: GCC (version 4.7.0), ICC (version 12.0) and XLC (version 11.01). Our results show that despite all the work done in vectorization in the last 40 years 45-71% of the loops in the synthetic benchmark and only a few loops from the real applications are vectorized by the compilers we evaluated.

查看译文

关键词

high level language,vector instruction,vector unit,vector code,vectorizing compilers,assembly language,media bench ii,high performance,compilers vectorize,petascale application collaboration teams,synthetic benchmark,vectors,kernel,benchmark testing,layout,media

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要