A Translation Framework for Virtual Execution Environment on CPU/GPU Architecture

Guoxing Dong, Kai Chen, Erzhou Zhu, Yichao Zhang, Zhengwei Qi, Haibing Guan

Parallel Architectures, Algorithms and Programming (2010)

Abstract

GPUs are many-core processors with tremendous computational power. However, because automatic parallelization has not yet been realized, developing high-performance parallel code for GPUs remains very challenging. This paper presents a novel translation framework designed for a virtual execution environment based on a CPU/GPU architecture. It addresses two major challenges in using general-purpose computation on graphics processing units (GPGPU) to improve performance: avoiding a rewrite of existing source code and resolving binary compatibility issues between different GPUs. The translation framework uses semi-automatic parallelization to port existing code to explicitly parallel programming models. It not only offers a mapping strategy from the x86 platform to the CUDA programming model, but also synchronizes execution between the CPU and the GPUs. The input to the translation framework is the parallelizable part of the program in binary form. Given additional information about that parallelizable part, the framework transforms the sequential code into PTX code and executes it on GPUs. Experimental results on several programs from the CUDA SDK Code Samples and the Parboil Benchmark Suite show that the framework can achieve very high performance, with speedups of up to several tens of times over the native x86 version.
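
The abstract does not spell out the mapping strategy, so the following is only a minimal sketch of the general idea behind moving a sequential loop onto a CUDA-style thread grid, emulated on the CPU in Python. It assumes the common convention that loop iteration i is handled by the thread with global index block_idx * block_dim + thread_idx. The SAXPY loop and the names `saxpy_sequential`, `saxpy_kernel`, and `launch` are hypothetical and chosen for illustration; the paper's framework works on x86 binary code and emits PTX, which this sketch does not attempt.

```python
def saxpy_sequential(a, x, y):
    """Reference loop, standing in for the sequential x86 code."""
    out = [0.0] * len(x)
    for i in range(len(x)):
        out[i] = a * x[i] + y[i]
    return out


def saxpy_kernel(block_idx, thread_idx, block_dim, a, x, y, out):
    """Body executed by one emulated GPU thread (hypothetical kernel)."""
    i = block_idx * block_dim + thread_idx  # global index, CUDA convention
    if i < len(x):  # guard: the last block may be only partly used
        out[i] = a * x[i] + y[i]


def launch(a, x, y, block_dim=4):
    """Emulate a kernel launch on the CPU by visiting every thread in turn."""
    n = len(x)
    grid_dim = (n + block_dim - 1) // block_dim  # ceil(n / block_dim)
    out = [0.0] * n
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            saxpy_kernel(block_idx, thread_idx, block_dim, a, x, y, out)
    return out
```

Because every loop iteration is independent, the emulated launch returns the same result as the sequential loop for any block size. On a real GPU the two nested loops in `launch` would disappear and the threads would run concurrently, with the CPU waiting for the kernel to finish before reading the output.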

Keywords

computer graphic equipment, coprocessors, multiprocessing systems, parallel architectures, CPU-GPU Architecture, CUDA SDK code samples, CUDA programming model, Parboil benchmark suite, X86 platform, many core processors, virtual execution environment, CUDA, GPGPU, Parallelization, Translator
