Neuroimaging Registration on GPU: Energy-Aware Acceleration.

Francisco Nurudín Álvarez, José Antonio Cabrera, Juan Francisco Chico,Jesús Pérez,Manuel Ujaldon

BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2016)(2016)

引用 1|浏览12
暂无评分
摘要
We present a CUDA implementation for Kepler and Maxwell GPU generations of neuroimaging registration based on the NiftyReg open-source library [1]. A wide number of strategies are deployed to accelerate the code, providing insightful guidelines to exploit the massive parallelism and memory hierarchy within emerging GPUs. Our efforts are analyzed from different perspectives: Acceleration, numerical accuracy, power consumption and energy efficiency, to identify potential scenarios where performance per watt can be optimal in large-scale biomedical applications. Experimental results suggest that parallelism and arithmetic intensity represent the most rewarding ways on the road to high performance bioinformatics when power is a major concern.
更多
查看译文
关键词
Shared Memory, Global Memory, Memory Hierarchy, Loop Unroll, Unify Memory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要