Domain-Specific Optimization and Generation of High-Performance GPU Code for Stencil Computations.

Proceedings of the IEEE(2018)

引用 51|浏览101
暂无评分
摘要
Stencil computations arise in a number of computational domains. They exhibit significant data parallelism and are thus well suited for execution on graphical processing units (GPUs), but can be memory-bandwidth limited unless temporal locality is utilized via tiling. This paper describes how effective tiled code can be generated for GPUs from a domain-specific language (DSL) for stencils. Experim...
更多
查看译文
关键词
Graphics processing units,Optimization,Instruction sets,High performance computing,Bandwidth,Media streaming,Parallel processing,Resource management
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要