Towards Scalable and Efficient FPGA Stencil Accelerators

High Performance Embedded Architectures and Compilers (2016)

Citations: 23 | Views: 69
Abstract
In this paper, we propose a design template for stencil computations targeting FPGA-based accelerators. The goal of our design is to provide scalable, high-throughput designs that can efficiently process iterative stencil programs with large size parameters, i.e., those whose data footprint is too large to fit on-chip. Our context is the use of FPGAs as accelerators attached to CPUs. Minimizing the area is not our primary goal. We propose a family of architectures based on hierarchical tiling, where the inner tiling is used to build coarse-grain data-path operators, increasing computational throughput, and the outer tiling is used to control the memory requirement, specifically data transfers to/from the accelerator. We present preliminary results for Jacobi-style stencils on 1D and 2D data, and are working on fully automating the flow.
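
To make the two tiling levels concrete, here is a minimal software sketch of a 1D Jacobi-style stencil. It is not the architecture from the paper: the abstract does not say how tiles are shaped or scheduled, so this sketch assumes a simple overlapped (halo) scheme, a 3-point averaging stencil with fixed boundary values, and hypothetical names (`jacobi_reference`, `inner_operator`, `jacobi_tiled`, `outer`, `inner`). The outer tile bounds how much data is loaded into a local buffer at once, which stands in for on-chip memory, and the inner tile stands in for one invocation of a coarse-grain data-path operator.

```python
def jacobi_reference(a, steps):
    """Untiled 3-point Jacobi: endpoints stay fixed, interior points are averaged."""
    cur = list(a)
    for _ in range(steps):
        nxt = cur[:]
        for i in range(1, len(cur) - 1):
            nxt[i] = (cur[i - 1] + cur[i] + cur[i + 1]) / 3.0
        cur = nxt
    return cur


def inner_operator(cur, nxt, start, stop):
    """Inner tile: update a small block of points in one call (assumed operator shape)."""
    for i in range(start, stop):
        nxt[i] = (cur[i - 1] + cur[i] + cur[i + 1]) / 3.0


def jacobi_tiled(a, steps, outer=8, inner=4):
    """Two-level tiling with an overlapped halo (an assumption, not the paper's scheme)."""
    n = len(a)
    out = list(a)
    for lo in range(0, n, outer):
        hi = min(lo + outer, n)
        # Outer tile: load the tile plus a halo of `steps` points on each side,
        # clamped to the array, so all time steps can run on the local buffer.
        left = max(0, lo - steps)
        right = min(n, hi + steps)
        cur = list(a[left:right])
        for _ in range(steps):
            nxt = cur[:]
            # Sweep the local buffer in blocks of `inner` points.
            for s in range(1, len(cur) - 1, inner):
                inner_operator(cur, nxt, s, min(s + inner, len(cur) - 1))
            cur = nxt
        # Only the tile's own points are written back; halo results are discarded.
        out[lo:hi] = cur[lo - left:hi - left]
    return out


if __name__ == "__main__":
    data = [float((i * i) % 7) for i in range(37)]
    print(jacobi_tiled(data, steps=5) == jacobi_reference(data, steps=5))
```

In this sketch, each outer tile transfers `outer + 2 * steps` points to compute `outer` results over `steps` iterations, so a larger `outer` lowers redundant transfer at the cost of a larger local buffer. That is the kind of memory trade-off the outer tiling level is meant to control.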
Keywords
efficient FPGA stencil accelerators, scalable