PETRA: A 22nm 6.97TFLOPS/W AIB-Enabled Configurable Matrix and Convolution Accelerator Integrated with an Intel Stratix 10 FPGA

2021 Symposium on VLSI Circuits(2021)

引用 2|浏览9
暂无评分
摘要
PETRA is a configurable FP16 matrix multiplication and convolution accelerator designed to be 2.5D integrated using Advanced Interface Bus (AIB). PETRA is built upon four 16×16 systolic arrays, but it employs a configurable H-tree accumulation to improve both the latency and the utilization by up to 8×. A 22nm 3.04mm 2 PETRA prototype provides 1.433TFLOPS in computing matrix-matrix multiplication (MMM) and convolution (conv) at 0.88V, and it achieves a 6.97TFLOPS/W peak efficiency at 0.7V. PETRA is integrated with an Intel Stratix 10 FPGA in a multi-chip package (MCP) to provide the flexibility of FPGA and the performance and efficiency of PETRA.
更多
查看译文
关键词
multichip package,matrix-matrix multiplication,AIB,configurable FP16 matrix multiplication,PETRA prototype,configurable H-tree accumulation,16×16 systolic arrays,Advanced Interface Bus,convolution accelerator,Intel Stratix 10 FPGA,size 22.0 nm,voltage 0.7 V,voltage 0.88 V
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要