2 SoC in 16nm FinFET technology targeting"/>

A 16nm 25mm2 SoC with a 54.5x Flexibility-Efficiency Range from Dual-Core Arm Cortex-A53 to eFPGA and Cache-Coherent Accelerators

2019 Symposium on VLSI Circuits(2019)

引用 23|浏览142
暂无评分
摘要
This paper presents a 25mm 2 SoC in 16nm FinFET technology targeting flexible acceleration of compute intensive kernels in DNN, DSP and security algorithms. The SoC includes an always-on sub-system, a dual-core Arm A53 CPU cluster, an embedded FPGA array, and a quad-core cache-coherent accelerator cluster. Measurement results demonstrate the following observations: 1) moving DSP/cryptography kernels from A53 to eFPGA increases energy efficiency between 5.5× - 28.9×, 2) the use of cache coherency for datapath accelerators increases throughput by 2.94×, and 3) accelerator flexibility-efficiency (GOPS/W) range spans from 3.1× (A53+S1MD), to 16.5× (eFPGA), to 54.5× (CCA) compared to the dual-core CPU baseline on comparable tasks. The energy per inference on MobileNet-128 CNN shows a peak improvement of 47.6×.
更多
查看译文
关键词
Dual-core Arm Cortex-A53,cache-coherent accelerators,flexible acceleration,compute intensive kernels,security algorithms,A53 CPU cluster,embedded FPGA array,quad-core cache-coherent accelerator cluster,cache coherency,datapath accelerators,dual-core CPU baseline,SoC,FinFET technology,energy efficiency,flexibility-efficiency range,size 16.0 nm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要