S² Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

Jianlei Yang, Wenzhi Fu, Xingzhou Cheng, Xucheng Ye, Pengcheng Dai, Weisheng Zhao

IEEE Transactions on Computers (2022)

Abstract

Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks. However, executing CNNs requires a large amount of computing resources and generates heavy memory traffic, which poses a severe challenge for computing system design. By optimizing parallel execution and data reuse in convolution, systolic architectures demonstrate great advantages in accelerating CNN computations. However, the regular internal data transmission paths of traditional systolic architectures prevent them from fully leveraging the benefits of neural network sparsity. Deployment of fine-grained sparsity on existing systolic architectures is greatly hindered by the computational overheads it incurs. In this work, we propose S² Engine, a novel systolic architecture that can fully exploit the sparsity in CNNs with maximized data reuse. S² Engine transmits compressed data internally and allows each processing element to dynamically select aligned data from the compressed dataflow in convolution. Compared to the naïve systolic array, S² Engine achieves about 3.2× and about 3.0× improvements in speed and energy efficiency, respectively.
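The idea of selecting aligned data from compressed streams can be illustrated with a small software sketch. This is not the hardware design described in the paper: the `(index, value)` compression format and the names `compress` and `sparse_dot` are assumptions made for illustration only. The sketch shows how a single processing element could, in principle, skip zero operands by multiplying only the entries whose indices match in both compressed streams.

```python
def compress(dense):
    """Keep only the nonzero entries of a vector as (index, value) pairs.

    Hypothetical compression format, chosen for illustration.
    """
    return [(i, v) for i, v in enumerate(dense) if v != 0]


def sparse_dot(features, weights):
    """Dot product over two compressed streams.

    Walks both streams in index order and performs a multiply-accumulate
    only when the indices align. Returns (result, number_of_macs).
    """
    acc = 0
    macs = 0
    i = j = 0
    while i < len(features) and j < len(weights):
        f_idx, f_val = features[i]
        w_idx, w_val = weights[j]
        if f_idx == w_idx:
            # Aligned pair: both operands are nonzero at this position.
            acc += f_val * w_val
            macs += 1
            i += 1
            j += 1
        elif f_idx < w_idx:
            i += 1  # feature entry has no nonzero weight partner
        else:
            j += 1  # weight entry has no nonzero feature partner
    return acc, macs


feature = [0, 3, 0, 0, 5, 0, 2, 0]
weight = [1, 4, 0, 0, 0, 0, 6, 0]
result, macs = sparse_dot(compress(feature), compress(weight))
print(result, macs)  # 24 2
```

In this toy input, a dense dot product would issue 8 multiply-accumulates, while the compressed version issues only 2. Real speedups depend on the sparsity of the actual feature maps and weights and on hardware details that this sketch does not model.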
