Enhancing Matrix Multiplication with a Monolithic 3D Based Scratchpad Memory

IEEE Embedded Systems Letters(2021)

引用 6|浏览8
暂无评分
摘要
Convolutional neural networks (CNNs) are one of the most popular machine learning algorithms. The convolutional layers, which account for the most execution time of CNNs, are implemented with matrix multiplication because the convolution operation performs dot products between filters and local regions of the input. On the other hand, GPUs with thousands of cores were proven to significantly accel...
更多
查看译文
关键词
Two dimensional displays,Graphics processing units,Random access memory,System performance,Decoding,Energy consumption,Transistors
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要