DLUX: a LUT-based Near-Bank Accelerator for Data Center Deep Learning Training Workloads

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems(2021)

引用 27|浏览89
暂无评分
摘要
The frequent data movement between the processor and the memory has become a severe performance bottleneck for deep neural network (DNN) training workloads in data centers. To solve this off-chip memory access challenge, the 3-D stacking processing-in-memory (3D-PIM) architecture provides a viable solution. However, existing 3D-PIM designs for DNN training suffer from the limited memory bandwidth ...
更多
查看译文
关键词
Training,Table lookup,Random access memory,Bandwidth,Layout,Three-dimensional displays
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要