HALO-CAT: A Hidden Network Processor with Activation-Localized CIM Architecture and Layer-Penetrative Tiling
CoRR(2023)
摘要
To address the 'memory wall' problem in NN hardware acceleration, we
introduce HALO-CAT, a software-hardware co-design optimized for Hidden Neural
Network (HNN) processing. HALO-CAT integrates Layer-Penetrative Tiling (LPT)
for algorithmic efficiency, reducing intermediate result sizes. Furthermore,
the architecture employs an activation-localized computing-in-memory approach
to minimize data movement. This design significantly enhances energy
efficiency, achieving a 14.2x reduction in activation memory capacity and a
17.8x decrease in energy consumption, with only a 1.5% loss in accuracy,
compared to traditional HNN processors.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要