Sparse ReRAM engine: joint exploration of activation and weight sparsity in compressed neural networks
Proceedings of the 46th International Symposium on Computer Architecture, pp. 236-249, 2019.
Exploiting model sparsity to reduce ineffectual computation is a commonly used approach to achieve energy efficiency for DNN inference accelerators. However, due to the tightly coupled crossbar structure, exploiting sparsity for ReRAM-based NN accelerator is a less explored area. Existing architectural studies on ReRAM-based NN accelerato...More
Full Text (Upload PDF)
PPT (Upload PPT)