Fragmented Huffman-Based Compression Methodology for CNN Targeting Resource-Constrained Edge Devices

Chandrajit Pal, Sunil Pankaj, Wasim Akram,Dwaipayan Biswas,Govardhan Mattela,Amit Acharyya

Circuits, Systems, and Signal Processing（2022）

引用 0|浏览6

暂无评分

摘要

In this paper, we introduce a fragmented Huffman compression methodology for compressing convolution neural networks executing on edge devices. Present scenario demands deployment of deep networks on edge devices, since application needs to adhere to low latency, enhanced security and long-term cost effectiveness. However, the primary bottleneck lies in the expanded memory footprint on account of the large size of the neural net models. Existing software implementation of deep compression strategies do exist, where Huffman compression is applied on the quantized weights, reducing the deep neural network model size. However, there is a further possibility of compression in memory footprint from a hardware design perspective in edge devices, where our proposed methodology can be complementary to the existing strategies. With this motivation, we proposed a fragmented Huffman coding methodology, that can be applied to the binary equivalent of the numeric weights of a neural network model stored in device memory. Subsequently, we also introduced the static and dynamic storage methodology on device memory space which is left behind even after storing the compressed file, that led to a big reduction in area and energy consumption of approximately 38

查看译文

关键词

Fragmented Huffman encoding (FHE),Hierarchical data format (HDF5),Residual network (ResNet),Common objects in context (COCO),Deep neural network (DNN),Convolution neural network (CNN)

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要