Discrete Model Compression With Resource Constraint for Deep Neural Networks

CVPR, pp. 1896-1905, 2020.

DOI: https://doi.org/10.1109/CVPR42600.2020.00197

Abstract:

In this paper, we address the problem of compression and acceleration of Convolutional Neural Networks (CNNs). Specifically, we propose a novel structural pruning method to obtain a compact CNN with strong discriminative power. To find such networks, we propose an efficient discrete optimization method to directly optimize chann…

Introduction
  • Convolutional Neural Networks (CNNs) have achieved great success in computer vision tasks [23, 39, 40, 42, 2].
  • With increasingly sophisticated GPU support for CNNs, network complexity has grown dramatically from several layers [23, 43] to hundreds of layers [9, 17]
  • Although these complex CNNs achieve strong performance on vision tasks, they bring an unavoidable growth in computational cost and model parameters.
  • Many efforts [8, 7] have been devoted to obtaining compact sub-networks from the original, computationally heavy model
Highlights
  • Convolutional Neural Networks (CNNs) have achieved great success in computer vision tasks [23, 39, 40, 42, 2]
  • We have introduced the core idea of our method, and the discrete model compression (DMC) algorithm is presented in Algorithm 1
  • For ResNet-34, our method can prune 43.4% of the floating-point operations (FLOPs) while causing only 0.73% and 0.31% drops in Top-1 and Top-5 accuracy, respectively
  • Although FPGM prunes slightly fewer FLOPs than our method (41.1% vs. 43.3%), it causes larger damage to the final performance (0.56% worse on Top-1 accuracy)
  • We proposed an effective discrete model compression method to prune Convolutional Neural Networks given certain resource constraints
  • To further enlarge the space, we introduced symmetric weight decay on the gate parameters, inspired by the fact that a regularization loss can be regarded as weight decay; a minimal sketch of such a gate is given after this list
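    As referenced above, the central ingredient of DMC is a discrete (0/1) channel gate whose on/off decision is sampled rather than fixed, with gradients passed back through a straight-through estimator [1]. The sketch below is a minimal illustration of that idea only; the class and parameter names (StochasticChannelGate, scores) are our own, and it omits the resource constraint, the symmetric weight decay on the gate parameters, and the other steps of the authors' Algorithm 1.

```python
import torch
import torch.nn as nn

class StochasticChannelGate(nn.Module):
    """Illustrative stochastic discrete gate: one learnable score per channel is
    mapped to a keep-probability, a hard 0/1 decision is sampled from it, and the
    straight-through estimator [1] lets gradients flow back to the scores.
    Names and details are assumptions, not the authors' released implementation."""

    def __init__(self, num_channels: int):
        super().__init__()
        self.scores = nn.Parameter(torch.zeros(num_channels))  # gate parameters

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        keep_prob = torch.sigmoid(self.scores)        # in (0, 1)
        hard = torch.bernoulli(keep_prob).detach()    # discrete 0/1 sample per channel
        # Straight-through estimator: forward pass uses the hard 0/1 sample,
        # backward pass uses the gradient of keep_prob.
        gate = hard + keep_prob - keep_prob.detach()
        return x * gate.view(1, -1, 1, 1)             # mask channels of an NCHW feature map
```

    Because the gate output is exactly 0 or 1, a gated forward pass equals the forward pass of the corresponding pruned sub-network, which is the "exact estimation of sub-networks' outputs" mentioned in the conclusion.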
Methods
  • The authors' method outperforms DCP [49] by 0.38% on ∆-Acc with the same pruned FLOPs, and it outperforms DCP-adapt by 0.06% given similar pruned FLOPs. Collaborative channel pruning [37] is one of the most recent works on channel pruning; it considers the correlation between different weights when applying a Taylor expansion to the loss function (a generic sketch of such a Taylor-based importance score is given after this list).
  • For MobileNetV2, the method outperforms DCP by 0.04% on ∆-Acc while pruning 14% more FLOPs than DCP
  • This shows that a global discrimination-aware criterion works better than a local one.
  • Compared methods: IE [34], FPGM [12], GAL [26], DCP [49], CCP [37], DMC (this paper), and Rethinking [46]
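    Several of the compared baselines (e.g., IE [34] and CCP [37]) score channels with a Taylor expansion of the loss, as mentioned above. The snippet below is a generic first-order version of such a score for a per-channel scaling factor; it is only a rough illustration, and the exact criteria in [34] and [37] differ (CCP in particular also models correlations between channels).

```python
import torch

def first_order_taylor_importance(gamma: torch.Tensor, loss: torch.Tensor) -> torch.Tensor:
    """Generic first-order Taylor importance for per-channel scaling factors `gamma`
    (e.g., BatchNorm weights): |dL/dgamma * gamma| approximates the loss change when
    a channel is zeroed out. Illustrative only; not the exact criterion of [34]/[37]."""
    (grad,) = torch.autograd.grad(loss, gamma, retain_graph=True)
    return (grad * gamma).abs()
```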
Results
  • Results on ImageNet

    In Tab. 2, the authors present all the comparison results on ImageNet. All results are taken from the original papers except for ThiNet on MobileNetV2.
  • DCP [49], CCP [37], IE [34], FPGM [12] and GAL [26] are from this category.
  • Such high-quality baselines help to better understand the benefit of the discrete channel setting.
  • Although FPGM prunes slightly fewer FLOPs than the method (41.1% vs. 43.3%), it causes larger damage to the final performance (0.56% worse on Top-1 accuracy)
Conclusion
  • The authors proposed an effective discrete model compression method to prune CNNs given certain resource constraints (a FLOPs-accounting sketch for such a constraint is given after this list).
  • By turning the deterministic discrete gate into a stochastic discrete gate, the method can explore a larger search space of sub-networks.
  • To further enlarge the space, the authors introduced symmetric weight decay on the gate parameters, inspired by the fact that a regularization loss can be regarded as weight decay.
  • The authors' method benefits from an exact estimation of sub-networks' outputs thanks to the combination of the precise placement of gates and the discrete setting.
  • Extensive experimental results on ImageNet and CIFAR-10 show that the method outperforms state-of-the-art methods
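    The resource constraint in the conclusion is typically expressed as a FLOPs budget. Since removing output channels of one convolution also shrinks the input channels of the next, the FLOPs of a candidate sub-network follow directly from the kept-channel counts; the short sketch below uses the standard convolution FLOPs formula with illustrative layer shapes (not shapes from the paper).

```python
def conv_flops(c_in: int, c_out: int, k: int, h_out: int, w_out: int) -> int:
    # Multiply-accumulate count of a standard k x k convolution.
    return c_in * c_out * k * k * h_out * w_out

# Illustrative two-layer example: keeping only 64 of 128 output channels in the
# first layer also removes half of the second layer's input channels.
full = conv_flops(64, 128, 3, 32, 32) + conv_flops(128, 256, 3, 16, 16)
pruned = conv_flops(64, 64, 3, 32, 32) + conv_flops(64, 256, 3, 16, 16)
print(f"FLOPs removed: {1.0 - pruned / full:.1%}")  # 50.0%
```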
Tables
  • Table 1: Comparison results on the CIFAR-10 dataset with ResNet-56 and MobileNetV2. ∆-Acc represents the performance change before and after model pruning; +/- indicates an increase or decrease compared to the baseline results (a worked example is given after this list). WM represents the width multiplier used in the original design of MobileNetV2; this result is from the DCP [49] paper
  • Table 2: Comparison results on the ImageNet dataset with ResNet-34, ResNet-50, ResNet-101 and MobileNetV2. ∆-Acc represents the performance change before and after model pruning; +/- indicates an increase or decrease compared to the baseline results. The ThiNet on MobileNetV2 results are from the DCP [49] paper
  • Table3: Performance of pruned models given different gate settings on ImageNet
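    As a worked example of the ∆-Acc column (with illustrative numbers, not values from the tables): if a baseline model reaches 93.5% Top-1 accuracy and the pruned model reaches 93.2%, then ∆-Acc = 93.2% − 93.5% = −0.3%; a pruned model reaching 93.8% would instead give ∆-Acc = +0.3%.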
Related work
  • Model compression has recently drawn a lot of attention from the computer vision community. In general, current model compression methods can be separated into the following four categories: weight pruning, structural pruning, weight quantization, and knowledge distillation [14].

    Weight pruning eliminates model parameters without any assumption on the structure of the weights. One of the early works [8] uses L1 or L2 magnitude as the criterion to remove weights: parameters whose magnitude falls below a certain threshold are removed, since weights with small magnitude are considered unimportant. A systematic DNN weight pruning framework [48] has been proposed using the alternating direction method of multipliers (ADMM) [3, 15, 16]. Different from the aforementioned works, SNIP [24] determines the importance of weights by backpropagating gradients from the loss function. The lottery ticket hypothesis [6] states that a randomly initialized dense network contains sparse sub-networks that, when trained in isolation from their original initialization, can match the accuracy of the full network.
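    As a concrete illustration of the magnitude-based criterion described above, the sketch below zeroes out the smallest-magnitude weights of a tensor. The function name and threshold choice are our own; practical frameworks such as [8] and [48] additionally retrain the remaining weights and, in the latter case, enforce the sparsity pattern through ADMM.

```python
import torch

def magnitude_prune(weight: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Zero out roughly `sparsity` of the entries of `weight`, keeping the
    largest-magnitude ones (the basic L1-magnitude criterion). Illustrative sketch."""
    k = int(sparsity * weight.numel())
    if k == 0:
        return weight
    threshold = weight.abs().flatten().kthvalue(k).values
    mask = (weight.abs() > threshold).to(weight.dtype)
    return weight * mask

# Example: prune 80% of a convolution weight tensor by magnitude.
w = torch.randn(128, 64, 3, 3)
w_pruned = magnitude_prune(w, 0.8)
print((w_pruned == 0).float().mean())  # ≈ 0.80
```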
Funding
  • This work was partially supported by U.S. NSF IIS 1836945, IIS 1836938, IIS 1845666, IIS 1852606, IIS 1838627, and IIS 1837956
Reference
  • Yoshua Bengio, Nicholas Leonard, and Aaron Courville. Estimating or propagating gradients through stochastic neurons for conditional computation. arXiv preprint arXiv:1308.3432, 2013.
  • Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal, Lawrence D Jackel, Mathew Monfort, Urs Muller, Jiakai Zhang, et al. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316, 2016.
  • Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, Jonathan Eckstein, et al. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1):1–122, 2011.
  • Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. Binaryconnect: Training deep neural networks with binary weights during propagations. In Advances in neural information processing systems, pages 3123–3131, 2015.
  • Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 248–255, 2009.
  • Jonathan Frankle and Michael Carbin. The lottery ticket hypothesis: Finding sparse, trainable neural networks. In International Conference on Learning Representations, 2019.
  • Song Han, Huizi Mao, and William J Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149, 2015.
  • Song Han, Jeff Pool, John Tran, and William Dally. Learning both weights and connections for efficient neural network. In Advances in neural information processing systems, pages 1135–1143, 2015.
  • Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  • Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, and Yi Yang. Soft filter pruning for accelerating deep convolutional neural networks. In International Joint Conference on Artificial Intelligence (IJCAI), pages 2234–2240, 2018.
  • Yihui He, Ji Lin, Zhijian Liu, Hanrui Wang, Li-Jia Li, and Song Han. Amc: Automl for model compression and acceleration on mobile devices. In Proceedings of the European Conference on Computer Vision (ECCV), pages 784– 800, 2018.
  • Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, and Yi Yang. Filter pruning via geometric median for deep convolutional neural networks acceleration. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4340–4349, 2019.
  • Yihui He, Xiangyu Zhang, and Jian Sun. Channel pruning for accelerating very deep neural networks. In Proceedings of the IEEE International Conference on Computer Vision, pages 1389–1397, 2017.
  • Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
  • Feihu Huang, Songcan Chen, and Heng Huang. Faster stochastic alternating direction method of multipliers for nonconvex optimization. In International Conference on Machine Learning, pages 2839–2848, 2019.
  • Feihu Huang, Shangqian Gao, Jian Pei, and Heng Huang. Nonconvex zeroth-order stochastic admm methods with lower function query complexity. arXiv preprint arXiv:1907.13463, 2019.
  • Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4700–4708, 2017.
  • Zehao Huang and Naiyan Wang. Data-driven sparse structure selection for deep neural networks. In Proceedings of the European conference on computer vision (ECCV), pages 304–320, 2018.
  • Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on Machine Learning (ICML), pages 448–456. JMLR.org, 2015.
  • Jaedeok Kim, Chiyoun Park, Hyun-Joo Jung, and Yoonsuck Choe. Plug-in, trainable gate for streamlining arbitrary neural networks. CoRR, abs/1904.10921, 2019.
  • Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  • Alex Krizhevsky and Geoffrey Hinton. Learning multiple layers of features from tiny images. Technical report, Citeseer, 2009.
  • Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097–1105, 2012.
  • Namhoon Lee, Thalaiyasingam Ajanthan, and Philip HS Torr. Snip: Single-shot network pruning based on connection sensitivity. ICLR, 2019.
  • Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, and Hans Peter Graf. Pruning filters for efficient convnets. ICLR, 2017.
  • Shaohui Lin, Rongrong Ji, Chenqian Yan, Baochang Zhang, Liujuan Cao, Qixiang Ye, Feiyue Huang, and David Doermann. Towards optimal structured cnn pruning via generative adversarial learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2790–2799, 2019.
  • Hanxiao Liu, Karen Simonyan, and Yiming Yang. DARTS: Differentiable architecture search. In International Conference on Learning Representations, 2019.
  • Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, and Changshui Zhang. Learning efficient convolutional networks through network slimming. In ICCV, 2017.
  • Zhuang Liu, Mingjie Sun, Tinghui Zhou, Gao Huang, and Trevor Darrell. Rethinking the value of network pruning. In International Conference on Learning Representations, 2019.
  • Christos Louizos, Max Welling, and Diederik P. Kingma. Learning sparse neural networks through l0 regularization. In International Conference on Learning Representations, 2018.
  • Jian-Hao Luo and Jianxin Wu. Autopruner: An end-to-end trainable filter pruning method for efficient deep model inference. arXiv preprint arXiv:1805.08941, 2018.
  • Jian-Hao Luo, Hao Zhang, Hong-Yu Zhou, Chen-Wei Xie, Jianxin Wu, and Weiyao Lin. Thinet: pruning cnn filters for a thinner net. IEEE transactions on pattern analysis and machine intelligence, 2018.
  • Dmitry Molchanov, Arsenii Ashukha, and Dmitry Vetrov. Variational dropout sparsifies deep neural networks. In Proceedings of the 34th International Conference on Machine Learning-Volume 70, pages 2498–2507. JMLR. org, 2017.
  • Pavlo Molchanov, Arun Mallya, Stephen Tyree, Iuri Frosio, and Jan Kautz. Importance estimation for neural network pruning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 11264–11272, 2019.
  • Kirill Neklyudov, Dmitry Molchanov, Arsenii Ashukha, and Dmitry P Vetrov. Structured bayesian pruning via log-normal multiplicative noise. In Advances in Neural Information Processing Systems, pages 6775–6784, 2017.
  • Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. Automatic differentiation in pytorch. 2017.
  • Hanyu Peng, Jiaxiang Wu, Shifeng Chen, and Junzhou Huang. Collaborative channel pruning for deep networks. In International Conference on Machine Learning, pages 5113–5122, 2019.
  • Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, and Ali Farhadi. Xnor-net: Imagenet classification using binary convolutional neural networks. In European Conference on Computer Vision, pages 525–542. Springer, 2016.
  • Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 779–788, 2016.
  • Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems, pages 91–99, 2015.
  • Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4510–4520, 2018.
  • Karen Simonyan and Andrew Zisserman. Two-stream convolutional networks for action recognition in videos. In Advances in neural information processing systems, pages 568–576, 2014.
  • Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
  • Chaoqi Wang, Roger B. Grosse, Sanja Fidler, and Guodong Zhang. Eigendamage: Structured pruning in the Kronecker-factored eigenbasis. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, pages 6566–6575, 2019.
  • Wei Wen, Chunpeng Wu, Yandan Wang, Yiran Chen, and Hai Li. Learning structured sparsity in deep neural networks. In Advances in neural information processing systems, pages 2074–2082, 2016.
  • Jianbo Ye, Xin Lu, Zhe Lin, and James Z Wang. Rethinking the smaller-norm-less-informative assumption in channel pruning of convolution layers. In International Conference on Learning Representations, 2018.
  • Dejiao Zhang, Haozhu Wang, Mario Figueiredo, and Laura Balzano. Learning to share: Simultaneous parameter tying and sparsification in deep learning. 2018.
  • Tianyun Zhang, Shaokai Ye, Kaiqi Zhang, Jian Tang, Wujie Wen, Makan Fardad, and Yanzhi Wang. A systematic dnn weight pruning framework using alternating direction method of multipliers. In Proceedings of the European Conference on Computer Vision (ECCV), pages 184–199, 2018.
  • Zhuangwei Zhuang, Mingkui Tan, Bohan Zhuang, Jing Liu, Yong Guo, Qingyao Wu, Junzhou Huang, and Jinhui Zhu. Discrimination-aware channel pruning for deep neural networks. In Advances in Neural Information Processing Systems, pages 875–886, 2018.
  • Barret Zoph and Quoc V. Le. Neural architecture search with reinforcement learning. 2017.