Achieving adversarial robustness via sparsity

Machine Learning (2021)

Abstract
Network pruning is known to produce compact models without much accuracy degradation. However, how the pruning process affects a network’s robustness, and the mechanism behind that effect, remain unresolved. In this work, we theoretically prove that the sparsity of network weights is closely associated with model robustness. Through experiments on a variety of adversarial pruning methods, image-classification models, and datasets, we find that weight sparsity does not hurt robustness but improves it, and that both weight inheritance from the lottery ticket and adversarial training improve model robustness in network pruning. Based on these findings, we propose a novel adversarial training method called inverse weights inheritance, which imposes a sparse weight distribution on a large network by inheriting weights from a small network, thereby improving the robustness of the large network.
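To make the notion of weight sparsity concrete, here is a minimal sketch of generic magnitude pruning in NumPy. It is not the paper's pruning procedure or its inverse weights inheritance method; the function names `magnitude_prune` and `sparsity_of`, the random weight matrix, and the 50% sparsity level are all assumptions chosen for illustration.

```python
import numpy as np


def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of entries with the smallest magnitude.

    Hypothetical helper for illustration; returns (pruned_weights, keep_mask).
    """
    if not 0.0 <= sparsity < 1.0:
        raise ValueError("sparsity must be in [0, 1)")
    k = int(round(sparsity * weights.size))  # number of entries to drop
    mask = np.ones(weights.size, dtype=bool)
    if k > 0:
        magnitudes = np.abs(weights).ravel()
        # argpartition places the k smallest magnitudes in the first k slots.
        drop = np.argpartition(magnitudes, k - 1)[:k]
        mask[drop] = False
    mask = mask.reshape(weights.shape)
    return weights * mask, mask


def sparsity_of(weights):
    """Fraction of entries that are exactly zero."""
    return float(np.mean(weights == 0))


# Assumed toy layer: a 4x5 weight matrix drawn from a standard normal.
rng = np.random.default_rng(0)
w = rng.normal(size=(4, 5))
pruned, mask = magnitude_prune(w, 0.5)
print("sparsity after pruning:", sparsity_of(pruned))
```

The returned mask records which weights survive, which is the kind of information a sparse small network could carry over when its weights are loaded into a larger one; how the paper actually performs that transfer is not specified in the abstract.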
Keywords
Adversarial learning, Neural network pruning, Robustness, Sparsity