Buffer Zone based Defense against Adversarial Examples in Image Classification

(2021)

Abstract
Recent defenses published at venues such as NIPS, ICML, ICLR, and CVPR are mainly focused on mitigating white-box attacks. These defenses do not properly consider adaptive adversaries. In this paper, we expand the scope of these defenses to include adaptive black-box adversaries. Based on our study of these defenses, we develop three contributions. First, we propose a new metric for evaluating adversarial robustness when clean accuracy is impacted. Second, we create an enhanced adaptive black-box attack. Third, and most significantly, we develop a novel defense against these adaptive black-box attacks. Our defense is based on a combination of deep neural networks and simple image transformations. While straightforward to implement, this defense yields a unique security property which we term buffer zones. We argue that our defense based on buffer zones offers significant improvements over state-of-the-art defenses. We verify our claims through extensive experimentation. Our results encompass three adversarial models (10 different black-box attacks) on 11 defenses with two datasets (CIFAR-10 and Fashion-MNIST).
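The abstract does not spell out the construction, but the described combination of deep neural networks and simple image transformations suggests an ensemble in which each network classifies its own transformed copy of the input, and inputs on which the networks disagree fall into a "buffer zone" and are flagged as adversarial. The sketch below illustrates that idea only; the class name `BufferZoneEnsemble`, the unanimity voting rule, and the `ADVERSARIAL` flag are illustrative assumptions, not the paper's exact construction.

```python
# Minimal sketch of a buffer-zone style defense (assumed construction, not
# the paper's exact method): an ensemble of classifiers, each preceded by
# its own fixed image transformation, that abstains on disagreement.
import numpy as np

ADVERSARIAL = -1  # label returned when the ensemble disagrees

class BufferZoneEnsemble:
    def __init__(self, models, transforms):
        # models: callables mapping an image array to class logits
        # transforms: one fixed image transformation per model
        assert len(models) == len(transforms)
        self.models = models
        self.transforms = transforms

    def predict(self, image):
        # Each network classifies its own transformed copy of the input.
        votes = [
            int(np.argmax(model(t(image))))
            for model, t in zip(self.models, self.transforms)
        ]
        # Unanimous agreement -> accept the label; any disagreement means
        # the input sits in the buffer zone between the networks' decision
        # regions and is flagged as adversarial.
        if len(set(votes)) == 1:
            return votes[0]
        return ADVERSARIAL
```

Under this reading, a black-box attacker must craft a perturbation that moves the input across the decision boundaries of every transformed network simultaneously, which is what would make the buffer zones a meaningful security property.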
Keywords
Robustness (computer science), Buffer zone, Contextual image classification, Computer security, Computer science, Adversarial system, Deep neural networks