Fingerprinting Deep Neural Networks - A Deepfool Approach

2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS)(2021)

引用 19|浏览10
暂无评分
摘要
A well-trained deep learning classifier is an expensive intellectual property of the model owner. However, recently proposed model extraction attacks and reverse engineering techniques make model theft possible and similar quality deep learning solution reproducible at a low cost. To protect the interest and revenue of the model owner, watermarking on Deep Neural Network (DNN) has been proposed. However, the extra components and computations due to the embedded watermark tend to interfere with the model training process and result in inevitable degradation in classification accuracy. In this paper, we utilize the geometry characteristics inherited in the DeepFool algorithm to extract data points near the classification boundary of the target model for ownership verification. As the fingerprint is extracted after the training process has been completed, the original achievable classification accuracy will not be compromised. This countermeasure is founded on the hypothesis that different models possess different classification boundaries determined solely by the hyperparameters of the DNN and the training it has undergone. Therefore, given a set of fingerprint data points, a pirated model or its post-processed version will produce similar prediction but another originally designed and trained DNN for the same task will produce very different prediction even if they have similar or better classification accuracy. The effectiveness of the proposed Intellectual Property (IP) protection method is validated on the CIFAR-10, CIFAR-100 and ImageNet datasets. The results show a detection rate of 100% and a false positive rate of 0% for each dataset. More importantly, the fingerprint extraction and its run time are both dataset independent. It is on average similar to 130x faster than two state-of-the-art fingerprinting methods.
更多
查看译文
关键词
state-of-the-art fingerprinting methods,fingerprint extraction,Intellectual Property protection method,different prediction,originally designed trained DNN,similar prediction,post-processed version,pirated model,fingerprint data points,different classification boundaries,original achievable classification accuracy,classification boundary,DeepFool algorithm,model training process,embedded watermark,Deep Neural Network,similar quality deep learning solution reproducible,model theft,engineering techniques,recently proposed model extraction attacks,model owner,expensive intellectual property,deep learning classifier,DeepFool approach,fingerprinting Deep Neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要