A New Approach to Image Classification Dataset Privacy-Preserving with Deep Neural Network

Toan Pham Van, Thanh Nguyen Tung,Linh Doan Bao, Duc Tran Trung, Quang Nguyen Hung,Thanh Ta Minh

Advanced Theory and Applications of Engineering Systems Under the Framework of Industry 4.0（2023）

引用 0|浏览9

暂无评分

摘要

Deep learning technology has made great achievements in some fields such as computer vision, natural language understanding, speech processing, etc. However, a big challenge when using deep learning models is the require a large amount of data. Meanwhile, most of the data belong to organizations or personalities that cannot be public due to privacy. In this paper, we propose a new approach for data publishers to conceal the original dataset but still ensure the features for training deep learning models. Our framework has three advantages. First, we proposed a neural network as a encoder to hide a full-size color image within a noisy image of the same size. The noisy images make it impossible to guessable the contents of the original dataset inside. Second, that method is privacy-preserving as the encode method can’t be inverted without having an original dataset. Third, we demonstrate that our framework is accuracy-preserving because the encoded image still keeps features for the image classification task. All of our experiments based on a trade-off between two factors: the number of features retained (information loss), and the difficulty to invert the original dataset (privacy loss). Our code and pre-trained models are available at: https://github.com/sun-asterisk-research/privacy-dataset-research .

查看译文

关键词

Dataset privacy-preserving, Image classification, Deep neural network

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要