Adversarial example defense algorithm for MNIST based on image reconstruction

Abstract: As deep learning becomes widely deployed, its security has drawn increasing attention. An adversarial example is produced by adding a small perturbation to an original image, which can cause a deep learning model to misclassify the image and seriously hinders the development of deep learning technology. To address this problem, the attack forms and harms of existing adversarial examples were analyzed and, given the shortcomings of existing defense algorithms, an adversarial example defense method based on image reconstruction was proposed. The method uses MNIST as the test dataset. Its core idea is image reconstruction, comprising central variance minimization and image quilting optimization. Central variance minimization processes only the central area of the image. Image quilting optimization takes the overlapping area into account when selecting patch blocks and uses half the patch size as the overlap. Adversarial examples generated with the FGSM, BIM, DeepFool and C&W attacks were used to test the defense performance of the two methods, and the results were compared with three existing image reconstruction defenses (cropping and scaling, bit depth compression and JPEG compression). The experimental results show that both proposed algorithms defend well against common adversarial example attacks. With image quilting optimization, classification accuracy exceeded 75% on the examples generated by all four attack algorithms; with central variance minimization, it was around 70%. The three reconstruction algorithms used for comparison were unstable across attack algorithms, with overall classification accuracy below 60%. These results indicate that the two proposed reconstruction defenses defend effectively against adversarial examples and outperform the compared image reconstruction algorithms.
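Two ideas from the abstract can be illustrated with a minimal sketch: bit depth compression (one of the comparison defenses) and the restriction of a reconstruction step to the central area of the image. This is not the authors' implementation. The function names `reduce_bit_depth` and `apply_to_center` and the `margin` parameter are hypothetical, and bit depth reduction stands in for the variance minimization step, which the abstract does not specify. Images are nested lists of floats in [0, 1] to keep the example dependency-free; an array library would be the usual choice in practice.

```python
def reduce_bit_depth(image, bits):
    """Quantize pixel values in [0, 1] to 2**bits evenly spaced levels."""
    if bits < 1:
        raise ValueError("bits must be at least 1")
    levels = (1 << bits) - 1  # index of the highest quantization level
    return [
        # Clamp to [0, 1], snap to the nearest level, rescale to [0, 1].
        [round(min(max(p, 0.0), 1.0) * levels) / levels for p in row]
        for row in image
    ]


def apply_to_center(image, transform, margin):
    """Apply `transform` to the central region only.

    A border of `margin` pixels on every side is copied unchanged.
    `transform` must return an image of the same size as its input.
    """
    height, width = len(image), len(image[0])
    if margin < 0 or 2 * margin >= min(height, width):
        raise ValueError("margin leaves no central region")
    # Crop the central region and process it in isolation.
    center = [row[margin:width - margin] for row in image[margin:height - margin]]
    processed = transform(center)
    # Copy the input so the caller's image is not modified.
    result = [list(row) for row in image]
    for i, row in enumerate(processed):
        result[margin + i][margin:width - margin] = row
    return result


if __name__ == "__main__":
    # A 28x28 placeholder image, the size of an MNIST digit.
    image = [[((r * 28 + c) % 10) / 10 for c in range(28)] for r in range(28)]
    # margin=4 is an arbitrary illustrative value, not taken from the paper.
    defended = apply_to_center(image, lambda x: reduce_bit_depth(x, 3), margin=4)
    print(len(defended), len(defended[0]))
```

Restricting the transform to the center reflects the observation that MNIST digits are concentrated in the middle of the 28x28 frame, so the border carries little class information.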

Authors: QIN Zhongyuan; HE Zhaoxiang; LI Tao; CHEN Liquan (School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China; Network Communication and Security Purple Mountain Laboratory, Nanjing 211189, China)

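Affiliations: School of Cyber Science and Engineering, Southeast University; Network Communication and Security Purple Mountain Laboratory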

Source: Chinese Journal of Network and Information Security (《网络与信息安全学报》), 2022, No. 1, pp. 86-94 (9 pages)

Funding: National Key Research and Development Program of China (2020YFE0200600); National Natural Science Foundation of China (61601113).

Keywords: adversarial example; image reconstruction; deep learning; image classification

Classification number: TP393 [Automation and Computer Technology: Computer Application Technology]

