Image Classification Adversarial Example Defense Method Based on Conditional Diffusion Model
Abstract: Deep learning models have achieved impressive results in fields such as image classification; however, they are vulnerable to adversarial examples. By carefully designing small perturbations with adversarial-example generation algorithms, attackers can construct inputs that are visually indistinguishable from clean images yet cause the model to misclassify, posing serious security risks to deep learning applications such as image classification. To improve the robustness of image classification models, this paper proposes an adversarial-example defense method based on a conditional diffusion model that combines adversarial-example detection and purification: adversarial examples are detected and purified without modifying the target model. The method consists of two modules. For detection, an inconsistency-enhancement technique is used: an image restoration model that fuses the target model's high-dimensional features with basic image features is trained, and adversarial examples are detected by comparing the inconsistency between the original input and its restored output. For purification, an end-to-end scheme is adopted that introduces image artifacts during the denoising process of the diffusion model. While preserving the target model's accuracy, the detection and purification modules are placed in front of the target model, and a purification strategy is selected according to the detection result, removing adversarial perturbations and improving robustness. The method is compared with five existing methods on the CIFAR10 and CIFAR100 datasets. Experimental results show that, for adversarial examples with small perturbations, the proposed method improves detection accuracy by 5 to 9 percentage points over Argos; compared with Adaptive Denoising Purification (ADP), its defense performance is more stable across different types of adversarial examples, and under Backward Pass Differentiable Approximation (BPDA) attacks its purification effectiveness is 1.3 percentage points higher than that of ADP.
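As a rough illustration of the detect-then-purify pipeline summarized in the abstract, the following PyTorch-style sketch shows how inconsistency-based detection can gate diffusion-based purification in front of a frozen classifier. It is a minimal reading of the abstract, not the authors' implementation: the restoration network `restorer`, the conditional diffusion denoiser `denoiser` (and its `q_sample`/`reverse` interface), the classifier `target`, and the threshold `tau` are all hypothetical placeholders.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def detect_adversarial(x, restorer, tau):
    """Inconsistency-enhancement detection (sketch).

    `restorer` stands in for the paper's image restoration model, which
    fuses the target model's high-dimensional features with basic image
    features. Clean inputs are reconstructed almost exactly, while
    adversarial perturbations are not reproduced, so the distance
    between the input and its restoration grows.
    """
    x_rec = restorer(x)                                  # restored output
    score = F.mse_loss(x_rec, x, reduction="none").flatten(1).mean(dim=1)
    return score > tau                                   # per-image flag

@torch.no_grad()
def purify(x, denoiser, t_star=100):
    """End-to-end purification (sketch).

    `denoiser.q_sample` / `denoiser.reverse` are assumed interfaces for
    the forward (noising) and reverse (denoising) passes of a conditional
    diffusion model: noise added up to step `t_star` swamps the
    adversarial perturbation, and the reverse process recovers a clean
    image.
    """
    noise = torch.randn_like(x)
    x_t = denoiser.q_sample(x, t_star, noise)            # forward diffusion
    return denoiser.reverse(x_t, t_star)                 # reverse denoising

@torch.no_grad()
def defended_predict(x, restorer, denoiser, target, tau):
    """Detection and purification placed before the frozen target model."""
    is_adv = detect_adversarial(x, restorer, tau)
    x_clean = x.clone()
    if is_adv.any():                                     # purify flagged inputs only
        x_clean[is_adv] = purify(x[is_adv], denoiser)
    return target(x_clean).argmax(dim=1)                 # target model unchanged
```

Gating purification on the detection result mirrors the design described in the abstract: inputs that pass detection reach the classifier unchanged, preserving clean-data accuracy, while only flagged inputs pay the cost of diffusion-based purification.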
Authors: CHEN Zimin; GUAN Zhitao (School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China)
Source: Computer Engineering (《计算机工程》), 2024, No. 12, pp. 296-305 (10 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (62372173).
Keywords: adversarial defense; adversarial example detection; adversarial purification; diffusion model; image denoising