应用引导积分梯度的对抗样本生成

Adversarial Sample Generation Applying Guided Integrated Gradients

下载PDF

导出

摘要给图片添加特定扰动可以生成对抗样本,误导深度神经网络输出错误结果,更加强力的攻击方法可以促进网络模型安全性和鲁棒性的研究.攻击方法分为白盒攻击和黑盒攻击,对抗样本的迁移性可以借已知模型生成结果来攻击其他黑盒模型.基于直线积分梯度的攻击TAIG-S可以生成具有较强迁移性的样本,但是在直线路径中会受噪声影响,叠加与预测结果无关的像素梯度,影响了攻击成功率.所提出的Guided-TAIG方法引入引导积分梯度,在每一段积分路径计算上采用自适应调整的方式,纠正绝对值较低的部分像素值,并且在一定区间内寻找下一步的起点,规避了无意义的梯度噪声累积.基于ImageNet数据集上的实验表明,Guided-TAIG在CNN和Transformer架构模型上的白盒攻击性能均优于FGSM、C&W、TAIG-S等方法,并且制作的扰动更小,黑盒模式下迁移攻击性能更强,表明了所提方法的有效性. Adding specific perturbations to images can help generate adversarial samples that mislead deep neural networks to output incorrect results.More powerful attack methods can facilitate research on the security and robustness of network models.The attack methods are divided into white-box and black-box attacks,and the transferability of adversarial samples can be used to attack other black-box ones by the results generated by known models.Attacks based on linear integrated gradients(TAIG-S)can generate highly transferable adversarial samples,but they are affected by noise in the linear path,superimposing pixel gradients that are irrelevant to the prediction results,which limits the success rate of attacks.With guided integrated gradients,the proposed Guided-TAIG method uses adaptive adjustment to correct some pixel values with low absolute values on each segment of the integrated path calculation and finds the starting point of the next step within a certain interval,circumventing the accumulation of meaningless gradient noise.The experiments on the ImageNet dataset show that Guided-TAIG outperforms FGSM,C&W,and TAIG-S for white-box attacks on both CNN and Transformer architecture models,produces smaller perturbations,and has better performance for transferable attacks in the black-box mode.This demonstrates the effectiveness of the proposed method.

作者王正来关胜晓 WANG Zheng-Lai;GUAN Sheng-Xiao(School of Information Science and Technology,University of Science and Technology of China,Hefei 230026,China)

机构地区中国科学技术大学信息科学技术学院

出处《计算机系统应用》 2023年第7期171-178,共8页 Computer Systems & Applications

关键词深度神经网络对抗攻击积分梯度引导路径迁移攻击 deep neural network(DNN) adversarial attack integrated gradients guided path transferable attack

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1谷凤彩.盾尾密封油脂运用与管控解读[J].中文科技期刊数据库（全文版）工程技术,2023(1):65-68.
2黄珊,盛丽兰,徐惠惠,周琴梅.引导性反馈在中医院内分泌科护士中医护理技术培训中应用的效果[J].中医药管理杂志,2023,31(10):60-62.
3朱韶华,贾春暘,刘芙蓉,张庆华,何静,郝胜菊,冯暄.Wolf-Hirschhorn综合征患儿临床特征及拷贝数变异分析[J].中国优生与遗传杂志,2023,31(4):792-795.
4上海银保监局普惠金融课题组,曹光群,金子寿,林玲,朱晔,狄菲,张吉光,陈舟,战昱静,张渝,谢莉.我国碳普惠领域的碳金融创新实践和思考[J].中国银行业,2023(4):50-53.
5王中波,唐楠.《海洋地质学》教学创新与探索——以“需求牵引”为中心[J].汕头大学学报（人文社会科学版）,2022,38(12):85-89.
6吴宏,吴学文,梅凌云,贺楚峰,范若皓.以《听力辅助技术》为案例的耳鼻咽喉头颈外科课程思政教学改革与探索[J].中国耳鼻咽喉颅底外科杂志,2023,29(3):106-109. 被引量：3
7王建荣.“引导自学法”在高中化学教学中的应用[J].试题与研究,2023(12):87-89. 被引量：1
8李超群,章琪泷,殷晋,曹明生,宋井宽.基于图片高频和对抗子空间的迁移性攻击[J].中国科技论文,2023,18(7):806-812.
9王骥,谢再秘,莫春梅.神经网络在养殖水质精准预测方面的研究进展[J].水产学报,2023,47(8):17-32. 被引量：2
10林庚右,周星宇,潘志松.基于掩膜的人脸压缩重建对抗攻击增强方法[J].计算机技术与发展,2023,33(8):88-94.

计算机系统应用

2023年第7期

浏览历史

内容加载中请稍等...

应用引导积分梯度的对抗样本生成

相关作者

相关机构

相关主题

浏览历史