优化梯度增强黑盒对抗攻击算法

Optimized Gradient Boosting Black-Box Adversarial Attack Algorithm

下载PDF

导出

摘要对抗样本能够使得深度神经网络以较高置信度输出错误的结果。对抗样本分为白盒攻击和黑盒攻击,白盒攻击目前达到了较高的成功率,而黑盒攻击由于对模型、参数的未知,导致现有黑盒攻击方法的攻击成功率还较低。为了进一步提高黑盒攻击的成功率,提出了一种优化梯度增强黑盒对抗攻击算法。使用混合图像的方式去混合其他类别的图像样本,从而得到混合了其他类别信息的混合梯度。使用上一次迭代过程中的梯度方差去调整当前图像样本的梯度,得到优化梯度。将优化梯度与Adam优化算法结合进行迭代优化生成可迁移性强的对抗样本。在ImageNet数据集上进行了实验,结果表明所提算法能有效提升对抗样本的黑盒攻击性。在单模型攻击和集成模型攻击中的平均攻击成功率分别为71.7%和88.3%,融合了三个基于转换的对抗攻击算法后平均攻击成功率则达到了96.8%。此外,对现有的5个对抗防御模型进行攻击能够实现92.7%的平均成功率,优于当前基于输入转换的攻击方法以及基于梯度的攻击方法。 Adversarial examples can make deep neural networks output wrong results with higher confidence.Adversarial examples are divided into white-box attacks and black-box attacks.White-box attacks have achieved a high success rate at present,while black-box attacks have a low attack success rate due to unknown models and parameters.In order to improve the success rate of black-box attacks,this paper proposes a optimized gradient boosting black-box adversarial attack algo-rithm.Firstly,the method in this paper uses the mixed image method to mix the image samples of other categories and obtain the mixed gradient with the information of other categories.Secondly,the gradient variance in the last iteration pro-cess is used to adjust the gradient of the current image sample to obtain the optimized gradient.Then,the optimized gradi-ent is combined with the Adam optimization algorithm to perform iterative optimization to generate highly transferable adversarial examples.Experiments on the ImageNet dataset show that the proposed algorithm can effectively improve the black-box attack of adversarial examples.The average attack success rate of single model attack and integrated model attack is 71.7%and 88.3%respectively.The average attack success rate has reached 96.8%after the fusion of three trans-form-based anti-attack algorithms.In addition,the average success rate of attacking the five existing adversarial defense models is 92.7%,which is better than the current attack method based on input transformation and gradient attack method.

作者刘梦庭凌捷 LIU Mengting;LING Jie(School of Computer,Guangdong University of Technology,Guangzhou 510006,China)

机构地区广东工业大学计算机学院

出处《计算机工程与应用》 CSCD 北大核心 2023年第18期260-267,共8页 Computer Engineering and Applications

基金广东省重点领域研发计划项目(2019B010139002) 广州市科技计划项目(201902020007,202007010004)。

关键词对抗样本深度神经网络黑盒攻击优化梯度可迁移性 adversarial examples deep neural network black-box attack optimized gradient transferability

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1崔廷玉,张武,贺正芸,周星宇,张瑶,胡谷雨,潘志松.针对眼部掩模的人脸识别对抗贴片研究[J].计算机技术与发展,2023,33(6):139-146.
2丛明,吴敏杰,杜宇,李泳耀.基于抓取模式识别的欠驱动灵巧手抓取方法[J].华中科技大学学报（自然科学版）,2023,51(6):29-35. 被引量：2
3张鑫,沈子钰,李云.面向时序数据的多范数约束对抗样本生成方法[J].指挥与控制学报,2023,9(3):253-262.
4王鑫,刘中旺.基于MATLAB的相关滤波跟踪算法仿真分析[J].计算机测量与控制,2023,31(8):224-230.
5糟伟红,袁至.基于ROF优化算法的微电网自适应过电流保护[J].电力电容器与无功补偿,2023,44(3):126-133. 被引量：1
6姜文涛,孟庆姣.自适应时空正则化的相关滤波目标跟踪[J].智能系统学报,2023,18(4):754-763. 被引量：1

计算机工程与应用

2023年第18期

浏览历史

内容加载中请稍等...

优化梯度增强黑盒对抗攻击算法

相关作者

相关机构

相关主题

浏览历史