
Adversarial training method with adaptive attack strength
Abstract: The vulnerability of Deep Neural Networks (DNNs) to adversarial examples has raised significant concerns about the security and reliability of artificial intelligence systems; adversarial training is an effective way to enhance adversarial robustness. To address the problem that existing methods use a fixed adversarial example generation strategy and thereby neglect the importance of the generation phase to adversarial training, an adversarial training method based on adaptive attack strength was proposed. Firstly, the clean example and the adversarial example were fed into the model to obtain their outputs. Then, the difference between the model outputs on the clean example and the adversarial example was calculated. Finally, the change of this difference relative to the previous moment was measured and used to automatically adjust the strength of the adversarial example. Comprehensive experiments on three benchmark datasets show that, compared with the baseline method Adversarial Training with Projected Gradient Descent (PGD-AT), the proposed method improves robust accuracy under AutoAttack (AA) by 1.92, 1.50 and 3.35 percentage points on the three datasets respectively, and it outperforms the state-of-the-art defense Adversarial Training with Learnable Attack Strategy (LAS-AT) in both robustness and natural accuracy. Furthermore, from a data augmentation perspective, the proposed method effectively addresses the problem that the augmentation effect of adversarial training, a special form of data augmentation, keeps diminishing as training progresses.
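The abstract describes a feedback loop: track how far the model's output on an adversarial example diverges from its output on the clean example, and raise the attack strength when that divergence starts shrinking. A minimal sketch of that idea is below; all names, step sizes, and bounds are illustrative assumptions, not the authors' actual algorithm.

```python
class StrengthScheduler:
    """Hypothetical controller for the adaptive attack strength described
    in the abstract: when the clean-vs-adversarial output divergence drops
    (the augmentation effect is fading), increase the perturbation budget;
    otherwise ease it back down. Step sizes and bounds are assumptions."""

    def __init__(self, eps=8 / 255, step=1 / 255,
                 eps_min=2 / 255, eps_max=16 / 255):
        self.eps = eps            # current perturbation budget for the attack
        self.step = step          # adjustment applied per update
        self.eps_min = eps_min    # floor: keep some attack strength
        self.eps_max = eps_max    # ceiling: avoid unusable perturbations
        self.prev_div = None      # divergence observed at the previous step

    def update(self, divergence):
        """divergence: e.g. a KL divergence between model(x) and model(x_adv),
        averaged over the current batch. Returns the new budget."""
        if self.prev_div is not None:
            if divergence < self.prev_div:
                # Adversarial examples are losing effect -> attack harder.
                self.eps = min(self.eps + self.step, self.eps_max)
            else:
                # Attack is already strong enough -> back off slightly.
                self.eps = max(self.eps - self.step, self.eps_min)
        self.prev_div = divergence
        return self.eps
```

In a PGD-style training loop, `update` would be called once per iteration with the measured output divergence, and the returned budget fed to the next round of adversarial example generation.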
Authors: CHEN Tong; WEI Jiwei; HE Shiyuan; SONG Jingkuan; YANG Yang (School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan 611731, China)
Source: Journal of Computer Applications (《计算机应用》, CSCD, Peking University Core Journal), 2024, Issue 1, pp. 94-100 (7 pages)
Funding: National Natural Science Foundation of China (U20B2063, 62220106008, 62306067); China Postdoctoral Science Foundation (2022M720660)
Keywords: adversarial training; adversarial example; adversarial defense; adaptive attack strength; deep learning; image classification; artificial intelligence security