Abstract
Text adversarial defense aims to enhance the resilience of neural network models against different adversarial attacks. Current text adversarial defense methods are usually effective only against a specific adversarial attack and have little effect on attacks built on different principles. To address this shortcoming, this paper proposes the Textual Adversarial Distribution Training (TADT) method and formalizes it as a minimax optimization problem: the inner maximization learns the adversarial distribution of each input example, while the outer minimization reduces the number of adversarial examples by minimizing the expected loss. The paper mainly studies attack methods based on gradient descent and synonym substitution. Experimental results on two text classification datasets show that, under three different adversarial attacks, Probability Weighted Word Saliency (PWWS), Genetic Attack (GA), and Unsupervised Adversarial Training (UAT), the accuracy of TADT is on average 2% higher than that of the recent Dirichlet Neighborhood Ensemble (DNE) method, and more than 10% higher than that of other methods. Without affecting accuracy on clean samples, TADT significantly improves model robustness and maintains high accuracy under various adversarial attacks, demonstrating good generalization performance.
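The minimax formulation described in the abstract can be sketched as follows. This is an illustrative rendering in the standard style of adversarial distribution training, not the paper's exact equation; the symbols $p(\delta\mid x)$ (per-example perturbation distribution), $f_{\theta}$ (classifier), and $\mathcal{L}$ (loss) are assumed notation:

```latex
\min_{\theta} \;
\mathbb{E}_{(x,y)\sim\mathcal{D}}
\Big[
  \max_{p(\delta\mid x)\in\mathcal{P}} \;
  \mathbb{E}_{\delta\sim p(\delta\mid x)}
  \big[ \mathcal{L}\big(f_{\theta}(x+\delta),\, y\big) \big]
\Big]
```

Here the inner maximization learns an adversarial distribution over perturbations of each input $x$ (e.g., synonym substitutions), and the outer minimization updates the model parameters $\theta$ against the expected loss under that distribution, which in practice would typically be estimated by Monte Carlo sampling.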
Authors
沈志东
岳恒宪
SHEN Zhidong; YUE Hengxian (School of Cyber Science and Engineering, Wuhan University, Wuhan 430000, China)
Source
《计算机工程》
CAS
CSCD
Peking University Core Journals (北大核心)
2023, No. 9, pp. 16-22 (7 pages)
Computer Engineering
Funding
National Key Research and Development Program of China (2018YFC1604000)
Key Research and Development Program of Hubei Province (2022BAA041).
Keywords
textual adversarial distribution
Adversarial Training (AT)
variational autoencoder
gradient descent
Monte Carlo sampling