Research on Anti-Backdoor Learning Method Based on Preposed Unlearning

Abstract: Anti-backdoor learning (ABL) detects and suppresses backdoor formation in real time while a model is trained on a poisoned dataset, ultimately yielding a benign model. However, ABL cannot effectively separate backdoor samples from benign samples, and its backdoor elimination is inefficient. To address this, an anti-backdoor learning method based on preposed unlearning (ABL-PU) is proposed. In the isolation phase, it adds a purification operation on the training samples so that benign samples are effectively isolated. In the elimination phase, it adopts a paradigm of backdoor unlearning followed by model retraining and introduces an unlearning coefficient to eliminate backdoors efficiently. On the CIFAR-10 dataset, against the classical backdoor attack BadNets, ABL-PU improves benign accuracy by 1.21 percentage points and reduces the attack success rate by 1.38 percentage points compared with the baseline ABL method.
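The two-phase idea in the abstract (isolate suspected backdoor samples, then unlearn them before retraining on the remaining data) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the 1-D logistic model, the lowest-loss isolation rule (borrowed from the original ABL), the hypothetical function names, and the interpretation of the unlearning coefficient `gamma` as a scale on the gradient-ascent step. The paper's purification operation is not specified in the abstract and is not modeled here.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss_and_grad(w, b, x, y):
    """Binary cross-entropy loss and its gradients for one sample
    of a 1-D logistic model p = sigmoid(w * x + b)."""
    p = sigmoid(w * x + b)
    eps = 1e-12  # avoids log(0)
    loss = -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))
    return loss, (p - y) * x, (p - y)

def isolate_lowest_loss(losses, ratio):
    """Assumed isolation rule: flag the `ratio` fraction of samples with
    the lowest loss as suspected backdoor samples; keep the rest."""
    k = int(len(losses) * ratio)
    order = sorted(range(len(losses)), key=lambda i: losses[i])
    return order[:k], order[k:]

def sgd_epoch(w, b, data, lr, sign):
    """One pass over `data`. sign > 0 descends the loss (training);
    sign < 0 ascends it (unlearning)."""
    for x, y in data:
        _, gw, gb = loss_and_grad(w, b, x, y)
        w -= sign * lr * gw
        b -= sign * lr * gb
    return w, b

def unlearn_then_retrain(w, b, suspected, retained, gamma, lr,
                         unlearn_epochs, retrain_epochs):
    """Phase 1: gradient ascent on suspected samples, scaled by the
    (assumed) unlearning coefficient gamma.
    Phase 2: ordinary training on the retained samples."""
    for _ in range(unlearn_epochs):
        w, b = sgd_epoch(w, b, suspected, lr, sign=-gamma)
    for _ in range(retrain_epochs):
        w, b = sgd_epoch(w, b, retained, lr, sign=1.0)
    return w, b
```

In this sketch, `gamma = 0` disables unlearning entirely, while larger values push the model further away from fitting the suspected samples before retraining begins. A real implementation would apply the same schedule to a deep network with mini-batches.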

Authors: WANG Hanxu; LI Xin; XU Wentao; SI Binzhou (School of Information Network Security, People's Public Security University of China, Beijing 100038, China; Key Laboratory of Security Prevention Technology and Risk Assessment of the Ministry of Public Security, Beijing 100026, China)

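Affiliations: School of Information Network Security, People's Public Security University of China; Key Laboratory of Security Prevention Technology and Risk Assessment of the Ministry of Public Security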

Source: Computer Engineering and Applications (计算机工程与应用), indexed in CSCD and the Peking University Core Journals list, 2024, No. 19, pp. 259-267 (9 pages)

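Funding: Double First-Class Innovation Research Project on Cyberspace Security Law Enforcement Technology, People's Public Security University of China (2023SYL07).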

Keywords: backdoor attacks; anti-backdoor learning; data purification; preposed unlearning; unlearning coefficient

Classification: TP309 [Automation and Computer Technology: Computer System Architecture]
