Research on a Black-box Attack Algorithm Targeting ID Card Text Recognition

Abstract: ID card authentication commonly relies on text recognition models to extract and recognize the fields of an ID card image and to authenticate identity, which poses a significant risk of privacy leakage. Moreover, most existing adversarial attack algorithms for text recognition models consider only data with simple backgrounds (such as printed text) and white-box conditions; they struggle to achieve the desired attack effect in the physical world and are not suited to complex backgrounds, complex data, or black-box conditions. To alleviate these problems, this paper proposes a black-box attack algorithm for ID card text recognition models that accounts for more complex image backgrounds, stricter black-box conditions, and attack effectiveness in the physical world. Building on transfer-based black-box attacks, the algorithm introduces a binarization mask and spatial transformations, which improve the visual quality of the adversarial examples and their robustness in the physical world while maintaining the attack success rate. By exploring the performance upper bound of transfer-based black-box attacks under different norm constraints and the influence of key hyperparameters, the algorithm achieves a 100% attack success rate on the Baidu ID card recognition model. The ID card dataset will be made publicly available in the future.
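To make the role of the binarization mask concrete, the following is a minimal sketch, not the authors' implementation. It assumes grayscale images scaled to [0, 1], treats dark pixels as text strokes, and uses a toy linear loss (`toy_surrogate_grad`, a hypothetical stand-in for a real surrogate recognition model) to supply gradients. All function names, the threshold, and the step sizes are illustrative assumptions; the spatial transformations mentioned in the abstract are omitted.

```python
import numpy as np

def binarization_mask(image, threshold=0.5):
    """Return 1.0 where a pixel is darker than the threshold (assumed text stroke), else 0.0."""
    return (image < threshold).astype(image.dtype)

def masked_linf_step(image, adv, grad, mask, step_size, epsilon):
    """One sign-gradient step, restricted to masked pixels and to an L-infinity ball."""
    adv = adv + step_size * np.sign(grad) * mask
    # Keep the perturbation within epsilon of the clean image.
    adv = np.clip(adv, image - epsilon, image + epsilon)
    # Keep pixel values in the valid range.
    return np.clip(adv, 0.0, 1.0)

def toy_surrogate_grad(adv, weights):
    """Gradient of the toy loss sum(weights * adv) with respect to adv.

    Hypothetical placeholder: a real transfer attack would backpropagate
    through a surrogate text recognition model instead.
    """
    return weights

def masked_attack(image, weights, epsilon=0.1, step_size=0.02, steps=10):
    """Iterate masked sign-gradient steps; return the adversarial image and the mask."""
    mask = binarization_mask(image)
    adv = image.copy()
    for _ in range(steps):
        grad = toy_surrogate_grad(adv, weights)
        adv = masked_linf_step(image, adv, grad, mask, step_size, epsilon)
    return adv, mask
```

In this sketch the perturbation is confined to the pixels selected by the mask, so the background stays untouched, which is one plausible reading of how a mask could improve the visual quality of an adversarial example.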

Authors: XU Chang-Kai; FENG Wei-Dong; ZHANG Chun-Jie; ZHENG Xiao-Long; ZHANG Hui; WANG Fei-Yue

Affiliations: The Institute of Information Science, School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044; Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing 100044; State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190; State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049; School of Transportation Science and Engineering, Beihang University, Beijing 100191

Source: Acta Automatica Sinica (indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2024, Issue 1, pp. 103-120 (18 pages)

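Funding: Science and Technology Innovation 2030 "New Generation Artificial Intelligence" Major Project (2020AAA0108401); Beijing Natural Science Foundation (JQ20022); National Natural Science Foundation of China (62072026, 72225011); Chinese Association for Artificial Intelligence (CAAI)-Ascend CANN Academic Fund; supported by the OpenI community.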

Keywords: adversarial examples; black-box attack; ID card text recognition; physical world; binarization mask

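Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]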

