保护Transformer模型知识产权的鲁棒水印

Robust Watermarking for Protect Transformer Intellectual Property

下载PDF

导出

摘要作为自然语言处理(NLP)领域最强大的深度学习模型,Transformer在机器翻译和自然语言生成等任务中表现出色。同时,这意味着Transformer模型的知识产权(IPR)侵权风险也越来越大,尤其那些训练成本很高的大型模型。尽管目前存在针对卷积神经网络(ConvolutionalNeuralNetworks)和生成对抗网络(GenerativeAdversarialNetworks)等模型的所有权验证方法,但针对Transformer的工作还很欠缺。因此,为了完善Transformer的知识产权保护,让版权所有者在黑盒环境和白盒环境下都能够有效验证Transformer模型所有权,本文首先提出了一种基于额外关注力的白盒水印方案,该方案将所有者签名嵌入模型中并能够抵抗各种攻击,包括通常水印无法抵御的模糊攻击(不破坏现有水印而是加入攻击者水印造成所有权混淆)。之后,本文提出了一个基于混合触发器(HybridTriggers)的后门添加方案,该方案在不访问模型源码的黑盒情况下实现了对模型所有权的验证,具有良好的隐蔽性和抗去除性。此外,本文研究了一种新形式的模糊攻击,实验结果表明,面对这种攻击,本文提出的水印方案优于现有的深度神经网络水印方案。本文为Transformer提供了一个更鲁棒的水印方案,解决了现有技术的局限性,加强了Transformer的知识产权保护。 As the most powerful deep learning model in natural language processing(NLP),Transformer has excellent performance in tasks such as machine translation and natural language generation.However,this also means that Transformer models are increasingly at risk of Intellectual Property Rights(IPR)infringement,especially for large models with extremely high training costs.Although ownership verification methods are available for models such as Convolutional Neural Networks(CNN)and Generative Adversarial Networks(GAN),protection work for Transformer is still lacking.Therefore,in order to effectively verify the ownership of Transformer models in both black-box and white-box settings,this paper first proposes a robust watermark that can resist various attacks,including ambiguity attacks(not destroying the existing watermark,but adding the attacker's watermark to cause ownership obfuscation),which normal watermarking schemes cannot resist,by adding Extra Attention as a white-box watermark carrier.Secondly,this paper implements a backdoor addition scheme based on Hybrid Triggers,which has good crypticity and removal resistance while achieving model ownership verification without access to the source code.In addition,a new form of ambiguity attack is investigated in this paper,and experimental results show that the watermarking scheme of this paper outperforms existing deep neural network watermarking schemes in the face of such attacks.The watermarking method proposed in this paper addresses the limitations of previous works,provides more robust watermarking for the Transformer,and enhances the intellectual property protection of the model.

作者王保卫郑伟钤 WANG Baowei;ZHENG Weiqian(School of Software,Nanjing University of Information Science and Technology,Nanjing 210044,China)

机构地区南京信息工程大学软件学院

出处《信息安全学报》 CSCD 2024年第5期127-138,共12页 Journal of Cyber Security

基金本课题得到国家自然科学基金等资助。

关键词深度神经网络知识产权保护所有权验证鲁棒水印 deep neural network intellectual property protection ownership verification robust watermark

分类号 TP183 [自动化与计算机技术—控制理论与控制工程] TP309.7 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献1

1刘伟发,张光华,杨婷,王鹤.基于标志网络的深度学习多模型水印方案[J].信息安全学报,2022,7(6):105-115. 被引量：2

共引文献1

1金彪,林翔,熊金波,尤玮婧,李璇,姚志强.基于水印技术的深度神经网络模型知识产权保护[J].计算机研究与发展,2024,61(10):2587-2606.

1萨其瑞,尤玮婧,张逸飞,邱伟杨,马存庆.联邦学习模型所有权保护方案综述[J].信息网络安全,2024(10):1553-1561.
2林云.基于UniLM预训练的改进数学问答模型[J].物联网技术,2024,14(10):120-122.
3吴穗湘,任江涛,嵇志国.NLP在高速公路信息发布内容审核中的应用[J].中国交通信息化,2024(9):117-120.
4李远丽,刘伟,李润生,牛朝阳,李芳润,卢万杰.光学遥感图像语义描述的深度学习方法[J].信息工程大学学报,2024,25(5):532-537.
5王鹏林,史东辉,江金松,宫小兰,刘梦茹.虚拟现实技术辅助机器车视觉循迹建模[J].安庆师范大学学报（自然科学版）,2024,30(3):71-77.
6林云柯.人工智能会梦到鬼故事吗?——从“文学模型”到计算机病毒[J].中国图书评论,2024(10):8-19.
7古博韬,石开铭,李建华,王丽颖,杨乐,方东平.基于大语言模型的施工安全管理技术[J].施工技术（中英文）,2024,53(17):15-19. 被引量：1
8李森,唐建,袁强.基于国产FPGA的UDP协议栈IP核设计与实现[J].空天预警研究学报,2024,38(5):347-352.
9吴倍骏.美国涉华337调查现状分析及应对策略探讨[J].中国发明与专利,2024,21(S01):113-118.
10傅建平.数字政府建设中应用生成式人工智能的风险及其克服[J].武汉大学学报（哲学社会科学版）,2024,77(6):34-47.

信息安全学报

2024年第5期

浏览历史

内容加载中请稍等...

保护Transformer模型知识产权的鲁棒水印

参考文献1

共引文献1

相关作者

相关机构

相关主题

浏览历史