深度伪造及其取证技术综述

A survey of Deepfake and related digital forensics

导出

摘要深度学习作为机器学习的一个具有前景的重要分支,在计算机视觉方面取得了重大突破。深度伪造(Deepfake)通常指的是使用深度学习(deep learning)进行涉及人脸和人声的多媒体伪造技术,如果被恶意滥用会给社会带来灾难。深度伪造不仅限于面部的替换,还有修改面部特征、修改表情、唇形同步、姿势变换、完整脸生成、篡改音频到视频以及文本到视频等方式。人类面部在社会、政治、经济等方面的敏感性,使得深度伪造技术威胁着社会和个人的安全。对深度伪造产物进行检测也成为数字取证领域的一个重要研究课题。为了提供对Deepfake检测研究工作的最新概述,本文描述了各种针对解决Deepfake相关问题的处理方法。本文主要参考了谷歌学术检索2018—2022共5年的深度伪造论文,分为不同类别进行分析比较,并且详细介绍了深度伪造数据集的特点以及伪造方法,简述了深度伪造技术及其基本原理,介绍了检测器在深度伪造技术数据集上的性能效果,分别从输入维度、浅层特征和深层特针对深度伪造检测技术进行分类,并对未来发展前景进行展望。 Deep learning,a promising branch of machine learning,has made significant breakthroughs in computervision.However,Deepfake,which refers to the set of techniques for forging human-related multimedia data using deeplearning,can bring disasters to society if used maliciously.It is not only limited to facial replacement,but also othermanipulations,such as fabricating facial features,manipulating expressions,synchronizing lips,modifying head gestures,entire face synthesis,and tampering related audios to videos and related texts to videos.Moreover,it can be used to gener⁃ate faked pornographic videos or even faked speeches to subvert state power.Thus,deep forgery technology can greatlythreaten society and individuals,thereyby detecting Deepfake has also become an important research topic in digital foren⁃sics.We conducted a systematic and critical survey to provide an overview of the latest research on Deepfake detection by exploring the recent developments in Deepfake and related forensic techniques.This survey mainly referred to papers onDeepfake in Google Scholar during 2018—2022.This survey divided the Deepfake detection techniques into two categoriesfor analysis and comparison:input dimensions and forensic features.First,a comprehensive and systematic introduction ofdigital forensics is presented from the following aspects:1)the development and security of deep forgery detection technol⁃ogy,2)Deepfake technology architecture,and 3)the prevailing datasets and evaluation metrics.Then,this survey pres⁃ents Deepfake techniques in several categories.Finally,future challenges and development prospects are discussed.Interms of image and video effects,Deepfake techniques are usually divided into four categories:face replacement,lip syn⁃chronization,head puppets,and attribute modification.The most commonly used Deepfake algorithms are based on selfencoders,generative adversarial networks,and diffusion models.An typical autoencoder consists of two convolutional neu⁃ral networks acting as an encoder and a decoder.The encoder reduces the dimensions of the input targets’facial image andencodes it into a vector corresponding to facial features.We share the parameters of the encoder;that is,we use the sameencoder to learn only the common feature information for the encoder network.The structure of a generative adversarial net⁃work is based on a generator and a discriminator.The generator is similar to the decoder in an autoencoder,which convertsthe input noise into a picture and sends it to the discriminator for discrimination along with the real existing picture.Thediscriminator and the generator use back-propagation to optimize the parameters.Moreover,diffusion model is a parameter⁃ized Markov chain trained using variational inference to produce samples that match the data after a finite time.There arealways two processes to train a diffusion model.One is the forward process,also called the diffusion process.The other pro⁃cess is reverse diffusion,also known as the reverse process,which slowly restores the original image from noise throughcontinuous sampling.In the Deepfake detection task,the datasets have also evolved to fill past gaps.In general,this sur⁃vey divides the Deepfake datasets into two generations.The first-generation datasets are often not large enough,and thequality of the content is not satisfying because of the low degree of research fervor.These source videos are usually fromvideo sites or existing face datasets,which can lead to copyright and privacy concerns.The main first-generation datasetsare UADFV,DF-TIMIT,FaceForensics,and diverse fake face dataset(DFFD).The second generation of face forgery data⁃sets has improved forgery effects and image clarity.The main second-generation datasets are Celeb-DF,Deepfake detec⁃tion challenge dataset(DFDC)preview,DeeperForensic-1.0,DFDC,Korean Deepfake detection dataset(KoDF),etc.Interms of input dimension,detecting Deepfake can be roughly divided into three categories:1)the first category is inputtingthe image or key frame from the video,namely,inputting the image or key frame extracted from the video and judging theinput data from the visual performance.This category is commonly used because it can be promoted easily to other com⁃puter vision classification models,and most Deepfake videos are conducted by frame-by-frame images.2)The second isinputting continuous frames from video.In particular,multiple consecutive frames are inputted to allow the model to per⁃ceive the difference in the relationship between the frames from real and fake videos.3)The third is inputting multipleframes and audio simultaneously from the video;that is,the video’s authenticity is detected by examining its video framesand audio together.The features focused on by Deepfake detection techniques also vary.This survey divides them into fourcategories:1)the frequency domain-based approach looks for anomalies in the video at the signal level,treating the videoas a sequence of frames and a synchronized audio signal.Such anomalies,including image mismatches and mismatches inaudio-video synchronization,are usually generated from the mismatches at the signal level during Deepfake video genera⁃tion.2)The texture and spatio-temporal approaches tend to focus only on face position and feature matching in the forgedvideo generation process,where breakdowns that violate the laws of physics and human physiology may occur.3)Thereconstruction–classification learning methods emphasize the common compact representations of genuine faces andenhance the learned representations to be aware of unknown forgery patterns.Classification learning involves mining theessential discrepancy between real and fake images,facilitating the understanding of forgeries.4)Data-driven methods aredetection methods that do not target specific features.However,they use supervised learning to feed real and fake videosinto the model for training.The road to the research on deep forgery techniques and deep forgery detection is still long.Wemust overcome the existing shortcomings and face the challenges of future technological advances.

作者丁峰匡仁盛周越孙珑朱小刚朱国普 Ding Feng;Kuang Rensheng;Zhou Yue;Sun Long;Zhu Xiaogang;Zhu Guopu(School of Software,Nanchang University,Nanchang 330047,China;School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150006,China;School of Public Policy and Administration,Nanchang University,Nanchang 330047,China;Jiangxi Institute of Interest of Things Industry Technology,Yingtan 335003,China)

机构地区南昌大学软件学院哈尔滨工业大学计算机科学与技术学院南昌大学公共政策与管理学院江西省物联网产业技术研究院

出处《中国图象图形学报》 CSCD 北大核心 2024年第2期295-317,共23页 Journal of Image and Graphics

基金国家自然科学基金项目(62262041,62172402) 数据安全治理关键技术研究与应用项目(20224BBC41001) 江西省自然科学基金项目(20232BAB202011)。

关键词深度造假机器学习人工智能深度学习数字取证数字反取证 Deepfake machine learning artificial intelligence deep learning digital forensics digital anti-forensics

分类号 TP37 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献14

1曹申豪,刘晓辉,毛秀青,邹勤.人脸伪造及检测技术综述[J].中国图象图形学报,2022,27(4):1023-1038. 被引量：8
2李晓龙,俞能海,张新鹏,张卫明,李斌,卢伟,王伟,刘晓龙.数字媒体取证技术综述[J].中国图象图形学报,2021,26(6):1216-1226. 被引量：15
3李旭嵘,纪守领,吴春明,刘振广,邓水光,程鹏,杨珉,孔祥维.深度伪造与检测技术综述[J].软件学报,2021,32(2):496-518. 被引量：29
4李颖,边山,王春桃,卢伟.CNN结合Transformer的深度伪造高效检测[J].中国图象图形学报,2023,28(3):804-819. 被引量：5
5李泽宇,张旭鸿,蒲誉文,伍一鸣,纪守领.多模态深度伪造及检测技术综述[J].计算机研究与发展,2023,60(6):1396-1416. 被引量：2
6蔺琛皓,沈超,邓静怡,胡鹏斌,王骞,马仕清,李琦,管晓宏.虚假数字人脸内容生成与检测技术[J].计算机学报,2023,46(3):469-498. 被引量：5
7罗向阳,王道顺,汪萍,刘粉林.基于图像多域特征缩放与BP网络的信息隐藏盲检测[J].东南大学学报（自然科学版）,2007,37(A01):87-91. 被引量：3
8谭明奎,许守恺,张书海,陈奇.深度对抗视觉生成综述[J].中国图象图形学报,2021,26(12):2751-2766. 被引量：9
9王任颖,储贝林,杨震,周琳娜.视觉深度伪造检测技术综述[J].中国图象图形学报,2022,27(1):43-62. 被引量：8
10谢天,于灵云,罗常伟,谢洪涛,张勇东.深度人脸伪造与检测技术综述[J].清华大学学报（自然科学版）,2023,63(9):1350-1365. 被引量：5

二级参考文献45

1姜楠,王健,杨义先.新的唯秘密载体信息隐藏分析方法[J].北京邮电大学学报,2006,29(2):1-4. 被引量：7
2United States Department of Agriculture.NRCS photo gallery[EB/OL].(2002-05-24)[2006-12].http://photogallery.nrcs.usda.gov/.
3Hsu C W,Chang C C,Lin C J.A practical guide to support vector classification[EB/OL].(2006-10-20)[2006-12-12].http://www.csie.ntu.edu.tw/～cjlin/papers/guide/guide.pdf.
4Farid H.Detecting hidden messages using higher-order statistical models[C]//Proceedings of IEEE International Conference on Image Processing.New York,USA,2002,2:905-908.
5Xuan G R,Shi Y Q,Gao J J,et al.Steganalysis based on multiple features formed by statistical moments of wavelet characteristic functions[C]//Proceedings of 7th International Information Hiding Workshop,Lecture Notes in Computer Science 3727.Barcelona,Spain,2005:262-277.
6Lie W,Lin G.A feature-based classification technique for blind image steganalysis[J].IEEE Transactions on Multimedia,2005,7(6):1007-1020.
7Luo X Y,Yang C F,Wang D S,et al.LTSB steganalysis based on quartic equation[J].LNCS Transactions on Data Hiding and Multimedia Security,2007(Ⅱ):68-90.
8Fridrich J,Soukal D,Goljan M.Maximum likelihood estimation of length of secret message embedded using+-K steganography in spatial domain[C]//Proceedings of Security and Watermarking of Multimedia Contents VII,SPIE 5681.San Jose,USA,2005:595-606.
9Brown A.S-Tools steganography[EB/OL].(2003-09-30)[20012].http://www.jjtc.com/Security/stegtools.htm.
10Westfeld A.Steganography software F5[EB/OL].(2003-03)[2006-12].http://wwwrn.inf.tu-dresden.de/～westfeld/f5.html.

共引文献82

1尚海涛.“深度伪造”法律规制的新范式与新体系[J].河北法学,2023,41(1):23-42. 被引量：14
2罗向阳,刘粉林,杨春芳,何雄飞.基于噪声模型和特征联合的PS图像与隐写图像检测[J].计算机学报,2010,33(6):1060-1072. 被引量：4
3张谦,刘粉林,王九智,罗向阳.基于实验的典型隐写通用盲检测方法性能分析[J].信息工程大学学报,2012,13(3):312-318.
4刘宇擎,张玉槐,段沛奇,施柏鑫,余肇飞,黄铁军,高文.针对强人工智能安全风险的技术应对策略[J].中国工程科学,2021,23(3):75-81. 被引量：9
5朱晓瑜,赵静岚.人脸识别技术滥用问题及治理对策[J].中国安全防范技术与应用,2021(4):32-37. 被引量：3
6倪雪莉,王群,梁广俊.微信证据的鉴真方法研究[J].信息网络安全,2021(12):60-69. 被引量：1
7纪守领,杜天宇,邓水光,程鹏,时杰,杨珉,李博.深度学习模型鲁棒性研究综述[J].计算机学报,2022,45(1):190-206. 被引量：32
8马喆,周华兵.采用低层特征的深度伪造图像检测方法[J].软件导刊,2022,21(1):238-242.
9董琳,黄丽清,叶锋,黄添强,翁彬,徐超.人脸伪造检测泛化性方法综述[J].计算机科学,2022,49(2):12-30. 被引量：4
10秘雨欣.深度伪造视频的鉴别与规制[J].信息技术与信息化,2022(1):26-28.

1特日格乐,孙安修,赵剑锋,朱天兵,王越.三七化学成分及药理作用研究进展[J].中文科技期刊数据库（全文版）医药卫生,2023(3):174-176.
2张玲玉,解海卫,张艳,崔浩然.微波和紫外改性生物炭对化肥的吸附性能的影响研究[J].山西化工,2024,44(2):4-5.
3张洁,朱亮,寇远涛.国内外学术场景下个性化文本检索研究述评[J].农业大数据学报,2023,5(4):24-36.
4舒红跃,张颖.人工智能生命:演化方式、危险与伦理考量[J].江汉论坛,2024(2):53-60.
5瞿左珉,殷琪林,盛紫琦,吴俊彦,张博林,余尚戎,卢伟.人脸深度伪造主动防御技术综述[J].中国图象图形学报,2024,29(2):318-342.
6邱雨,吴波.“中国技术威胁论”的批判与应对[J].理论导刊,2024(3):101-109.
7孙跃元,许建峰.商业算法自动化决策的私权构建与实现[J].中州学刊,2024(2):70-78.

中国图象图形学报

2024年第2期

浏览历史

内容加载中请稍等...

深度伪造及其取证技术综述

参考文献14

二级参考文献45

共引文献82

相关作者

相关机构

相关主题

浏览历史