
Controlled Facial Gender Forgery Combining Wavelet Transform High-Frequency Information
Abstract: Image-to-image translation (I2I) based on generative adversarial networks (GAN) has achieved a series of breakthroughs and is widely applied to image synthesis, image colorization, and image super-resolution; facial attribute manipulation in particular has been studied in depth. To address the disparity in generated-image quality across different translation directions caused by model architecture and data imbalance, a High-Frequency Injection GAN (HFIGAN) model is proposed to achieve controlled facial gender forgery combining high-frequency information. First, in the wavelet module that incorporates high-frequency information, the encoded features are decomposed at the feature level by a discrete wavelet transform, and the resulting high-frequency information is injected at the corresponding scales during decoding, so that information from the source and target domains remains balanced throughout upsampling. Second, to address the inconsistent translation difficulty across directions in multi-domain I2I tasks, the loss function is redesigned to rescale the losses of hard and easy samples, strengthening the feedback from hard samples so that the model concentrates on them and performance improves. Finally, a diversity regularization term based on style features is proposed, which adds distance metrics between style vectors in different spaces to the traditional diversity loss for supervision, enabling the model to improve generation quality while preserving the diversity of generated images. Experiments on the CelebA-HQ and FFHQ datasets verify the effectiveness of the proposed method, and the generality of the proposed loss is further verified by combining it with mainstream I2I models. Experimental results show that HFIGAN outperforms previous state-of-the-art methods in facial gender forgery, and that the proposed loss function has a degree of generality.
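Two of the abstract's mechanisms can be made concrete in a minimal numpy sketch: the wavelet module's feature-level decomposition (shown here for the Haar wavelet, one plausible choice; the paper's actual filters are not given in the abstract) and a focal-style rescaling that amplifies hard samples' losses (the exact form used by HFIGAN is likewise an assumption, modeled on standard focal loss):

```python
import numpy as np

def haar_dwt2(x):
    """One level of the 2-D Haar discrete wavelet transform.

    Splits a feature map x (H x W, even dims) into a low-frequency
    approximation LL and three high-frequency subbands LH, HL, HH;
    the high-frequency subbands are what the wavelet module would
    re-inject during decoding.
    """
    a = x[0::2, 0::2]  # top-left of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    lh = (a - b + c - d) / 2.0  # horizontal detail
    hl = (a + b - c - d) / 2.0  # vertical detail
    hh = (a - b - c + d) / 2.0  # diagonal detail
    return ll, lh, hl, hh

def focal_scaled_loss(per_sample_loss, gamma=2.0):
    """Hypothetical focal-style rescaling of per-sample losses.

    Maps each loss to a pseudo-probability p = exp(-loss) ("how easy
    the sample looks") and down-weights easy samples by (1 - p)**gamma,
    so hard samples dominate the gradient signal.
    """
    p = np.exp(-per_sample_loss)
    return ((1.0 - p) ** gamma) * per_sample_loss
```

For a constant feature map the three detail subbands vanish (all content is low-frequency), while for a sample with small loss the focal weight shrinks its contribution far more than for a high-loss sample, which matches the abstract's goal of focusing training on difficult translation directions.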
Authors: CHEN Wanze, CHEN Jiazhen, HUANG Liqing, YE Feng, HUANG Tianqiang, LUO Haifeng (College of Computer and Cyber Security, Fujian Normal University, Fuzhou 350117, China)
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2023, Supplement 2, pp. 340-349 (10 pages)
Funding: General Program of the National Natural Science Foundation of China (62072106); Natural Science Foundation of Fujian Province (2020J01168, 2022J01190); Science Foundation of the Education Department of Fujian Province (JAT210053)
Keywords: image generation; generative adversarial network; image-to-image translation; facial attribute manipulation; focal loss