融合通道位置注意力机制和并行空洞卷积的人脸年龄合成 (cited by: 1)

Face age synthesis fusing channel-coordinate attention mechanism and parallel dilated convolution


Abstract. Objective: Face age synthesis aims to synthesize face images at a specified age while keeping the face highly credible, and it is an active research direction in computer vision. However, current mainstream face age synthesis models focus too heavily on texture information and neglect the multi-scale features related to the face; in addition, the networks screen identity information poorly. To address these problems, a face age synthesis network that fuses a channel-coordinate attention mechanism and parallel dilated convolution is proposed (generative adversarial network (GAN) composed of the parallel dilated convolution and channel-coordinate attention mechanism, PDA-GAN). Method: Built on a generative adversarial network, PDA-GAN introduces a parallel three-channel dilated convolution residual block and a channel-coordinate attention mechanism. The residual block fuses face features at different scales extracted by dilated convolutions with three dilation coefficients, which increases both the diversity of feature scales and the overall richness of the features. The channel-coordinate attention mechanism computes saliency along the length, width, and depth of the face features to locate the channels and spatial regions that are highly related to age, strengthening the network's ability to express sensitive features in the channel and spatial dimensions and resolving feature redundancy. Result: The network is trained on the Flickr-Faces-High-Quality (FFHQ) dataset and tested on the large-scale CelebFaces attributes dataset-high quality (CelebA-HQ). PDA-GAN is compared qualitatively and quantitatively with the three latest face age image synthesis networks to verify its effectiveness. The results show that PDA-GAN significantly improves the identity confidence and age estimation accuracy of face age synthesis, with good identity preservation and age manipulation. Conclusion: The method can synthesize target-age face images with high realism and accuracy.

English abstract. Objective: Face age synthesis, one of the most popular research fields in computer vision, aims to synthesize face images of specified ages while maintaining high fidelity. Face age synthesis technology is gradually being applied in face recognition, film special effects, public security, and other fields. The generative adversarial network (GAN) is one of the most widely used deep learning models in face synthesis: its generator and discriminator compete with each other until the generated images are realistic enough to pass for real ones. Although GAN and its variants have achieved good synthesis results, some deficiencies remain. First, to synthesize images close to the target age, current face age synthesis models limit the age change to texture information and ignore multi-scale facial features such as contour and hair color. Second, the limited receptive field of the convolutional layer hinders a fully convolutional network from extracting multi-scale features in the image. These problems greatly restrict the quality of face age images synthesized by GANs. To solve them, this paper proposes a GAN composed of the parallel dilated convolution and channel-coordinate attention mechanism (PDA-GAN).

Method: Based on generative adversarial networks, PDA-GAN introduces a parallel three-channel dilated convolutional residual block (PTDCRB) and a channel-coordinate attention mechanism (CCAM). PTDCRB is inserted into the generator network of the baseline. Each PTDCRB comprises three parallel dilated convolution branches that extract features simultaneously, with dilation coefficients of 1, 2, and 3, respectively. The branches share weights, which reduces the number of network parameters. In each branch, the first layer is a 1×1 convolutional layer, the second is a dilated convolution with that branch's dilation coefficient, and the third is a 1×1 convolutional layer that reduces dimensionality and improves computational efficiency. Meanwhile, CCAM screens the channel dimension of the feature vector by saliency, retains meaningful channel information, and learns the importance of different channels in order to avoid feature redundancy. CCAM then embeds position information into the channel-attended features: it computes attention along the two orthogonal directions of length and width and fuses the results. This allows CCAM to capture the dependencies between features at different positions.

Result: The network is trained on the FFHQ dataset, and samples from the CelebA-HQ dataset are selected as the test set. PDA-GAN is compared qualitatively and quantitatively with three recent face age image synthesis networks, HRFAE, LIFE, and SAM, to verify its effectiveness. Age accuracy and identity consistency are adopted as quantitative indicators. PDA-GAN achieves the best accuracy for synthetic age images, with an average prediction difference of 4.09. The identity confidence reaches 99.2% when synthesizing a 30-year-old face. In the age-independent attribute retention experiment, PDA-GAN outperforms the other models on both quantitative indicators, with a gender retention rate of 99.7% and an emotion retention rate of 93.2%. Ablation experiments further examine the effectiveness of each module. First, PTDCRB is introduced into different layers of the generator backbone network; the results show that PTDCRB-3 significantly improves identity confidence and age estimation accuracy. Second, the network is trained with four sets of PTDCRB dilation coefficients; the results indicate that the set [1, 2, 3] gives the best identity confidence and predicted age distribution. Third, the standard generator and the generator with the channel-coordinate attention mechanism are compared on age synthesis accuracy and identity verification confidence; identity retention and age synthesis ability both improve significantly after the attention mechanism is added.

Conclusion: This study proposes a parallel three-channel dilated convolution residual block with shared weights that captures feature information at each scale and enriches the detail features of the model. To enhance the expressiveness of the model on sensitive features, it also proposes a channel-coordinate attention mechanism that learns features in the channel and spatial dimensions simultaneously. Together, the two modules improve the identity preservation ability and age synthesis accuracy of the model on face images. Experimental results show that the proposed method outperforms other popular methods on face age synthesis tasks and can synthesize natural, realistic face images of the target age with high fidelity and accuracy.
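The idea behind the parallel dilated branches can be illustrated with a small pure-Python sketch. It is not the authors' implementation: it assumes a single-channel feature map, omits the 1×1 convolutions, and fuses the branches by element-wise summation with a residual connection, none of which the abstract specifies. The function names `dilated_conv2d`, `effective_kernel_size`, and `parallel_dilated_block` are hypothetical. What it does show is how one shared 3×3 kernel, applied at dilation coefficients 1, 2, and 3, covers footprints of 3, 5, and 7 pixels without adding parameters.

```python
def dilated_conv2d(x, kernel, dilation):
    """Zero-padded, same-size 2-D cross-correlation of a single-channel
    map `x` (list of rows) with a square `kernel` at the given dilation."""
    h, w = len(x), len(x[0])
    k = len(kernel)
    half = (k // 2) * dilation  # padding that keeps output size == input size
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for a in range(k):
                for b in range(k):
                    # Kernel taps are spaced `dilation` pixels apart.
                    ii = i + a * dilation - half
                    jj = j + b * dilation - half
                    if 0 <= ii < h and 0 <= jj < w:
                        acc += kernel[a][b] * x[ii][jj]
            out[i][j] = acc
    return out


def effective_kernel_size(k, dilation):
    """Side length of the footprint covered by a k×k kernel at this dilation."""
    return k + (k - 1) * (dilation - 1)


def parallel_dilated_block(x, kernel, dilations=(1, 2, 3)):
    """Apply ONE shared kernel at several dilations in parallel.

    Assumption (not stated in the abstract): branches are fused by
    element-wise sum and added back to the input as a residual.
    """
    branches = [dilated_conv2d(x, kernel, d) for d in dilations]
    h, w = len(x), len(x[0])
    return [
        [x[i][j] + sum(branch[i][j] for branch in branches) for j in range(w)]
        for i in range(h)
    ]


# A 9×9 impulse makes each branch's footprint directly visible.
impulse = [[0.0] * 9 for _ in range(9)]
impulse[4][4] = 1.0
ones = [[1.0] * 3 for _ in range(3)]  # the single shared 3×3 kernel
fused = parallel_dilated_block(impulse, ones)
print([effective_kernel_size(3, d) for d in (1, 2, 3)])  # [3, 5, 7]
```

With the impulse input, the non-zero entries of `fused` sit at offsets of 1, 2, and 3 pixels from the centre, one ring per branch, which is the multi-scale coverage the abstract attributes to PTDCRB. Because all three branches reuse `ones`, the parameter count stays at that of a single 3×3 kernel.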

Authors: 张珂, 于婷婷, 石超君, 娄文硕, 刘阳 (Zhang Ke; Yu Tingting; Shi Chaojun; Lou Wenshuo; Liu Yang) (Department of Electronic and Communication Engineering, North China Electric Power University, Baoding 071003, China; Hebei Key Laboratory of Power Internet of Things Technology, North China Electric Power University, Baoding 071003, China)

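Affiliations: Department of Electronic and Communication Engineering, North China Electric Power University; Hebei Key Laboratory of Power Internet of Things Technology, North China Electric Power University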

Source: Journal of Image and Graphics (《中国图象图形学报》), indexed in CSCD and the Peking University Core Journals list, 2023, No. 12, pp. 3870-3883 (14 pages)

Funding: National Natural Science Foundation of China (62076093, 62206095); Fundamental Research Funds for the Central Universities (2022MS078, 2020MS099, 2020YJ006).

Keywords: image synthesis; face age; generative adversarial network (GAN); dilated convolution; attention mechanism

Classification: TP301.6 [Automation and Computer Technology: Computer System Architecture]
