
A multi-pose face frontalization method based on encoder-decoder network
Cited by: 2
Abstract Multi-pose face frontalization can alleviate the influence of head pose variation on face analysis tasks. Previous methods that synthesize a frontal face image directly from a multi-pose face image tend to lose facial detail. To address this problem, we propose a face frontalization method based on an encoder-decoder network, the multitask convolutional encoder-decoder network (MCEDN). MCEDN introduces a frontal raw feature network that synthesizes the global raw features of the frontal face. The decoder then fuses these global raw features with the local features that the encoder extracts from the multi-pose face image, compensating for missing detail and producing a clearer frontal face image. A multitask learning mechanism builds an end-to-end model that unifies three modules (local feature extraction, frontal raw feature synthesis, and frontal image synthesis), and sharing parameters among them improves the performance of the whole model. Compared with existing methods, MCEDN synthesizes frontal face images with a stable structure and clear details on multiple datasets. Using the synthesized frontal images directly for face recognition and facial expression recognition yields accuracy at the state-of-the-art level, which indicates that MCEDN effectively preserves facial details and supports face analysis tasks.
Authors Haiyue XU (徐海月); Naiming YAO (姚乃明); Xiaolan PENG (彭晓兰); Hui CHEN (陈辉); Hongan WANG (王宏安) (Beijing Key Laboratory of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China)
Source Scientia Sinica (Informationis) (《中国科学:信息科学》), indexed in CSCD and the Peking University Core Journals list, 2019, No. 4, pp. 450-463 (14 pages)
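Funding Supported by the National Key Research and Development Program of China (Grant No. 2016YFB1001405), the National Natural Science Foundation of China (Grant No. 61661146002), and the Key Research Program of Frontier Sciences, Chinese Academy of Sciences (Grant No. QYZDY-SSW-JSC041)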
Keywords face frontalization; convolutional neural network; encoder-decoder network; multitask learning; face recognition; facial expression recognition
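
The data flow described in the abstract (encoder, frontal raw feature network, decoder) can be illustrated with a toy sketch. This is not the authors' implementation: the class name `ToyMCEDN`, the layer sizes, the random weights, the choice to feed the encoder output into the frontal raw feature network, and fusion by concatenation are all assumptions made for illustration. Real MCEDN layers are convolutional and trained with multitask losses, which this sketch omits.

```python
import random


def make_weights(in_dim, out_dim, rng):
    """Random (out_dim x in_dim) weight matrix as nested lists (untrained, illustrative)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]


def dense_relu(weights, x):
    """Matrix-vector product followed by ReLU; stands in for a convolutional block."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


class ToyMCEDN:
    """Hypothetical three-module pipeline mirroring the abstract's description."""

    def __init__(self, image_dim=16, feature_dim=8, seed=0):
        rng = random.Random(seed)
        # Module 1: encoder extracts local features from the multi-pose image.
        self.encoder = make_weights(image_dim, feature_dim, rng)
        # Module 2: frontal raw feature network (assumed here to read encoder output).
        self.frontal_net = make_weights(feature_dim, feature_dim, rng)
        # Module 3: decoder synthesizes the frontal image from the fused features.
        self.decoder = make_weights(2 * feature_dim, image_dim, rng)

    def forward(self, image):
        local = dense_relu(self.encoder, image)
        frontal_raw = dense_relu(self.frontal_net, local)
        # Fusion by concatenation is an assumption; the abstract only says "fusing".
        fused = local + frontal_raw
        return dense_relu(self.decoder, fused)


model = ToyMCEDN()
frontal = model.forward([0.5] * 16)  # a flattened 16-pixel stand-in for a face image
print(len(frontal))
```

In this sketch all three modules sit in one forward pass, so a joint training objective would update them together; that is one simple reading of the end-to-end, parameter-sharing design the abstract mentions.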











