期刊文献+

用于人像提取及半身像合成的生成对抗网络算法 被引量:4

Head Area Extraction and Portrait Synthesis Method Using GAN
下载PDF
导出
摘要 利用生成对抗网络(generative adversarial network,GAN)进行标准上半身人像的合成,从普通人像照片中截取部分区域得到面部对齐后的标准化上半身合成图像,处理后的标准化人像实现了目标主体与背景的分离,可以有效地优化目标识别和分割算法的结果.图像的合成过程分为2个主要步骤,首先利用图像特征识别人脸并截取头部区域,然后以裁切后的头部区域为中心进行上半身人像的合成,得到人脸特征点及头部区域对齐后的上半身合成图像.该算法可以有效地从背景中分离人像区域,利用合成后的图像进行图像分割和评价,可以避免图像背景对于图像识别主体的干扰.通过自有数据集验证了该算法可以改善分割算法的精确度、召回率和F值,最终合成人脸图像的Facenet平均距离及标准差相比现有的人脸图像正则化算法均有减小,通过在CelebA及LFW等通用数据集上的验证测试,显示出算法具有良好的通用性和适应性,该算法可以广泛适用于人像照片的主体提取和人像合成,作为分割和识别等应用的前置步骤. The paper described a general method for portrait synthesis using the generative adversarial network(GAN),which can generate a standard feature point aligned portrait image by cropping facial area from an in-the-wild photo.The main target of processed image is separated from background and the object detection and segmentation algorithm results are optimized.The processing pipeline includes two main parts:firstly,recognize head area using low-level hand-craft features;secondly,use the cropped area as the input of GAN to synthesis portrait image with facial feature aligned.This method can effectively extract facial parts of the image and avoid affection from the background pattern and objects,as well as enhance the facial segmentation of existing algorithms.The experimental results optimized the precision,recall and F-measure values of the existing segmentation algorithm,demonstrated in CelebA and LFW datasets,which are different from the self-made training dataset,and decreased the Facenet distance and standard deviation compared with the state-of-art face frontalization algorithms,showed well generalization ability and proved that this method can be widely used as preprocessing of image segmentation and portrait synthesis methods.
作者 何冀军 申远 郭玉堂 郑津津 He Jijun;Shen Yuan;Guo Yutang;Zheng Jinjin(School of Engineering Science,University of Science and Technology of China,Hefei 230026;School of Computer Science,Hefei Normal University,Hefei 230601;School of Electronic Information and Electrical Engineering,Hefei 230601)
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2020年第4期599-605,共7页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金联合基金(GG2090090072,U1332130,U1713206) 111引智工程(B07033) 安徽省重点研究与开发计划(1704a0902051) 安徽省科技重大专项(18030901033) 安徽省高校自然科学研究重点项目(KJ2018A0487) 安徽省自然科学基金(1908085ME135)。
关键词 生成对抗网络 图像生成 人像合成 图像分割 generative adversarial network photo generate portrait synthesis image segmentation
  • 相关文献

参考文献3

二级参考文献64

  • 1柴秀娟,山世光,卿来云,陈熙霖,高文.基于3D人脸重建的光照、姿态不变人脸识别[J].软件学报,2006,17(3):525-534. 被引量:54
  • 2Zhao W, Chellappa R, Rosenfeld A, et al. Face recognition: literature survey [ J]. ACM Computing Surveys, 2003, 35 (4) 399-458.
  • 3Phillips P, Grother P, Micheals R, et al. FRVT evaluation re- port [ EB/OL ]. [ 2012-05-02 ]. http://www, frv. org/FRVT 2002/documents. htm.
  • 4Li Y, Su G. Face pose estimation and synthesis by 2D morphable model [ C ]//Proceedings of the International conference on Com- putational Intelligence and Security. Heidelberg: Springer-Verlag Berlin, 2007 : 1001-1008.
  • 5Akshay A, Tom G, Roland G, et al. Learning-based face syn- thesis for pose-robust recognition from single image [ C ]//Pro- ceedings of the British Machine Vision Conference. London: British Machine Vision Association and Society for Pattern Recog- nition, 2009: 1-10.
  • 6Blanz V, Vetter T. A morphable model for the synthesis of 3 D faces [ C ]//Proceedings of the 26th Annual Conference on Com- puter Graphics and Interactive Techniques. New York: ACM Press, 1999: 187-194.
  • 7Blanz V, Vetter T. Face recognition based on fitting a 3D mor- phable model [ J ]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25 (9) : 1063-1074.
  • 8Duda R O, Hart P E,tork D G. Pattern Classification [ M]. 2rid ed. New York: John Wiley & Sons, 2001.
  • 9Jones M, Poggio T. Multidimensional morphable models: a framework for representing and matching object classes [ J ]. International Journal of Computer vision, 1998, 29 (2) : 107 -131.
  • 10Romdhani S, Vetter T. Efficient robust and accurate fitting of a 3 D morphable model [ C ]//Proceedings of the 9th IEEE Interna- tional Conference on Computer Vision. New York: IEEE Press,2003 : 59-66.

共引文献27

同被引文献17

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部