摘要
针对人脸编辑存在的编辑结果不自然、生成图像变化较大等问题,提出了一种有闭解的可控人脸编辑算法。首先,随机采样n个潜在向量来构造样本矩阵,并计算出该矩阵的前k个主成分向量;然后,利用ResNet-50得到人脸图像的5个属性,并通过支持向量机(SVM)计算出各属性的语义边界;最后,计算这些属性的可解释方向向量,这些向量在尽量靠近主成分向量的同时也尽量远离对应属性的语义边界,从而减小人脸属性之间的耦合性,并提高编辑过程中的可控性。该算法具有闭解,因此效率较高。实验结果表明,所提算法和语义的闭式分解(SeFa)算法和可解释的生成对抗网络控制(GANSpace)算法相比,在初始分数(IS)上分别增加了19%和26%,在弗雷歇距离(FID)上分别减小了4%和37%,在最大平均差异(MMD)上分别减小了15%和48%。可见,该算法具有较好的可控性和解耦性。
To solve the problems in face editing,such as unnatural editing results and great changes in generated images,a controllable face editing algorithm with closed-form solution was proposed.Firstly,n latent vectors were sampled randomly to construct a sample matrix,and the top k principal component vectors of the matrix were calculated.Then,five attributes of face image were obtained by ResNet-50,and the semantic boundary of each attribute was calculated by Support Vector Machine(SVM).Finally,the interpretable direction vectors of these attributes were calculated,which were as closed to the principal components vectors as possible and stayed as far away from the semantic boundary of the corresponding attribute as possible at the same time,thereby reducing the coupling between facial attributes,and improving the controllability in face editing.Because the algorithm has a closed-form solution,it has high efficiency.Experimental results show that the compared with closed-form Factorization of latent Semantics in GANs(SeFa)algorithm and Discovering Interpretable Generative Adversarial Network Controls(GANSpace)algorithm,the proposed algorithm increases the Inception Score(IS)by 19% and 26% respectively,decreases the Fréchet Inception Distance(FID)by 4% and 37% respectively,and decreases the Maximum Mean Discrepancy(MMD)by 15% and 48% respectively.It can be seen that this algorithm has good controllability and decoupling.
作者
陶玲玲
刘波
李文博
何希平
TAO Lingling;LIU Bo;LI Wenbo;HE Xiping(School of Artificial Intelligence,Chongqing Technology and Business University,Chongqing 400067,China;Chongqing Key Laboratory of Intelligent Perception and BlockChain Technology(Chongqing Technology and Business University),Chongqing 400067,China)
出处
《计算机应用》
CSCD
北大核心
2023年第2期601-607,共7页
journal of Computer Applications
基金
重庆工商大学重点平台基金资助项目(950119093)
重庆工商大学研究生“创新型科研项目”(yjscxx2021-112-99)。
关键词
生成对抗网络
人脸编辑
潜在空间
语义空间
属性语义边界
Generative Adversarial Network(GAN)
face editing
latent space
semantic space
attribute semantic boundary