摘要
In order to achieve high recognition rate,most facial expression recognition(FER)methods generate sufficient labeled facial images based on generative adversarial networks(GAN)to train model.However,these methods do not estimate the facial pose before passing the images to the generator,which affects the quality of generated images.And mode collapse is prone to occur during the training process,leading to generate a single-style facial images.To solve these problems,a FER model is proposed based on pose conditioned dendritic convolution neural network(PCD-CNN)with pose and expression.Before passing the facial images to the generator,PCD-CNN was used to process facial images,effectively estimating the facial landmarks to detect face and disentangle the pose.In order to accelerate the training speed of the model,PCD-CNN was based on the ShuffleNet-v2 framework.Every landmark of facial image was modeled by a separate ShuffleNet-DeconvNet,maintaining better performance with fewer parameters.To solve the mode collapse during image generation,we theoretically analyzed the causes,and implemented mini-batch processing on the discriminator in the model and directly calculated the statistical characteristics of the mini-batch samples.Experiments were carried out on the Multi-PIE and BU-3DFE facial expression datasets.Compared with current advanced methods,our method achieves higher accuracy 93.08%,and the training process is more stable.
出处
《国际计算机前沿大会会议论文集》
2020年第1期518-533,共16页
International Conference of Pioneering Computer Scientists, Engineers and Educators(ICPCSEE)
基金
We would like to acknowledge the support from the National Science Foundation of China(61472095).