持续学习改进的人脸表情识别被引量：3

Facial expression recognition improved by continual learning

导出

摘要目的大量标注数据和深度学习方法极大地提升了图像识别性能。然而,表情识别的标注数据缺乏,训练出的深度模型极易过拟合,研究表明使用人脸识别的预训练网络可以缓解这一问题。但是预训练的人脸网络可能会保留大量身份信息,不利于表情识别。本文探究如何有效利用人脸识别的预训练网络来提升表情识别的性能。方法本文引入持续学习的思想,利用人脸识别和表情识别之间的联系来指导表情识别。方法指出网络中对人脸识别整体损失函数的下降贡献最大的参数与捕获人脸公共特征相关,对表情识别来说为重要参数,能够帮助感知面部特征。该方法由两个阶段组成:首先训练一个人脸识别网络,同时计算并记录网络中每个参数的重要性;然后利用预训练的模型进行表情识别的训练,同时通过限制重要参数的变化来保留模型对于面部特征的强大感知能力,另外非重要参数能够以较大的幅度变化,从而学习更多表情特有的信息。这种方法称之为参数重要性正则。结果该方法在RAF-DB(real-world affective faces database),CK+(the extended Cohn-Kanade database)和Oulu-CASIA这3个数据集上进行了实验评估。在主流数据集RAF-DB上,该方法达到了88.04%的精度,相比于直接用预训练网络微调的方法提升了1.83%。其他数据集的实验结果也表明了该方法的有效性。结论提出的参数重要性正则,通过利用人脸识别和表情识别之间的联系,充分发挥人脸识别预训练模型的作用,使得表情识别模型更加鲁棒。 Objective Facial expression recognition(FER)has become an important research topic in the field of computer vision.FER plays an important role in human-computer interaction.Most studies focus on classifying basic discrete expressions(i.e.,anger,disgust,fear,happiness,sadness,and surprise)using static image-based approaches.Recognition performance in deep learning-based methods has progressed considerably.Deep neural networks,especially convolutional neural networks(CNNs),achieve outstanding performance in image classification tasks.A large amount of labeled data is needed for training deep networks.However,insufficient samples in many widely used FER datasets lead to overfitting in the trained model.Fine-tuning a network that has been well pre-trained on a large face recognition dataset is commonly performed to solve the shortage of samples in FER datasets and prevent overfitting.The pre-trained network can capture facial information and the similarity between face recognition(FR)and FER domains facilitates the transfer of features.Although this transfer learning strategy demonstrates satisfactory performance,the fine-tuned FR network may still contain face-dominated information,which can weaken the network’s ability to represent different expressions.On the one hand,we expect to reserve the strong ability of the FR network to capture important facial information,such as face contour,and guide the FER network training in real cases.On the other hand,we want the network to learn additional expression-specific information.The FER model training using a continual learning approach is proposed to utilize the close relationship between FR and FER effectively and exploit the ability of the pre-trained FR network.Method This study aims to train an expression recognition network with auxiliary significant information of face recognition network instead of only using a fine-tuning approach.We first introduce a continual learning approach into the field of FER.Continual learning analyzes the problem learning from an infinite stream of data with the objective of gradually extending the acquired knowledge and using it for future learning.Synaptic intelligence consolidates important parameters of previous tasks to solve the problem of catastrophic forgetting and alleviate the reduction in performance by preventing those important parameters from changing in future tasks.Similar to continual learning,we conduct the FR task before the FER task is added.However,we only focus on the performance of the later task while continual learning also aims to alleviate the catastrophic forgetting of the original task.Sequential tasks in continual learning commonly contain a small number of classes so that important parameters are related to current classes.However,important parameters are more likely to capture common facial features rather than specific classes due to the large amount of categories in the FR task,thereby remarkably increasing their contributions to the total loss.Hence,a two-stage training strategy is proposed in this study.We train a FR network and compute each parameter’s importance while training in the first stage.We refine the pre-trained network with the supervision of expression label information while preventing important parameters from excessively changing in the second stage.The loss function for expression classification is composed of two parts,namely,softmax loss and parameter-wise importance regularization.Result We conduct experiments on three widely used FER datasets,including CK+(the extended Cohn-Kanade database),Oulu-CASIA,and RAF-DB(real-word affective faces database).RAF-DB is an in-the-wild database while the two other databases are laboratory-controlled.The use of RAF-DB achieves an accuracy of 88.04%,which improves the performance of direct fine-tuning by 1.83%and surpasses the state-of-the-art algorithm self-cure network(SCN)by 1.01%.The result using CK+improves the fine-tuning baseline by 1.1%.The experiment using Oulu-CASIA also indicated that the network has satisfactory generalization performance with the addition of parameter-wise importance regularization.Meanwhile,the effect of such regularization improves the performance on in-the-wild datasets more remarkblely due to the more complex faces under occlusion and pose variations.Conclusion We exploit the relationship between FR and FER and adopt the idea and algorithm of continual learning in FER to avoid overfitting in this study.The main purpose and effect of continual learning is to preserve the powerful feature extraction ability of the FR network via parameter-wise importance regularization and allow less-important parameters to learn additional expression-specific information.The experimental results showed that our training strategy helps the FER network to learn additional discriminative features and thus promote recognition performance.

作者江静邓伟洪 Jiang Jing;Deng Weihong(School of Artifical Intelligence,Beijing University of Posts and Telecommunications,Beijing 100876,China)

机构地区北京邮电大学人工智能学院

出处《中国图象图形学报》 CSCD 北大核心 2020年第11期2361-2369,共9页 Journal of Image and Graphics

基金国家自然科学基金项目(61871052) 国家重点研发计划项目(2019YFB1406504)。

关键词深度学习表情识别(FER) 人脸识别(FR) 预训练网络持续学习参数重要性正则 deep learning facial expression recognition(FER) face recognition(FR) pre-trained network continual learning parameter-wise importance regularization

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献22

1丁名都,李琳.基于CNN和HOG双路特征融合的人脸表情识别[J].信息与控制,2020,49(1):47-54. 被引量：17
2卢官明,朱海锐,郝强,闫静杰.基于深度残差网络的人脸表情识别[J].数据采集与处理,2019,34(1):50-57. 被引量：15
3潘仙张,张石清,郭文平.多模深度卷积神经网络应用于视频表情识别[J].光学精密工程,2019,27(4):963-970. 被引量：19
4姬秋敏,张灵,陈云华,麦应潮,向文,罗源.基于视觉机制与协同显著性的自发式表情识别[J].计算机工程与设计,2019,40(6):1741-1746. 被引量：3
5胡敏,余胜男,王晓华.基于约束性循环一致生成对抗网络的人脸表情识别方法[J].电子测量与仪器学报,2019,31(4):169-177. 被引量：4
6王田辰,吴秦,宗海燕.几何显著变化的表情识别特征构造[J].计算机科学与探索,2019,13(7):1227-1238. 被引量：1
7李宏菲,李庆,周莉.基于多视觉描述子及音频特征的动态序列人脸表情识别[J].电子学报,2019,47(8):1643-1653. 被引量：9
8刘振,王甦菁,李擎.基于多任务中级特征个性化学习的微表情识别[J].计算机工程与应用,2019,55(18):151-154. 被引量：4
9向南,张明敏,杨黎丽.一种基于Hawkes过程的隐藏情绪倾向识别方法[J].北京理工大学学报,2019,39(10):1086-1090. 被引量：1
10刘全明,辛阳阳.端到端的低质人脸图像表情识别[J].小型微型计算机系统,2020,41(3):668-672. 被引量：18

引证文献3

1耿涛.基于位移特征与个性化学习的动态序列人脸表情识别方法[J].兰州文理学院学报（自然科学版）,2021,35(6):51-55. 被引量：2
2高涛,杨朝晨,陈婷,邵倩,雷涛.深度多尺度融合注意力残差人脸表情识别网络[J].智能系统学报,2022,17(2):393-401. 被引量：11
3刘晖,吴倩颖,张小俊,王银茂,苏晓幸.新型防弹玻璃设计方法[J].建筑玻璃与工业玻璃,2024(4):23-25.

二级引证文献13

1孙兰兰.复杂场景下多姿态人脸知识蒸馏识别方法[J].黑龙江工业学院学报（综合版）,2022,22(9):92-97. 被引量：2
2吴家辉,周涛,罗明新,肉扎吉·依马穆.基于C3D CNN的人脸表情识别系统设计与开发[J].信息与电脑,2022,34(14):104-107.
3张洁,穆静,钱智哲.改进的ResNeXt50神经网络面部表情识别方法[J].西安工业大学学报,2022,42(6):610-619. 被引量：1
4朱鸿杰,吕志刚,邸若海,孙晓静,郝可青.改进MD-MTD的神经网络锂电池寿命预测仿真[J].西安工业大学学报,2022,42(6):620-626.
5王彬,徐杨,石进,张显国.多分支精简双线性池化的人脸表情识别[J].计算机技术与发展,2023,33(3):27-33. 被引量：1
6闫河,李梦雪,张宇宁,刘建骐.面向表情识别的重影非对称残差注意力网络模型[J].智能系统学报,2023,18(2):333-340. 被引量：1
7王海平,刘宇轩,谷晓钢.关于检测专注度的人脸识别综述[J].信息与电脑,2023,35(12):166-168.
8倪锦园,张建勋.多尺度坐标注意力金字塔卷积的面部表情识别[J].计算机工程与应用,2023,59(22):242-250. 被引量：1
9张金栋,王宏志.基于多尺度和局部特征融合的人脸表情识别[J].长春工业大学学报,2023,44(4):300-305.
10袁德荣,张勇,唐颖军,李波燕,谢宝来.多尺度残差注意力网络及其表情识别算法[J].小型微型计算机系统,2024,45(1):30-36.

1张玉存,李亚彬,付献斌.基于曲率约束的点云分割去噪方法[J].计量学报,2020,41(10):1218-1225. 被引量：10
2罗琪.以深度学习方法为载体的医学影像实时变化检测算法分析[J].粘接,2020,44(12):132-135. 被引量：3
3丘秀桃.运用PBL项目式学习改进美术特长生英语话题写作教学[J].校园英语,2020(31):160-161.
4刘自然,李谦,颜丙生,尚坤.堆叠稀疏自编码深度神经网络算法及其在滚动轴承故障诊断中的应用[J].机床与液压,2020,48(23):208-213. 被引量：5
5夏伟.基于传感器的无线网络恶意节点检测研究[J].西安文理学院学报（自然科学版）,2020,23(4):41-46. 被引量：4
6李珊,邓伟洪.深度人脸表情识别研究进展[J].中国图象图形学报,2020,25(11):2306-2320. 被引量：30
7刘碧翠,覃仕鹤,余新华.垂体后叶素对咯血患者内分泌影响的临床分析[J].医药前沿,2020,10(23):41-42.
8刘风花.加味补肾活血汤辅治输卵管阻塞性不孕症临床观察[J].实用中医药杂志,2020,36(11):1449-1451. 被引量：1
9彭小江,乔宇.面部表情分析进展和挑战[J].中国图象图形学报,2020,25(11):2337-2348. 被引量：13
10桂涛.假如没有英吉利海峡[J].现代阅读,2020(12):53-53.

中国图象图形学报

2020年第11期

浏览历史

内容加载中请稍等...

持续学习改进的人脸表情识别被引量：3

同被引文献22

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

持续学习改进的人脸表情识别 被引量：3

同被引文献22

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

持续学习改进的人脸表情识别被引量：3