Attention-guided three-stream convolutional neural network for microexpression recognition
Abstract: Microexpression recognition has important application value in many fields, such as psychological counseling, lie detection, and intention analysis. However, because microexpressions have small motion amplitudes and short durations, their recognition performance still leaves considerable room for improvement. To further advance microexpression recognition, this paper proposes an attention-guided three-stream convolutional neural network (ATSCNN). First, the onset and apex frames of every microexpression sequence are preprocessed, and the TV-L1 (total variation-L1) energy functional is then used to extract the optical flow between the two frames. In the feature extraction stage, to overcome the overfitting caused by the limited sample size, three identical shallow convolutional neural networks extract features from the three optical-flow inputs, and a convolutional block attention module is introduced to focus on important information and suppress irrelevant information, improving recognition performance. Finally, the extracted features are fed into a fully connected layer for classification. In addition, the whole architecture uses the SELU (scaled exponential linear unit) activation function to speed up convergence. Under LOSO (leave-one-subject-out) cross-validation on the combined microexpression dataset, the unweighted average recall (UAR) and unweighted F1-score (UF1) reach 0.7351 and 0.7205, respectively; compared with the Dual-Inception model, the best-performing comparison method, these are improvements of 0.0607 and 0.0683. The experimental results confirm the feasibility of the method: the proposed network effectively alleviates overfitting while achieving state-of-the-art recognition performance on small-scale microexpression datasets.

Objective: In recent years, microexpression recognition has shown remarkable application value in various fields such as psychological counseling, lie detection, and intention analysis. However, unlike macro-expressions, which are produced in conscious states, microexpressions often occur in high-risk scenarios and are generated unconsciously. They are characterized by small action amplitudes and short duration, and they usually affect only local facial areas. These features also make microexpression recognition difficult. Traditional methods used in early research mainly include methods based on local binary patterns and methods based on optical flow. The former can effectively extract the texture features of microexpressions, whereas the latter calculates pixel changes in the temporal domain and the relationship between adjacent frames, providing rich, key input information for the network. Although traditional methods based on texture and optical-flow features made good progress in early microexpression recognition, they often incur considerable cost and leave room for improvement in recognition accuracy and robustness. Later, with the development of machine learning, microexpression recognition based on deep learning gradually became the mainstream of research in this field. Such methods use neural networks to extract features from input image sequences after a series of preprocessing operations (facial cropping and alignment, and grayscale conversion) and classify them to obtain the final recognition result. The introduction of deep learning has substantially improved recognition performance. However, given the characteristics of microexpressions themselves, recognition accuracy can still be improved considerably, and the limited scale of existing microexpression datasets also restricts the recognition of such emotional behaviors. To solve these problems, this paper proposes an attention-guided three-stream convolutional neural network (ATSCNN) for microexpression recognition.

Method: First, considering that the motion changes between adjacent frames of a microexpression are very subtle, and to reduce redundant information and computation in the image sequence while preserving the important features of the microexpression, preprocessing operations such as facial alignment and cropping are performed only on the two key frames of each microexpression (the onset frame and the apex frame). This yields single-channel grayscale images with a resolution of 128 × 128 pixels and reduces the influence of nonfacial areas on recognition. Then, because optical flow can capture representative motion features between two frames, it provides a higher signal-to-noise ratio than the raw data and supplies rich, critical input features for the network. Therefore, the total variation-L1 (TV-L1) energy functional is used to extract optical-flow features between the two frames: the horizontal component of the optical flow, the vertical component of the optical flow, and the optical strain. Next, in the feature extraction stage, to overcome the overfitting problem caused by the limited sample size, three identical four-layer convolutional neural networks extract features from the horizontal flow component, the vertical flow component, and the optical strain (the input channel numbers of the four convolutional layers are 1, 3, 5, and 8, and the output channel numbers are 3, 5, 8, and 16), thus improving network performance. Afterward, because the image sequences in the microexpression datasets used in this paper inevitably contain some redundant information other than the face, a convolutional block attention module (CBAM), in which channel attention and spatial attention are connected in series, is added after the shallow convolutional neural network of each stream to focus on the important information of the input and suppress irrelevant information while attending to both the channel and spatial dimensions, thereby enhancing the network's ability to obtain effective features and improving recognition performance. Finally, the extracted features are fed into a fully connected layer to classify the microexpression emotions (negative, positive, and surprise). In addition, the entire model architecture uses the scaled exponential linear unit (SELU) activation function to avoid the potential neuron-death and vanishing-gradient problems of the commonly used rectified linear unit (ReLU) activation function and to speed up the convergence of the network.

Result: Experiments were conducted on the combined microexpression dataset using the leave-one-subject-out (LOSO) cross-validation strategy, in which each subject in turn serves as the test set and all remaining samples are used for training. This protocol makes full use of the samples, provides a degree of generalization, and is the most commonly used validation scheme in current microexpression recognition research. The unweighted average recall (UAR) and unweighted F1-score (UF1) reached 0.7351 and 0.7205, respectively. Compared with the Dual-Inception model, which performed best among the comparison methods, UAR and UF1 increased by 0.0607 and 0.0683, respectively. To further verify the effectiveness of the proposed ATSCNN architecture, several ablation experiments were also conducted on the combined dataset, and their results confirmed the feasibility of the proposed method.

Conclusion: The microexpression recognition network proposed in this paper effectively alleviates overfitting, focuses on the important information of microexpressions, and achieves state-of-the-art (SOTA) recognition performance on small-scale microexpression datasets under LOSO cross-validation. Compared with other mainstream models, the proposed method achieves the best recognition performance, and the ablation results make the method more convincing. In conclusion, the proposed method remarkably improves the effectiveness of microexpression recognition.
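As an illustration of the optical-flow step summarized above, the following Python sketch computes the TV-L1 flow between the onset and apex frames and derives an optical-strain map from it. It assumes opencv-contrib-python (for the cv2.optflow module) and 8-bit grayscale 128 × 128 face crops; the strain formula shown is a common formulation in the micro-expression literature and may differ in detail from the one used by the authors.

```python
# Sketch: TV-L1 optical flow and optical strain between the onset and apex
# frames, assuming 128x128 single-channel (grayscale) crops as described in
# the abstract. Requires opencv-contrib-python for cv2.optflow.
import cv2
import numpy as np

def extract_flow_inputs(onset_gray: np.ndarray, apex_gray: np.ndarray):
    """Return (horizontal flow, vertical flow, optical strain), each H x W."""
    tvl1 = cv2.optflow.createOptFlow_DualTVL1()    # TV-L1 energy functional
    flow = tvl1.calc(onset_gray, apex_gray, None)  # H x W x 2 array of (u, v)
    u, v = flow[..., 0], flow[..., 1]

    # Optical strain: magnitude of the symmetric part of the flow gradient.
    # This is one common formulation; the paper's exact definition may differ.
    du_dy, du_dx = np.gradient(u)
    dv_dy, dv_dx = np.gradient(v)
    e_xy = 0.5 * (du_dy + dv_dx)
    strain = np.sqrt(du_dx ** 2 + dv_dy ** 2 + 2.0 * e_xy ** 2)
    return u, v, strain
```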
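The PyTorch sketch below illustrates one possible reading of the ATSCNN architecture described in the abstract: three identical shallow streams with channel widths 1→3→5→8→16, SELU activations throughout, a CBAM block after each stream, and a fully connected head over the concatenated features for the three emotion classes. Kernel sizes, pooling layers, the CBAM reduction ratio, and the global average pooling before the classifier are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch of the attention-guided three-stream CNN (ATSCNN) from the
# abstract. Channel widths, SELU, per-stream CBAM, and the 3-class head follow
# the abstract; all other hyperparameters here are assumptions.
import torch
import torch.nn as nn

class CBAM(nn.Module):
    """Convolutional block attention module: channel then spatial attention."""
    def __init__(self, channels: int, reduction: int = 4, spatial_kernel: int = 7):
        super().__init__()
        self.mlp = nn.Sequential(               # shared MLP for channel attention
            nn.Conv2d(channels, max(channels // reduction, 1), 1, bias=False),
            nn.SELU(),
            nn.Conv2d(max(channels // reduction, 1), channels, 1, bias=False),
        )
        self.spatial = nn.Conv2d(2, 1, spatial_kernel,
                                 padding=spatial_kernel // 2, bias=False)

    def forward(self, x):
        ca = torch.sigmoid(self.mlp(x.mean((2, 3), keepdim=True)) +
                           self.mlp(x.amax((2, 3), keepdim=True)))
        x = x * ca                               # channel attention
        sa = torch.sigmoid(self.spatial(torch.cat(
            [x.mean(1, keepdim=True), x.amax(1, keepdim=True)], dim=1)))
        return x * sa                            # spatial attention

def make_stream() -> nn.Sequential:
    """One shallow 4-layer stream: channels 1 -> 3 -> 5 -> 8 -> 16, plus CBAM."""
    layers, chans = [], [1, 3, 5, 8, 16]
    for c_in, c_out in zip(chans[:-1], chans[1:]):
        layers += [nn.Conv2d(c_in, c_out, 3, padding=1), nn.SELU(), nn.MaxPool2d(2)]
    layers.append(CBAM(16))
    return nn.Sequential(*layers)

class ATSCNN(nn.Module):
    def __init__(self, num_classes: int = 3):    # negative / positive / surprise
        super().__init__()
        self.streams = nn.ModuleList([make_stream() for _ in range(3)])
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(3 * 16, num_classes)

    def forward(self, flow_u, flow_v, strain):
        feats = [self.pool(s(x)).flatten(1)
                 for s, x in zip(self.streams, (flow_u, flow_v, strain))]
        return self.fc(torch.cat(feats, dim=1))
```

A forward pass would take the three flow inputs as tensors of shape (N, 1, 128, 128), e.g. ATSCNN()(u, v, strain).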
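For the evaluation protocol, the sketch below shows how the reported metrics are conventionally computed: UAR is the macro-averaged (unweighted) recall and UF1 the macro-averaged F1 over the three emotion classes, with predictions pooled across all LOSO folds. It uses scikit-learn; train_and_predict is a hypothetical placeholder for fitting the model on one fold and predicting the held-out subject, and the arrays are assumed to be NumPy arrays indexable by fold indices.

```python
# Sketch: LOSO cross-validation with pooled UAR / UF1, as commonly done in
# microexpression recognition benchmarks. train_and_predict is hypothetical.
import numpy as np
from sklearn.metrics import recall_score, f1_score
from sklearn.model_selection import LeaveOneGroupOut

def loso_uar_uf1(features, labels, subject_ids, train_and_predict):
    """Pool per-fold predictions over all subjects, then report UAR and UF1."""
    y_true, y_pred = [], []
    for train_idx, test_idx in LeaveOneGroupOut().split(features, labels, subject_ids):
        preds = train_and_predict(features[train_idx], labels[train_idx],
                                  features[test_idx])
        y_true.extend(labels[test_idx])
        y_pred.extend(preds)
    uar = recall_score(y_true, y_pred, average="macro")  # unweighted average recall
    uf1 = f1_score(y_true, y_pred, average="macro")      # unweighted F1-score
    return uar, uf1
```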
Authors: 赵明华 (Zhao Minghua), 董爽爽 (Dong Shuangshuang), 胡静 (Hu Jing), 都双丽 (Du Shuangli), 石程 (Shi Cheng), 李鹏 (Li Peng), 石争浩 (Shi Zhenghao) (School of Computer Science and Engineering, Xi'an University of Technology, Xi'an 710048, China; Shaanxi Key Laboratory of Network Computing and Security Technology, Xi'an 710048, China)
Source: Journal of Image and Graphics (《中国图象图形学报》), indexed in CSCD and the Peking University Core Journals list, 2024, Issue 1, pp. 111-122 (12 pages)
Funding: National Key Research and Development Program of China (2017YFB1402103-3); National Natural Science Foundation of China (61901363, 61901362); Natural Science Foundation of Shaanxi Province (2020JQ-648, 2019JM-381, 2019JQ-729); Key Laboratory Projects of the Shaanxi Provincial Department of Education (20JS086, 20JS087).
Keywords: microexpression recognition; optical flow; three-stream convolutional neural network; convolutional block attention module (CBAM); SELU activation function