微表情峰值帧定位引导的分类算法

Apex frame spotting and recognition of micro-expression by optical flow

导出

摘要目的微表情是人在外界信息和刺激下做出的无意识面部动作,是判断受试人情绪和行为的重要佐证,在社会安全、商业谈判和心理辅导等领域都有着广泛的应用。微表情不同于一般的表情,分类与定位较为困难。针对这种情况,提出了一种基于光流窗口的双分支微表情定位网络(dual-branch optical flow spotting network,DFSN)和一种利用峰值帧光流信息的微表情分类网络,以识别视频中的微表情。方法在定位任务中,首先提取面部图像,选择光流窗口大小和位置,计算面部光流并进行预处理;接下来输入双分支网络中进行两次分类,分别针对有无微表情和在有微表情前提下微表情所处阶段分类,并结合两个损失函数抑制过拟合;最后绘制出微表情强度曲线,曲线峰值所处位置即为所求微表情峰值帧。在分类任务中,选取视频起始帧和定位网络取得的峰值帧作为光流窗口,并利用欧拉运动放大算法(Eulerian motion magnification,EMM)放大微表情,最后采用峰值帧光流信息分类微表情视频。结果微表情定位网络分别在CASME Ⅱ(Chinese Academy of Sciences Micro-expression Database Ⅱ)数据集和CASME数据集上按照使用留一被试交叉验证法进行了实验,与目前最好的定位方法比较,此网络在CASME Ⅱ上获得了最低的NMAE(normalized mean absolute error)值0.101 7,比Optical flow+UPC方法提高了9%。在CASME上获得的NMAE值为0.137 8,在此数据集上为次优定位方法。在定位网络得到的峰值基础上,分类网络在CASME Ⅱ上取得了89.79%的准确率,在CASME上取得了66.06%的准确率。若采用数据集标注的峰值,分类网络在CASME Ⅱ上取得了91.83%的准确率,在CASME上取得了76.96%的准确率。结论提出的微表情定位网络可以有效定位视频中微表情峰值帧的位置,帮助后续网络进行分类,微表情分类网络可以有效区分不同种类的微表情视频。 Objective Micro-expressions are unconscious facial actions made by people under external information and stimulation.These expressions are crucial proofs to judge people’s emotions and thoughts.Micro-expressions are widely used in the fields of social security,business negotiation,and psychological counseling.This type of expression is different from the general macro-expression and demonstrates characteristics of short duration, low expression intensity, and fastchange speed. Therefore, compared with macro-expressions, micro-expressions are more difficult to recognize and locate.Before the emergence of deep learning, researchers mostly used the traditional hand-crafted method, which utilizes the arti⁃ficially designed micro-expression extractors and complex parameter adjustment processes and algorithms to extract fea⁃tures. Some excellent algorithms can achieve competitive results, such as local binary pattern-three orthogonal plane andmain directional mean optical flow (MDMO). However, these algorithms mostly only extract shallow features, and improv⁃ing their accuracy is difficult. With the development of machine learning in the field of computer vision, the researchmethod of micro-expression based on deep learning has immediately become the mainstream. This method generally usesconvolutional neural network to extract and classify the image or video features. The accuracy of micro-expression identifi⁃cation is markedly improved due to its powerful feature extraction and learning capability. However, the spotting and classi⁃fication of micro-expressions are still difficult tasks due to the subtle characteristics of micro-expressions and the difficultyof extracting effective features. Therefore, this paper proposes a dual-branch optical flow spotting network based on opticalflow window, which can promote the solution of these problems. Method First, the size of the optical flow window isselected in accordance with the number of video frames, and three frames at both ends of the window are taken to stabilizethe optical flow intensity. Dlib library is used to detect faces, and Farneback method is used to extract facial optical flowfeatures and preprocess the optical flow image. The image size is finally converted into 224 × 224 pixels. The dual-branchnetwork is then inputted for two classifications to address the presence or absence of micro-expression and the rising or fall⁃ing state of micro-expression. The twice classification should be judged in accordance with the same characteristics. There⁃fore, the same network backbone is used, and then the branches are utilized to process the characteristics, thereby focus⁃ing on different directions. Combining two loss functions can suppress the overfitting of the network, complete classifica⁃tion, and improve the network performance. Finally, the micro-expression state in the video window is obtained by slidingthe window, and the intensity curve is drawn. Multiple windows are selected for positioning due to the different durations ofmicro-expression, and the highest point among them is taken as the apex frame. The classification network is different fromthe location network in two aspects. First, the front end of the window is the second to the fourth frame of the video and theback end uses the micro-expression part of the video. Second, Euler motion magnification is used to process video. Thismethod can amplify facial motion and improve expression intensity but will destroy some optical flow features;thus, themethod is not used in the positioning network. When classifying videos, the apex frame of the positioning network is takenas the center, and the five surrounding positions are selected as the input of the classification network. The classificationnetwork uses the uncomplicated network structure and obtains good results, proving the importance of apex frame spotting.Result The micro-expression spotting network is based on leave-one-subject-out cross-validation method on the ChineseAcademy of Sciences Micro-expression Database II (CASME II) and the Chinese Academy of Sciences Micro-expressionDatabase (CASME), which is the most commonly used validation method in the current micro-expression identificationresearch. Compared with the current best spotting method, the lowest normalized mean absolute error (NMAE) value of0. 101 7 is obtained on the CASME II, which is 9% lower than the current best spotting method. The NMAE value obtainedon the CASME is 0. 137 8, which is currently the second lowest number. Using this micro-expression spotting network, theclassification network achieved 89. 79% accuracy of three categories (positive, negative, and surprise) in the microexpression classification experiment of CASME II and 66. 06% accuracy of four categories (disgust, tense, repression, andsurprise) in the micro-expression classification experiment of CASME. Using the apex frame in dataset, the classificationnetwork achieved 91. 83% and 76. 96% accuracy on CASME II and CASME, respectively. Conclusion The proposedmicro-expression spotting network can effectively locate the position of the apex frame in the video and then extract its effec⁃tive micro-expression information. Extensive experimental evaluation proved that the spotting network has good spottingeffect. The subsequent classification network shows that the extraction of effective micro-expression information such as anapex frame can significantly help the network in classifying micro-expressions. Overall, the proposed micro-expressionspotting network can substantially improve the accuracy of micro-expression recognition.

作者李博凯吴从中项柏杨臧怀娟任永生詹曙 Li Bokai;Wu Congzhong;Xiang Baiyang;Zang Huaijuan;Ren Yongsheng;Zhan Shu(Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei 230601,China;School of Computer and Information,Hefei University of Technology,Hefei 230601,China;School of Metallurgy and Energy Engineering,Kunming University of Science and Technology,Kunming 650093,China)

机构地区合肥综合性国家科学中心人工智能研究院合肥工业大学计算机与信息学院昆明理工大学冶金与能源工程学院

出处《中国图象图形学报》 CSCD 北大核心 2024年第5期1447-1459,共13页 Journal of Image and Graphics

基金国家自然科学基金项目(52104303) 安徽省教育厅安徽高校协同创新项目(GXXT-2022-041)。

关键词微表情定位情感计算峰值帧微表情分类图像识别深度学习 micro-expression spotting affective computing apex frame micro-expression classification image recognition deep learning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1牛瑞华,杨俊,邢斓馨,吴仁彪.基于卷积注意力模块和双通道网络的微表情识别算法[J].计算机应用,2021,41(9):2552-2559. 被引量：12
2刘德志,梁正友,孙宇.结合空间注意力机制与光流特征的微表情识别方法[J].计算机辅助设计与图形学学报,2021,33(10):1541-1552. 被引量：14
3闵睿朋,李一凡,黄瑶,杨剑宇,钟宝江.形状的全尺度可视化表示与识别[J].中国图象图形学报,2022,27(2):628-641. 被引量：2
4佘文祥,刘斌,陶建华,张昊,吕钊.多通道运动特征融合的微表情识别方法[J].计算机辅助设计与图形学学报,2021,33(9):1457-1465. 被引量：2
5阳治民,宋威.选择并融合粗细粒度特征的细粒度图像识别[J].中国图象图形学报,2023,28(7):2081-2092. 被引量：1

二级参考文献12

1周瑜,刘俊涛,白翔.形状匹配方法研究与展望[J].自动化学报,2012,38(6):889-910. 被引量：85
2贲晛烨,杨明强,张鹏,李娟.微表情自动识别综述[J].计算机辅助设计与图形学学报,2014,26(9):1385-1395. 被引量：45
3徐浩然,杨剑宇,黄伟国,尚丽.形状的不变量特征提取与识别[J].中国图象图形学报,2017,22(8):1068-1078. 被引量：4
4毕威,黄伟国,张永萍,高冠琪,朱忠奎.基于图像显著轮廓的目标检测[J].电子学报,2017,45(8):1902-1910. 被引量：15
5刘望舒,郑丹晨,韩敏.一种基于改进地貌形状上下文的形状匹配方法[J].自动化学报,2017,43(10):1749-1758. 被引量：1
6贾棋,于美玉,樊鑫,高新凯,郭禾.基于曲率分级的形状编码及识别方法[J].计算机学报,2018,41(11):2453-2466. 被引量：2
7张延良,卢冰,洪晓鹏,赵国英,张伟涛.基于局部区域方法的微表情识别[J].计算机应用,2019,39(5):1282-1287. 被引量：9
8刘汝涵,徐丹.视频放大和深度学习在微表情识别任务上的应用[J].计算机辅助设计与图形学学报,2019,31(9):1535-1541. 被引量：12
9张冬明,靳国庆,代锋,袁庆升,包秀国,张勇东.基于深度融合的显著性目标检测算法[J].计算机学报,2019,42(9):2076-2086. 被引量：34
10吴仁彪,赵娅倩,屈景怡,高爱国,陈文秀.基于CBAM-CondenseNet的航班延误波及预测模型[J].电子与信息学报,2021,43(1):187-195. 被引量：25

共引文献25

1刘洋,吴佩,万芷涵,石佳玉,朱立芳.用户微表情信息表征研究综述[J].知识管理论坛,2023(3):215-227. 被引量：2
2刘汝卿,李锋,蒋衍,朱精果.基于FPGA的运动目标实时检测系统设计[J].计算机测量与控制,2022,30(4):56-59. 被引量：6
3周伟航,肖正清,钱育蓉,马玉民,公维军,帕力旦·吐尔逊.微表情自动分析方法研究综述[J].计算机应用研究,2022,39(7):1921-1932. 被引量：4
4陈东升.构建潜意识互动的会计在线交流评估方法研究[J].中国新技术新产品,2022(13):146-148.
5Tongping Shen,Huanqing Xu.Facial Expression Recognition Based on Multi-Channel Attention Residual Network[J].Computer Modeling in Engineering & Sciences,2023(4):539-560. 被引量：1
6黄豪豪,李铭田,张富春.优化算法在人脸表情识别中的应用研究[J].延安大学学报（自然科学版）,2022,41(3):56-60.
7朱文球,李永胜,黄史记,阳昊彤.基于ACNN和Bi-LSTM的微表情识别[J].湖南工业大学学报,2022,36(6):34-41.
8于明,钟元想,王岩.人脸微表情分析方法综述[J].计算机工程,2023,49(2):1-14. 被引量：7
9陈思伟,戴丹,郑剑,郑辛煜,康浩愉,莫佳莉,顾晓波.基于改进的ResNet152V2模型对临安山核桃果仁等级分类研究[J].中国粮油学报,2023,38(1):90-100. 被引量：1
10丁东平,李海涛.基于DP-DBNet和MHA-CRNN的船牌号检测与识别[J].计算机系统应用,2023,32(3):209-216. 被引量：1

1刘雨萌,桑海峰.基于关键帧定位的人体异常行为识别[J].电子测量与仪器学报,2024,38(3):104-111.
2刘达,赵暾,张占月.高超声速飞行器三通道耦合制导律与鲁棒控制律设计[J].战术导弹技术,2023(5):97-103.
3陈果,刘科生.光照变化下的非接触式血氧饱和度检测方法研究[J].医疗卫生装备,2024,45(4):32-38.
4李俊,曹林,张帆,杜康宁,郭亚男.分布统计特征的孪生网络目标跟踪方法[J].计算机工程与应用,2024,60(8):213-224.
5清风.走出失恋的“心痛”[J].心理与健康,2024(5):74-75.
6汤萍.高中政治活动型课程的分类与定位[J].中文科技期刊数据库（全文版）教育科学,2019(3):221-221.
7Information for authors[J].Science in China(Series F),2007,50(3).
8田梦泽.敦煌艺术的影视化创作研究[J].戏剧之家,2024(10):175-177.
9王超君.中华优秀传统文化在中职商贸类教学中的运用[J].文化创新比较研究,2024,8(13):145-149.
10Information for authors[J].Acta Pharmacologica Sinica,2016,37(1).

中国图象图形学报

2024年第5期

浏览历史

内容加载中请稍等...

微表情峰值帧定位引导的分类算法

参考文献5

二级参考文献12

共引文献25

相关作者

相关机构

相关主题

浏览历史