构建并行卷积神经网络的表情识别算法被引量：47

Expression recognition algorithm for parallel convolutional neural networks

导出

摘要目的表情识别在商业、安全、医学等领域有着广泛的应用前景,能够快速准确地识别出面部表情对其研究与应用具有重要意义。传统的机器学习方法需要手工提取特征且准确率难以保证。近年来,卷积神经网络因其良好的自学习和泛化能力得到广泛应用,但还存在表情特征提取困难、网络训练时间过长等问题,针对以上问题,提出一种基于并行卷积神经网络的表情识别方法。方法首先对面部表情图像进行人脸定位、灰度统一以及角度调整等预处理,去除了复杂的背景、光照、角度等影响,得到了精确的人脸部分。然后针对表情图像设计一个具有两个并行卷积池化单元的卷积神经网络,可以提取细微的表情部分。该并行结构具有3个不同的通道,分别提取不同的图像特征并进行融合,最后送入Soft Max层进行分类。结果实验使用提出的并行卷积神经网络在CK+、FER2013两个表情数据集上进行了10倍交叉验证,最终的结果取10次验证的平均值,在CK+及FER2013上取得了94. 03%与65. 6%的准确率。迭代一次的时间分别为0. 185 s和0. 101 s。结论为卷积神经网络的设计提供了一种新思路,可以在控制深度的同时扩展广度,提取更多的表情特征。实验结果表明,针对数量、分辨率、大小等差异较大的表情数据集,该网络模型均能够获得较高的识别率并缩短训练时间。 Objective Face emotion recognition is widely applied in the fields of commercial,security,and medicine.Rap-id and accurate identification of facial expressions are of great significance for their research and application.Several tradi-tional machine learning methods,such as support vector machine( SVM),principal component analysis( PCA),and localbinary pattern( LBP)are used to identify facial expressions.However,these traditional machine learning algorithms re-quire manual feature extraction.In this process,some features are hidden or deliberately enlarged due to many human in-terventions,which affect accuracy.In recent years,convolutional neural networks( CNNs)have been used extensively inimage recognition due to their good self-learning and generalization capabilities.However,several problems,such as diffi-culty in facial expression feature extraction and long training time of neural network,are still observed with neural networktraining.This study presents an expression recognition method based on parallel CNN to solve the aforementioned problems.Method First,a series of preprocessing operations is performed on facial expression images.For example,an originalimage is detected by using an Ada Boost cascade classifier to remove the complex background and obtain the face part.Then,a face image is compensated by illumination,a histogram equalization method is used to stretch the image nonlinear-ly,and the pixel value of the image is reallocated.Finally,affine transformation is used to achieve face alignment.Thepreceding preprocessing can remove complex background effects,compensate lighting,and adjust the angle to obtain moreaccurate face parts than that of the original image.Then,a CNN with two parallel convolution and pooling structures,whichcan extract subtle expressions,is designed for facial expression images.This parallel unit is the core unit of the CNN andcomprises a convolutional layer,a pooling layer,and an activation function Re Lu.This parallel structure has three differentchannels,in which each channel has different number of convolutions,pooling layers,and Re Lu to extract different imagefeatures and fuse the extracted features.The second parallel processing unit can perform convolution and pooling on theextracted features by the first parallel processing unit and reduce the dimension of the image and shorten the training time ofCNN.Finally,the previously merged features are sent to the Soft Max layer for expression classification.Result CK+ andFER2013 expression datasets that have undergone pre-processing and data enhancement are divided into 10 equal parts.Then,training and testing are performed on 10 parts,and the final accuracy is the average of the 10 results.Experimentalresults show that the accuracy increases and time decreases remarkably compared with traditional machine learning meth-ods,such as SVM,PCA,and LBP or their combination and other classical CNNs,such as Alex Net and Goog Le Net.Final-ly,CK + and FER2013 achieve 94.03% and 65.6% accuracy,and the iteration time reaches 0.185 s and 0.101 s,respectively.Conclusion This study presents a new parallel CNN structure that extracts the features of facial expressions byusing three different convolutional and pooling structures.The three paths have different combinations of convolutional andpooling layers,and they can extract different image features.The different extracted features are combined and sent to thenext layer for processing.This study provides a new concept for the design of CNNs,which can extend the breadth of CNNand control the depth.The proposed CNN can extract many expressions that are ignored or difficult to extract.CK+ andFER2013 expression datasets have large difference in quantity,size,and resolution.The experiments of CK + andFER2013 show that the model can extract the precise and subtle features of facial expression images in a relatively shorttime under the premise of ensuring the recognition rate.

作者徐琳琳张树美赵俊莉 Xu Linlin;Zhang Shumei;Zhao Junli(College of Data Science and Software Engineering,Qingdao University,Qingdao 266071,China)

机构地区青岛大学数据科学与软件工程学院

出处《中国图象图形学报》 CSCD 北大核心 2019年第2期227-236,共10页 Journal of Image and Graphics

基金国家自然科学基金项目(41501698) 国家自然科学基金青年科学基金项目(61702293) 虚拟现实应用教育部工程研究中心开放基金项目(MEOBNUEVRA201601)~~

关键词表情识别深度学习卷积神经网络并行处理图像分类 expression recognition deeplearning convolutional neural network (CNN) parallel processing image classification

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1杨格兰,邓晓军,刘琮.基于深度时空域卷积神经网络的表情识别模型[J].中南大学学报（自然科学版）,2016,47(7):2311-2319. 被引量：13
2傅启明,刘全,王辉,肖飞,于俊,李娇.一种基于线性函数逼近的离策略Q(λ)算法[J].计算机学报,2014,37(3):677-686. 被引量：25
3何俊,蔡建峰,房灵芝,何忠文.基于LBP/VAR与DBN模型的人脸表情识别[J].计算机应用研究,2016,33(8):2509-2513. 被引量：21

二级参考文献42

1宋伟,赵清杰,宋红,樊茜.基于关键块空间分布与Gabor滤波的人脸表情识别算法[J].中南大学学报（自然科学版）,2013,44(S2):239-243. 被引量：7
2Ekman P, Friesen W V. Constants across culture in the face and emotion [J]. Journal of Personality Social Psychol, 1971, 17 (2) : 124-129.
3Mehrabian A. Communication without words [J ]. PSyChology To- day,1968, 2(4) : 53-56.
4Zeng Zhihong, Roisman G I, Huang T S. A survey of affect recogni- tion methods: audio, visual and spontaneous expression [J]. IEEE Yrans on Pattern Analysis and Machine Intelligence, 2009, 31 (1): 39-58.
5Cootes T F, Edwards G J, Taylor C J. Active appearance models [ J ]. IEEE Yrans on Patter Analysis and Machine Intelligence,2001, 23(6) : 681-685.
6Ojala T, Pietikanem M, Maenpaa T. Muhiresolution gray-scale and rotation invariant texture classification with local binary patterns [ J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2002, 24(7) : 971-987.
7Hinton G E, Osindero S, Teh Y W. A fast learning algorithm for deep belief net [ J]. Neural Computation,2006, 18 (7) : 1527- 1554.
8Yu Dang, Deng Li. Deep learning and its applications to signal and information processing [J]. IEEE Signal Processing Magazine, 2011,28(1) : 145-154.
9Sarikaya R, Hinton G E, Deoras A. Application of deep belief net- works for natural language understanding [ J]. I EEE/ACM Trans on Audio, Speech, and Language Processing, 2014, 22 (4) : 778- 784.
10Jones N, Zhu Jun. The learning machines [J]. Natura,2014, 505 (7482) :146-148.

共引文献56

1马春华,邵俊倩,秦兵.听障教学中手语识别技术的研究进展[J].绥化学院学报,2022,42(10):23-27. 被引量：2
2梁蒙蒙,周涛,张飞飞,杨健,夏勇.卷积神经网络及其在医学图像分析中的应用研究[J].生物医学工程学杂志,2018,35(6):977-985. 被引量：16
3戈军,周莲英.面向交通信号的两层递阶控制解决方案[J].计算机工程与应用,2015,51(20):246-252. 被引量：1
4钟珊,刘全,傅启明,章宗长,朱斐,龚声蓉.一种近似模型表示的启发式Dyna优化算法[J].计算机研究与发展,2015,52(12):2764-2775. 被引量：4
5谢文达.云计算环境下人脸表情智能识别改进技术研究[J].计算机测量与控制,2017,25(5):162-164. 被引量：1
6王准,何元烈.基于混合价值计算的云存储缓存替换方案[J].计算机工程与设计,2017,38(6):1651-1656. 被引量：4
7刘全,翟建伟,钟珊,章宗长,周倩,章鹏.一种基于视觉注意力机制的深度循环Q网络模型[J].计算机学报,2017,40(6):1353-1366. 被引量：18
8马技,李晶皎,李珍妮.基于视觉注意机制深度强化学习的行人检测方法[J].中国科技论文,2017,12(14):1570-1577. 被引量：10
9孔英会,陈咨彤,车辚辚.基于关键子区域及特征提取的表情识别[J].科学技术与工程,2017,17(34):257-262. 被引量：2
10刘全,翟建伟,章宗长,钟珊,周倩,章鹏,徐进.深度强化学习综述[J].计算机学报,2018,41(1):1-27. 被引量：431

同被引文献248

1姚建华,吴加敏,杨勇,施祖贤.全卷积神经网络下的多光谱遥感影像分割[J].中国图象图形学报,2020,0(1):180-192. 被引量：15
2鲍光海,林善银,徐林森.基于改进型卷积网络的汽车高度调节器缺陷检测方法[J].仪器仪表学报,2020,41(2):157-165. 被引量：11
3丁名都,李琳.基于CNN和HOG双路特征融合的人脸表情识别[J].信息与控制,2020,49(1):47-54. 被引量：16
4奚琰.基于对比学习的细粒度遮挡人脸表情识别[J].计算机系统应用,2022,31(11):175-183. 被引量：3
5曹芬芳,张晋朝,王娟,刘坤锋,杨海娟.学术搜索引擎用户适应性学术信息搜寻行为影响因素研究[J].国家图书馆学刊,2019,0(6):82-89. 被引量：6
6孟昭兰.为什么面部表情可以作为情绪研究的客观指标[J].心理学报,1987,19(2):124-134. 被引量：23
7马希荣,刘琳,桑婧.基于情感计算的e-Learning系统建模[J].计算机科学,2005,32(8):131-133. 被引量：13
8刘晓旻,谭华春,章毓晋.人脸表情识别研究的新进展[J].中国图象图形学报,2006,11(10):1359-1368. 被引量：61
9邓洪波,金连文.一种基于局部Gabor滤波器组及PCA+LDA的人脸表情识别方法[J].中国图象图形学报,2007,12(2):322-329. 被引量：36
10张红英,彭启琮.数字图像修复技术综述[J].中国图象图形学报,2007,12(1):1-10. 被引量：157

引证文献47

1丁名都,李琳.基于CNN和HOG双路特征融合的人脸表情识别[J].信息与控制,2020,49(1):47-54. 被引量：16
2闫美阳,李原.多源域混淆的双流深度迁移学习[J].中国图象图形学报,2019,24(12):2243-2254. 被引量：1
3王建霞,陈慧萍,李佳泽,张晓明.基于多特征融合卷积神经网络的人脸表情识别[J].河北科技大学学报,2019,40(6):540-547. 被引量：13
4林克正,白婧轩,李昊天,李骜.深度学习下融合不同模型的小样本表情识别[J].计算机科学与探索,2020,14(3):482-492. 被引量：14
5刘全明,辛阳阳.端到端的低质人脸图像表情识别[J].小型微型计算机系统,2020,41(3):668-672. 被引量：15
6刘尚旺,刘承伟,张爱丽.基于深度可分卷积神经网络的实时人脸表情和性别分类[J].计算机应用,2020,40(4):990-995. 被引量：7
7姚梦竹,黄官伟.基于卷积神经网络的人脸表情识别[J].电脑知识与技术,2020,16(16):19-23. 被引量：1
8黄俊,张娜娜,章惠.融合头部姿态和面部表情的互动式活体检测[J].计算机应用,2020,40(7):2089-2095. 被引量：1
9周涛,吕晓琪,任国印,谷宇,张明,李菁.基于集成卷积神经网络的面部表情分类[J].激光与光电子学进展,2020,57(14):316-327. 被引量：7
10张翔,史志才,陈良.基于SWA优化级联网络的表情识别方法[J].电子科技,2020,33(9):16-20. 被引量：3

二级引证文献166

1傅博,王洪光,宋屹峰.融合全局和局部特征的单幅图像去雨方法[J].信息与控制,2023,52(4):531-541.
2程龙欢,李舜酩.多源振动信号融合方法综述[J].计算机应用研究,2020,37(S02):12-14. 被引量：1
3吴青云,邹亚囡,史雪莹.基于卷积神经网络的电子鼻分类识别[J].吉林化工学院学报,2022,39(11):38-41. 被引量：1
4王珂,赵慧,张成,魏子涵.基于改进的YOLOv5人脸口罩识别算法[J].信息化研究,2022,48(6):38-45.
5陈雪,周子腾.基于生成对抗网络的非重复性CT几何伪影去除算法可行性研究[J].信息化研究,2022,48(6):33-37.
6陈帅,李焕锋,沙杰,崔巍,刘梦园.基于YOLOv5的砂纸表面缺陷检测方法研究[J].电子测量技术,2023,46(14):73-79. 被引量：1
7张华清,黄少华.基于眼动追踪的传统纹样提取过程研究[J].包装工程,2023,44(S01):209-216. 被引量：1
8杜英魁,刘鑫,王馨鹤,刘洪安,李若溪,原忠虎.面向青光眼患者家庭应用的可穿戴眼压监测终端[J].传感器与微系统,2020,39(6):154-157. 被引量：2
9张四平,王梅,邓华侔,胡念.远程医疗监护报警系统中的人脸表情识别算法研究[J].信息与电脑,2020,32(14):68-70. 被引量：3
10潘哲琦,付晓峰,陈旭坤.基于表情识别的情绪影集剪辑系统[J].电子技术与软件工程,2020(14):158-160. 被引量：1

1杜云,张璐璐,潘涛.基于卷积神经网络的矿工面部表情识别方法[J].工矿自动化,2018,44(5):95-100. 被引量：1
2黄丽雯,杨欢欢,王勃.非对称方向性局部二值模式人脸表情识别[J].计算机工程与应用,2018,54(23):183-188. 被引量：3
3李昊轩.基于深度学习的医疗图像分割[J].电子制作,2019,27(4):53-55. 被引量：4
4郭文强,高文强,肖秦琨,徐成,李梦然.基于小数据集下贝叶斯网络建模的面部表情识别[J].科学技术与工程,2018,18(35):179-183. 被引量：2
5孙登第,孟欠欠,马云鹏.图正则化迁移稀疏概念编码的跨域图像分类[J].计算机工程与应用,2019,55(6):197-203.
6苏岑,金瑜成,孙凯悦,戚国亮,黄佳杰,朱浩威,张石清.基于Gabor小波和主成分分析的人脸表情识别[J].台州学院学报,2018,40(6):12-17. 被引量：3
7吴欣怡.新媒体应用技术在教学中的应用[J].电脑迷,2018(3):44-44. 被引量：1
8杜云,张璐璐,潘涛.基于改进的主成分分析法的矿工表情识别[J].河北科技大学学报,2019,40(1):45-50. 被引量：5
9廖勇,过李峤.光伏发电最大功率点跟踪系统设计及潮流计算[J].无线互联科技,2019,16(1):47-49.
10微信安全团队.2018年11月朋友圈十大谣言[J].中国信息安全,2018(12):16-17.

中国图象图形学报

2019年第2期

浏览历史

内容加载中请稍等...

构建并行卷积神经网络的表情识别算法被引量：47

参考文献3

二级参考文献42

共引文献56

同被引文献248

引证文献47

二级引证文献166

相关作者

相关机构

相关主题

浏览历史

构建并行卷积神经网络的表情识别算法 被引量：47

参考文献3

二级参考文献42

共引文献56

同被引文献248

引证文献47

二级引证文献166

相关作者

相关机构

相关主题

浏览历史

构建并行卷积神经网络的表情识别算法被引量：47