
Facial Expression Recognition Based on Improved AlexNet

Cited by: 24
Abstract: Facial expressions are affected by pose, object occlusion, lighting changes, and attributes such as race, gender, and age, so a convolutional neural network must learn features more effectively and accurately. AlexNet achieves low accuracy in expression recognition and places restrictions on the input image size. To address these problems, this paper proposes a facial expression recognition algorithm based on an improved AlexNet. Multi-scale convolution is introduced into AlexNet, which makes the network better suited to small expression images and extracts feature information at different scales. As multiple low-level features are passed forward through the network, they are also fused with high-level features through cross-connections, so that the fused features reflect the image information more completely and accurately and yield a more accurate classifier. Cross-connections, however, cause a sharp growth in the number of parameters, which makes the network hard to train and degrades recognition. Global average pooling is therefore applied to reduce the dimensionality of the low-level features, which reduces both the parameters introduced by the cross-connections and overfitting. The proposed algorithm achieves accuracies of 94.25% and 93.02% on the CK+ and JAFFE databases, respectively.
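The role of global average pooling in the cross-connection can be illustrated with a small sketch. This is not the authors' implementation: the feature-map shapes, the number of fully connected units, and the helper names (`global_average_pool`, `fuse`, `fc_weights`) are all hypothetical, chosen only to show why pooling a low-level feature map before fusing it keeps the parameter count small compared with flattening it.

```python
from typing import List

# A feature map indexed as [channel][row][col]. Shapes below are hypothetical.
FeatureMap = List[List[List[float]]]


def global_average_pool(fmap: FeatureMap) -> List[float]:
    """Collapse each channel of a C x H x W feature map to its mean value."""
    pooled = []
    for channel in fmap:
        total = sum(sum(row) for row in channel)
        count = sum(len(row) for row in channel)
        pooled.append(total / count)
    return pooled


def flatten(fmap: FeatureMap) -> List[float]:
    """Alternative to pooling: keep every activation (C * H * W values)."""
    return [value for channel in fmap for row in channel for value in row]


def fuse(high_level: List[float], low_level_maps: List[FeatureMap]) -> List[float]:
    """Cross-connection sketch: append pooled low-level descriptors
    to the high-level feature vector."""
    fused = list(high_level)
    for fmap in low_level_maps:
        fused.extend(global_average_pool(fmap))
    return fused


def fc_weights(input_len: int, units: int) -> int:
    """Weights (biases ignored) of a fully connected layer on the given input."""
    return input_len * units


# Hypothetical low-level map: 3 channels of 4 x 4, channel c filled with c.
low = [[[float(c)] * 4 for _ in range(4)] for c in range(3)]
high = [0.5, 0.25]  # hypothetical high-level feature vector

fused = fuse(high, [low])
print(fused)  # [0.5, 0.25, 0.0, 1.0, 2.0]

# Extra weights the cross-connection adds to a hypothetical 10-unit layer:
print(fc_weights(len(flatten(low)), 10))              # 480 when flattened
print(fc_weights(len(global_average_pool(low)), 10))  # 30 when pooled
```

In this toy setting the pooled cross-connection contributes one value per channel instead of one per activation, so the added weights scale with the channel count rather than with the full spatial size of the low-level map.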
Authors: Yang Xu (杨旭); Shang Zhenhong (尚振宏) (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan 650500, China)
Source: Laser & Optoelectronics Progress (激光与光电子学进展), indexed in CSCD and the Peking University Core Journals list, 2020, No. 14, pp. 235-242 (8 pages)
Funding: National Natural Science Foundation of China (11873027, 61462052).
Keywords: image processing; image classification; expression recognition; AlexNet; feature extraction; multi-scale convolution; cross-connection; global average pooling; feature fusion
