Small Sample Face Recognition Model Based on VGG and ResNet

Abstract: With the development of science and technology and the spread of information over the Internet, people have access to broader platforms for learning and entertainment and can follow events from all over the world. When watching foreign competitions, however, viewers often cannot tell the participants apart ("face blindness") because they know little about them. This paper therefore analyzes the facial information of foreign celebrities to help people identify the celebrities they are interested in more quickly. Face recognition plays an important role in many areas of daily life, and many methods have been proposed for it. Among them, the convolutional neural network (CNN) is a key technology driving current machine learning and performs well in image classification. When only a small-sample data set is available, prediction with a pretrained model works comparatively well and is an efficient recognition method. This paper performs face recognition with a CNN and with existing pretrained models. The data set is expanded through data augmentation that randomly combines geometric transformations, image blurring, and brightness and contrast adjustment. Data augmentation removes the influence of differences in scale, position, and viewpoint among the samples, gives the model translation and scale invariance, makes the trained model more robust, and improves its recognition accuracy. In addition, Batch Normalization is added at the input layer so that the inputs to each network layer follow the same distribution, which speeds up training and permits a higher learning rate, and the adaptive ReLU and RMSProp algorithms are used to accelerate convergence and reduce the error rate. The resulting network reaches an accuracy of 76.6%. The pretrained models VGG16, VGG19, and ResNet50 are then fitted to the sample data. After extensive parameter tuning, VGG19 performs best, reaching an accuracy of 89.4%.
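The augmentation step described in the abstract could look roughly like the following. This is a minimal sketch, not the author's code: it assumes grayscale images stored as NumPy arrays with values in [0, 1], uses a horizontal flip as the geometric transformation and a 3x3 mean filter as the blur, and all function names, parameter ranges, and the application probability `p` are illustrative assumptions.

```python
import numpy as np


def horizontal_flip(img, rng):
    """Geometric transformation (assumed choice): mirror the image left to right."""
    return img[:, ::-1]


def box_blur(img, rng):
    """Image blurring (assumed choice): 3x3 mean filter with edge padding."""
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    out = np.zeros((h, w), dtype=np.float64)
    # Sum the nine shifted copies of the image, then average.
    for dy in range(3):
        for dx in range(3):
            out += padded[dy:dy + h, dx:dx + w]
    return out / 9.0


def brightness_contrast(img, rng):
    """Random contrast gain (alpha) and brightness shift (beta), clipped to [0, 1]."""
    alpha = rng.uniform(0.8, 1.2)   # hypothetical contrast range
    beta = rng.uniform(-0.1, 0.1)   # hypothetical brightness range
    return np.clip(alpha * img + beta, 0.0, 1.0)


TRANSFORMS = (horizontal_flip, box_blur, brightness_contrast)


def augment(img, rng, p=0.5):
    """Apply each transform independently with probability p (random combination)."""
    out = img.astype(np.float64)
    for transform in TRANSFORMS:
        if rng.random() < p:
            out = transform(out, rng)
    return out


def expand_dataset(images, copies, seed=0):
    """Return `copies` augmented variants of every input image."""
    rng = np.random.default_rng(seed)
    return [augment(img, rng) for img in images for _ in range(copies)]
```

Calling `expand_dataset(images, copies=5)` would return five randomly augmented variants per input image; in practice the originals would normally be kept in the training set alongside them.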

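Author: Huang Jiayue (黄嘉悦)

Affiliation: School of Mathematics and Statistics, Guizhou University

Source: Operations Research and Fuzziology (运筹与模糊学), 2023, No. 3, pp. 2474-2486 (13 pages)

Keywords: convolutional neural network; face recognition; image classification; pretrained model

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]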
