期刊文献+

基于迁移学习和批归一化的菜肴图像识别方法 被引量:4

FOOD IMAGE RECOGNITION BASED ON TRANSFER LEARNINGAND BATCH NORMALIZATION
下载PDF
导出
摘要 菜肴图像识别属于图像细粒度识别。针对菜肴子类之间差距小、外观差异大且受外界因素影响难以识别问题,提出一种基于迁移学习和批归一化结合的深度学习模型菜肴图像识别方法。以预训练的VGG-16为迁移学习基础,对部分卷积层以及全连接层输出做批归一化处理,最终得到尺度变换和平移后的特征集合。通过迁移学习解决深度学习所带来的过拟合问题,获取比人工特征更具有鉴别性的深度特征;通过批归一化处理缓解深度学习中存在的梯度消失问题。迁移学习的相关实验中以loss、top1、top5准确率为指标;批归一化相关实验中以top1准确率和top5准确率为指标。实验表明,在VireoFood172和UEC-Food100数据集上,所提出的模型与原始模型相比,loss明显下降,准确率有大幅提升,并且与现有方法相比在菜肴图像识别的top1和top5准确率上均有所提升。 Food image recognition is a kind of fine-grained image recognition.Considering small gaps among subclasses of various food,large differences in appearance and other uncertain external factors make it difficult to recognize food images,a deep learning model based on transfer learning and batch normalization is put forward to deal with these problems.Based on the pre-trained VGG-16 model,outputs of partial convolution layers and all fully connected layers were normalized,and we obtained the features after scale transform and scale translation.Transfer learning was applied to the model to overcome over-fitting caused by deep learning in some way as well as obtaining more discriminative in-depth features than artificial features.Batch normalization could help solve the problem of gradient disappearance in deep learning.The indicators in the related experiments of transfer learning were loss,top1 precision and top5 precision,while top1 precision and top5 precision were used as indicators in experiments related with batch normalization.The results of the experiments show that the loss decreases significantly,and the precision is greatly improved on VireoFood 172 and UEC-Food 100 datasets compared with the primitive model.Compared with the existing methods,the accuracy of top 1 and top 5 of food image recognition is improved.
作者 郭心悦 胡沁涵 刘纯平 杨季文 Guo Xinyue;Hu Qinhan;Liu Chunping;Yang Jiwen(School of Computer Science and Technology,Soochow University,Suzhou 215006,Jiangsu,China)
出处 《计算机应用与软件》 北大核心 2021年第3期124-133,共10页 Computer Applications and Software
基金 国家自然科学基金项目(61773272,61272258) 江苏高校优势学科建设工程项目。
关键词 菜肴识别 卷积神经网络 VGG-16 迁移学习 批归一化 Food image recognition Convolutional neural network(CNN) VGG-16 Transfer learning Batch normalization
  • 相关文献

参考文献5

二级参考文献37

  • 1Wah C, Branson S, Welinder P, et al. The Caltech-UCSD birds- 200-2011 dataset: Computation & Neural Systems Technical Report, CNS-TR-2011-001 [ R]. San Diego, USA: California Institute of Technology, 2011.
  • 2Nilsback M E, Zisserman A. Automated flower classification over a large number of classes [ C ~//Proceedings of the 6th Indian Conference on Computer Vision, Graphics & Image Processing. Bhubaneswar, India: IEEE, 2008 : 722-729. [DOI: 10.1109/ ICVGIP. 2008.47 ].
  • 3Deng J, Dong W, Socher R, et al. ImageNet: a large-scale hier- archical image database[ C ]//Proceedings of 2009 IEEE Confer- ence on Computer Vision and Pattern Recognition. Miami, FL, USA: IEEE, 2009: 248-255. [DOI: 10. ll09/CVPR. 2009. 5206848 ].
  • 4Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks [ C ~//Proceedings of Ad- vances in Neural Information Processing Systems 25, NIPS. Lake Tahoe, Nevada, USA: MIT press, 2012, 25 (2) : 1097-1105.
  • 5Zeiler M D, Fergus R. Visualizing and understanding eonvolu- tional networks [ C ]//Proceedings of the 13th European Confer- ence on Computer Vision-ECCV 2014. Zurich, Switzerland: Springer International Publishing, 2014: 818-833. [DOI: 10. 1007/978 -3 -319 -10590-1 _531.
  • 6Russakovsky O, Deng J, Huang Z H, et al. Detecting avocados to zucchinis: what have we done, and where are we going? [ C]//Proceedings of the 2013 IEEE International Conference on Computer Vision. Sydney, NSW: IEEE, 2013: 2064-2071. [ DOI : 10.1109/ICCV. 2013. 2581.
  • 7Zhang N, Donahue J, Girshick R, et al. Part-based R-CNNs for fine-grained category detection [ C ]//Proceedings of the 13th Eu- ropean Conference on Computer Vision-ECCV 2014. Zurich, Switzerland: Springer International Publishing, 2014 : 834-849. [ DOI: 10. 1007/978-3-319-10590-1_54].
  • 8Farrell R, Oza O, Zhang N, et al. Birdlets: subordinate categori- zation using volumetric primitives and pose-normalized appear- ance [ C ~//Proceedings of the 2011 International Conference on Computer Vision. Barcelona, Spain: IEEE, 2011: 161-168. [ DOI : 10.1109/ICCV. 2011. 6126238 ].
  • 9Greeting C, Rodner E, Freytag A, et al. Nonparametric part transfer for fine-grained recognition [ C ]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recogni- tion. Columbus, OH, USA: IEEE, 2014: 2489-2496. [ DOI: 10.1109/CVPR. 2014.319 ].
  • 10Zhang N, Farrell R, Iandola F, et al. Deformable part descriptors for fine-grained recognition and attribute prediction [ C ]//Pro- ceedings of the 2013 IEEE International Conference on ComputerVision. Sydney, NSW, Australia: IEEE, 2013: 729-736. DOI: 10. ll09/ICCV. 2013.96].

共引文献176

同被引文献30

引证文献4

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部