基于深度全卷积神经网络的图像识别研究被引量：1

Research on Image Recognition Based on Deep Fully Convolutional Neural Network

下载PDF

导出

摘要将要建立多层卷积网络模型,并使用AlexNet预训练模型,在此基础上进行迁移学习,使用kaggle的猫狗数据集进一步训练,模型最终能高灵活度、高准确率的识别猫狗图像,并且不受图像中猫狗的占比大小影响。该网络模型共有6 000万参数,一共包含8个卷积层,其中某些卷积层带有归一化层和池化层,最后一层是具有两个通道的图像输出,每个通道的值分别代表图像为猫和狗的概率。整个网络模型,弃用全连接层,选用全卷积网络来代替全连接层,大大提高网络的灵活性,解决了输入图像分辨率的限制问题,并且全卷积网络的前向传播更加高效,加快了训练的速度。为了方便分析以及进一步的研究,将可视化一层卷积和二层卷积所得到的卷积核和特征图。 A multi-layer convolutional network model will be established, and the AlexNet pre-training model will be used. On this basis, migration learning will be performed. Kaggle’s cat and dog data set will be used for further training. The model will eventually be able to recognize cat and dog images with high flexibility and high accuracy without being affected by the proportion of cats and dogs in the image. The network model has a total of 60 million parameters and a total of 8 convolutional layers. Some of the convolutional layers have a normalization layer and a pooling layer. The last layer is an image output with two channels, and the value of each channel respectively represent the probability whether the image is a cat or a dog. The fully connected layer is abandoned in the entire network model, and the fully connected layer is replaced by a fully convolutional network, which greatly improves the flexibility of the network, solves the limitation of input image resolution, and the forward propagation of the fully convolutional network is more efficient speeds up training. In order to facilitate analysis and further research, this article will visualize the convolution kernel and feature maps obtained by one-layer convolution and two-layer convolution.

作者姬壮伟 JI Zhuang-wei(Department of Computer Science,Changzhi University,Changzhi Shanxi,046011)

机构地区长治学院计算机系

出处《山西大同大学学报（自然科学版）》 2022年第2期27-29,74,共4页 Journal of Shanxi Datong University(Natural Science Edition)

关键词深度学习全卷积网络卷积可视化 deep learning fully convolutional network convolutional visualization

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献7

1白林亭,海钰琳.基于梯度分析的卷积神经网络可视化方法[J].信息技术与信息化,2021(4):61-63. 被引量：1
2王磊.人工神经网络原理、分类及应用[J].科技资讯,2014,12(3):240-241. 被引量：16
3孙瑜阳.深度学习及其在图像分类识别中的研究综述[J].信息技术与信息化,2018(1):138-140. 被引量：19
4李梦洁,董峦.基于PyTorch的机器翻译算法的实现[J].计算机技术与发展,2018,28(10):160-163. 被引量：16
5万磊,佟鑫,盛明伟,秦洪德,唐松奇.Softmax分类器深度学习图像分类方法应用综述[J].导航与控制,2019,0(6):1-9. 被引量：63
6葛梦颖,于重重,周兰,马钰锡.基于协同半监督的深度学习图像分类算法[J].计算机仿真,2019,36(2):196-200. 被引量：9
7王亮申,欧宗瑛,朱玉才,侯杰,于京诺.基于SVM的图像分类[J].计算机应用与软件,2005,22(5):98-99. 被引量：18

二级参考文献32

1Alberto Del Bimbo and Pietro Pala.Visual Image Retrieval by Elastic Matching of User Sketches.IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(2):121～132.
2Smith J R,Chang S F.Transform feature for texture classification and discrimination in large image database.In:Proc of IEEE Int'l Conf on Image Processing.Austrin,Texas,1994.
3Guodong Guo,Stan Z.Li,Kap Luk Chan.Support Vector Machines Recognition.Image and Vision Computing ,19(2001):631～638.
4Pierre M L Drezet,Robert F Harrison.A new method for sparsity control in support vector classification and regression.Pattern recognition,2001,34:111～125.
5Vapnik V,Lerner A.Pattern Recognition Using Generalized Portrait.Automation and Remote Control.1963,24(6):774～780.
6S.Amari,S Wu.Improving Support Vector Machine Classifiers by Modifying Kernel Functions.Neural Networks 12(199):783～789.
7Victor L Brailovsky,Ofir Barzily,Rabin Shahave.On Global Local and Neighborhood Kernels for Support Vector Machines.Pattern Recognition Letters,20(1999):1183～1190.
8Courant R,Hibert B.Methods of mathematical physics.Vol.1.New York:Widely-interscience,1953.
9王亮申.[D].大连:大连理工大学,2003.
10金连文,徐秉铮.基于多级神经网络结构的手写体汉字识别[J].通信学报,1997,18(5):21-27. 被引量：19

共引文献134

1吴荣火,甘馥榕,李姗姗,秦丹.考研类网站访问量的统计预测模型及其应用[J].玉林师范学院学报,2020,41(3):115-121.
2韩庆生.TensorFlow与Pytorch环境的搭建[J].计算机产品与流通,2020,0(5):124-124. 被引量：4
3刘斌,贾浩强,杨一,申佳,盖美辰,宋天霖.基于改进OpenPose算法的矿工危险行为识别研究[J].电视技术,2023,47(2):20-23. 被引量：2
4丁胜男,李威,蔡立明,李蒙,胡常青.基于目标特征分布增强卷积神经网络的红外目标检测算法[J].导航与控制,2024,23(1):97-106.
5许洋洋,袁华.一种基于内容的广告垃圾图像过滤方法[J].山东大学学报（理学版）,2006,41(3):73-78. 被引量：9
6许将军,赵辉.高光谱遥感图像特征提取及分类研究——基于离散余弦变换(DCT)及支撑向量机技术[J].佳木斯大学学报（自然科学版）,2006,24(4):468-470.
7汤井田,胡丹,龚智敏.基于SVM的SAR图像分类研究[J].遥感技术与应用,2008,23(3):341-345. 被引量：13
8汤井田,胡丹,龚智敏.基于SVM的图像纹理特征分类研究[J].计算机工程与科学,2008,30(8):44-45. 被引量：10
9章智儒.SVM在图像分类中的应用[J].信息技术,2009,33(8):133-136. 被引量：7
10王立辉,黄进良,孙俊英.基于SVM的环境减灾卫星HJ-1B影像作物分类识别研究[J].世界科技研究与发展,2009,31(6):1029-1032. 被引量：3

同被引文献3

1李成豪,张静,胡莉,肖贤鹏,张华.基于多尺度感受野融合的小目标检测算法[J].计算机工程与应用,2022,58(12):177-182. 被引量：10
2郑云飞,王晓兵,张雄伟,曹铁勇,孙蒙.基于金字塔知识的自蒸馏HRNet目标分割方法[J].电子学报,2023,51(3):746-756. 被引量：4
3徐高,周武杰,叶绿.基于边界图卷积的机器人行驶路障场景解析[J].浙江科技学院学报,2023,35(5):402-411. 被引量：1

引证文献1

1张喻铭,周武杰,叶绿.基于自蒸馏和双模态的室内场景解析算法[J].浙江科技学院学报,2024,36(3):218-227.

1覃东妮.有偿委托合同中任意解除权的适用及其限制问题[J].黑龙江人力资源和社会保障,2022(15):95-97. 被引量：1

山西大同大学学报（自然科学版）

2022年第2期

浏览历史

内容加载中请稍等...

基于深度全卷积神经网络的图像识别研究被引量：1

参考文献7

二级参考文献32

共引文献134

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于深度全卷积神经网络的图像识别研究 被引量：1

参考文献7

二级参考文献32

共引文献134

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于深度全卷积神经网络的图像识别研究被引量：1