基于改进DPGN的少样本图像分类算法研究

Research on image classification algorithm with few-shot based on im proved DPGN

下载PDF

导出

摘要 DPGN(distribution propagation graph network)是基于深度学习的少样本图像分类算法,在数据稀疏的条件下可以顺利完成图像分类,但其分类的准确率仍需进一步提升。以DPGN算法为研究对象,提出SFOD_DPGN(SinAM_FRN_layer_ODConv_DM&EMD_distribution propagation graph network)算法。在骨干神经网络Resnet12的残差块中融入注意力机制;将Resnet12网络中批量归一化与ReLu激活函数搭配使用的方式改为滤波器响应归一化与阈值线性单元激活函数搭配使用的方式;在分类器模块中选用全维动态卷积替换普通卷积;使用马氏距离和推土机距离替换L2距离度量函数。在CUB-200-2011数据集上的实验表明,在5way-1shot和5way-5shot分类任务下,SFOD_DPGN算法比DPGN算法的准确率提升约7.97%和2.66%。 The distribution propagation graph network(DPGN)is a few-shot image classification algorithm based on deep learning.Unfortunately,the DPGN algorithm completely ignores semantic information,which is important for fine-grained classification.Therefore,it delivers poor classification performances.This paper proposes a new Few-shot learning algorithm based on the DPGN algorithm,SinAM-FRN_layer-ODConv-DM&EMD_Distribution Propagation Graph Network(SFOD_DPGN).First,to address the inability to extract image features by the feature extraction module of the DPGN algorithm,the SimAM attention mechanism is integrated into four residual blocks of the feature extraction network ResNet12.The SimAM attention mechanism can generate three-dimensional weights for feature maps from both spatial and channel dimensions,and then aggregates the generated weights with the feature maps to enable the improved ResNet12 to learn more and richer image features;Second,in view that the normalization method of the ResNet12 is affected by the number of images selected in training,the combination of batch normalization and the ReLu activation function in the main path of each residual block of the ResNet12 is changed to the combination of the filter response normalization(FRN)and the threshold linear unit activation function(TLU).Because of the FRN without mean operation,it easily leads to activation with arbitrary bias far from zero.If the FRN combines with the ReLu activation function,this bias has adverse effects on training.This paper employs the TLU after the FRN to address the problem.The SFOD_DPGN algorithm improves the classification accuracy and ensures its inference speed.Then,it optimizes the classifier module of the DPGN algorithm.To solve poor classification performance of the classifier module,the full dimensional dynamic convolution(ODConv)is selected to replace the common convolution in the classifier module.The ODconv employs a linear combination of n convolutional kernels and parallel strategies to introduce multidimensional attention mechanisms for dynamic weighting,making the convolution operation dependent on the input.The ODconv improves the robustness of the SFOD_DPGN algorithm.Finally,the DPGN algorithm uses the L2 distance measurement method in the classifier module,easily causing errors in calculating the distance between samples.Based on the characteristics of distance measurement methods,the Mahalanobis Distance(MD)is suitable for calculating the distance between samples(point graphs).The Earth Moves’s Distance(EMD)distance ismore suitable for calculating the distance between distribution graphs.This paper uses the MD and EMD to replace the L2 in order to improve the ability of the classifier to measure the distance between samples.It improves the classification accuracy of the SFOD_DPGN algorithm.Experiments on the CUB-200-2011 dataset shows the SFOD_DPGN algorithm is superior to the DPGN algorithm over 5way-1shot and 5way-5shot classification tasks.The accuracy improves by 7.97%and 2.66%respectively.Meanwhile,ablation experiments are performed for each part to verify the effect of the improved ResNet12 and the classifier module.Compared to the DPGN algorithm,after the SimAM attention mechanism is integrated into the ResNet12,the accuracy improves by 2.77%and 1.16%over 5way-1shot and 5way-5shot classification tasks respectively.Furthermore,after the improving the normalization method and activation function of the ResNet12,the accuracy is 5.00%and 2.04%higher respectively over 5way-1shot and 5way-5shot classification tasks.After the further replacement of the common convolution with the ODconv,the accuracy is up by 7.25%and 2.42%respectively over 5way-1shot and 5way-5shot classification tasks.Our experimental results demonstrate all improvements are effective to improve classification accuracy of the SFOD_DPGN algorithm.

作者王玲孙莹王鹏白燕娥 WANG Ling;SUN Ying;WANG Peng;BAI Yan'e(Changchun University of Science and Technology,Changchun 130022,China)

机构地区长春理工大学计算机科学技术学院

出处《重庆理工大学学报（自然科学）》 CAS 北大核心 2024年第2期161-169,共9页 Journal of Chongqing University of Technology：Natural Science

基金吉林省自然科学基金项目(20210101413JC)。

关键词深度学习少样本图像分类注意力机制全维动态卷积马氏距离推土机距离 deep learning few-shot image classification attention mechanism omni-dimensional dynamic convolution mahalanobis distance earth mover’s distance

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1赵志成,罗泽,王鹏彦,李健.基于深度残差网络图像分类算法研究综述[J].计算机系统应用,2020,29(1):14-21. 被引量：56
2李晓旭,刘忠源,武继杰,曹洁,马占宇.小样本图像分类的注意力全关系网络[J].计算机学报,2023,46(2):371-384. 被引量：4

二级参考文献1

1张帆,张良,刘星,张宇.基于深度残差网络的脱机手写汉字识别研究[J].计算机测量与控制,2017,25(12):259-262. 被引量：9

共引文献58

1杨伟健.基于深度残差网络图像分类算法研究[J].计算机产品与流通,2020(11):100-100. 被引量：2
2邓宇平,王桂棠.基于GoogleNet网络与残差网络的织物纹理分析[J].电子测量技术,2021,44(7):31-38. 被引量：3
3吴晓玲,黄金雪,何文海.基于深度卷积神经网络的塑料垃圾分类研究[J].塑料科技,2020,48(4):86-89. 被引量：7
4郭功举,岳照溪,潘琛.基于深度学习的高分辨率影像样本构建与分类算法优化[J].测绘科学与工程,2020,40(4):24-30.
5陈娟,刘锋林,黄麒之,文泉.基于成果导向教学的人工智能课程改革[J].软件导刊,2020,19(12):19-22. 被引量：4
6付丽君,赵晨兵,杨青,张齐鹏.基于图像的集合型电机轴承故障诊断方法[J].沈阳理工大学学报,2020,39(5):8-12. 被引量：6
7敖卓缅.一种基于网络图的计算机算法研究[J].电子技术与软件工程,2021(1):5-6. 被引量：1
8文泉,张懿虎,陈娟.面向OBE和新工科的统计学习课程实践案例[J].计算机教育,2021(4):82-84. 被引量：1
9黄彩萍,甘书宽,谭金甲,黄志祥.基于深度学习的混凝土表观病害智能分类器[J].华中科技大学学报（自然科学版）,2021,49(4):96-101. 被引量：4
10赵敬娇,赵志宏,杨绍普.基于残差连接和1D-CNN的滚动轴承故障诊断研究[J].振动与冲击,2021,40(10):1-6. 被引量：30

1王晓兵,张雄伟,曹铁勇,郑云飞,王勇.基于尺度注意知识迁移的自蒸馏目标分割方法[J].计算机应用,2024,44(1):129-137.
2杨正宇,沈志强,郑成源.灰狼算法优化SVR的10kV配网线损率预测研究[J].计算机技术与发展,2024,34(3):35-40. 被引量：1
3曾中华,曹东.基于改进Unet的矿石图像分割[J].电子测量技术,2023,46(21):176-182.
4侯颖,杨林,胡鑫,贺顺,宋婉莹,赵谦.基于SwinT-YOLOX模型的自动扶梯行人安全检测算法[J].计算机工程,2024,50(3):277-289.
5袁新颜,黄嘉爽.基于图核的动态脑网络状态构建方法及其应用[J].计算机应用与软件,2023,40(12):108-113.
6陆煜,俞经虎,朱行飞,张不凡.基于卷积神经网络的轻量级水稻叶片病害识别模型[J].江苏农业学报,2024,40(2):312-319.
7王光辉,白天水,丁爽,何欣.基于代理选举的高效异构联邦学习方法[J].计算机应用研究,2024,41(3):688-693.
8陈昊峰,刘学军,王步美.基于评论文本和融入专业度评分的跨域混合推荐[J].计算机工程与设计,2024,45(3):755-761.
9程楠楠,彭吉琼,吴璇.融合知识图谱与注意力机制的产学合作推荐研究[J].信息技术与信息化,2024(2):62-66.
10李慧,胡耀华,徐存真.考虑评论情感表达力及其重要性的个性化推荐算法[J].数据分析与知识发现,2024,8(1):69-79.

重庆理工大学学报（自然科学）

2024年第2期

浏览历史

内容加载中请稍等...

基于改进DPGN的少样本图像分类算法研究

参考文献2

二级参考文献1

共引文献58

相关作者

相关机构

相关主题

浏览历史