基于transformer自适应特征向量融合的图像分类被引量：1

Image classification based on transformer adaptive feature vector fusion

导出

摘要针对目前基于transformer的图像分类模型直接应用在小数据集上性能较差的问题,本文提出了transformer自适应特征向量融合网络,该网络在特征提取器中将不同阶段的特征进行融合,减少特征信息丢失的同时获得更多不同感受野下的信息,同时利用最大池化来去除特征中的冗余信息,从而使提取的特征更具有判别性。此外,为了充分利用图像的各级特征信息来进行分类预测,本文将网络各阶段产生的特征向量进行融合,使融合后的特征向量更具有表征能力,从而减少网络对大数据集的依赖,使网络在小数据集中也能获得很好的性能。实验表明,本文提出的算法在数据集Mini-ImageNet-100、CIFAR-100和ImageNet-1k上的TOP-1准确率分别达到了74.22%、85.86%和81.4%。在没有增加计算量的情况下,在baseline上分别提高了6.0%、3.0%和0.1%,且参数量减少了18.3%。本文代码开源在“https://github.com/xhutongxue/afvf”。 Aiming at the problem of poor performance that the current transformer-based image classification model is directly applied to the small data sets,this paper proposes a transformer adaptive feature vector fusion network,which fuses features at different stages in the feature extractor,reduces the loss of feature information and obtains more information under different receptive fields,and uses maximum pooling to remove redundant information of features,so that the extracted features are more discriminative.In addition,in order to make full use of the feature information at all levels of the image for classification prediction,this paper fuses the feature vectors generated at each stage of the network to make the fused feature vectors more representative.Thereby reducing the network's dependence on large data sets,so that the network can also obtain good performance in small data sets.Experiments show that the algorithm proposed in this paper achieves 74.22%,85.86%and 81.4%of the TOP-1 accuracy on the datasets Mini-ImageNet-100,CIFAR-100 and ImageNet-1k,respectively.Without increasing the amount of computation,the baselines are improved by 6.0%,3.0%,and 0.1%,respectively,and the amount of parameters is reduced by 18.3%.The code of this article is open source at"https://github.com/xhutong xue/afvf".

作者胡义黄勃淳李凡 HU Yi;HUANG Bochun;LI Fan(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China)

机构地区昆明理工大学信息工程与自动化学院

出处《光电子．激光》 CAS CSCD 北大核心 2023年第6期602-609,共8页 Journal of Optoelectronics·Laser

基金国家自然科学基金(61862036,61962030,81860318)资助项目。

关键词 TRANSFORMER 图像分类自适应特征向量融合卷积神经网络(CNN) 模式识别 transformer image classification adaptive feature vector fusion convolutional neural net-work(CNN) pattern recognition

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1肖进胜,饶天宇,贾茜,宋金钟,易本顺.基于图切割的拉普拉斯金字塔图像融合算法[J].光电子．激光,2014,25(7):1416-1424. 被引量：21
2吴效莹,温显斌,徐海霞,袁立明.基于张量投影的多特征非局部动态核稀疏表示的SAR图像分类[J].光电子．激光,2021,32(7):742-752. 被引量：1
3杨航,吴笑天,贺柏根,朱明.基于多尺度导引滤波的图像融合方法[J].光电子．激光,2015,26(1):170-176. 被引量：29

二级参考文献26

1Goshtasby A A,NIkolov S.Image fusion:advances in the state of the art[J ].Inf.Fusion,2007,8(2):114-118.
2Pohl C,Van Genderen J L.Multisensor image fusion in remote se nsing:concepts,methods and applications[J].Int.J.Remote Sens.,1998,19(5):823-854.
3Mitianoudis N,Stathaki T.Pixel-based and region-based image fusion schemes using ICA bases[J].Inf.Fusion,2007,8(2):131-142.
4Rockinger O.Pixel level fusion of image sequences using wavele t frames[C].Proc.16th Leeds Annual Statistical Research Workshop,[C].1996,149-154.
5Chung K L,Yang W J,Yan W M.Efficient edge-preserving algorithm for color contrast enhancement with application to color image segmentation[J].J.Vis.Commun.Image Represent .,2008,19(5):299-310.
6Chen C, Wang C D.A simple edge-preserving filtering techniq ue for constructing multi-resolution systems of images[J].Pattern Recognit.Lett.,1999,20(5) :495-506.
7Perona P,Malik J.Scale-space and edge dete ction using anisotropic diffusion,IEEE Trans.Pattern Anal.Mach.Intell.,1990,12(7):629-639.
8Tomasi C,Manduchi R.Bilateral filtering for g ray and color images[C].Proc.Int.Conf.on Computer Vision[C].1998,9-846.
9Farbman Z,Fattal R,Lischinski D,et al.Edge-pr eserving decompositions for multi-scale tone and detail manipulation[J].ACM Trans.Graph.,2008,27(3) :1-10.
10Xu L,Lu C,Xu Y,et al.Image smoothing via L0gradient minimization[J].ACM Trans.Graph.,2011,30 (6):174-12.

共引文献47

1杨航,吴笑天,贺柏根,朱明.基于多尺度导引滤波的图像融合方法[J].光电子．激光,2015,26(1):170-176. 被引量：29
2彭红,肖进胜,程显,李必军,宋晓.基于扩展卡尔曼滤波器的车道线检测算法[J].光电子．激光,2015,26(3):567-574. 被引量：23
3肖进胜,饶天宇,贾茜,唐路敏,岳显昌.改进的自适应冲击滤波图像超分辨率插值算法[J].计算机学报,2015,38(6):1131-1139. 被引量：13
4王涛,董灵波,刘兆刚,张凌宇,陈莹.大兴安岭天然次生林林木补植空间优化[J].北京林业大学学报,2019,41(5):127-136. 被引量：6
5包观笑,孙刘杰,于海娇.基于拉普拉斯金字塔的数字水印防伪技术[J].包装工程,2016,37(1):130-133. 被引量：10
6张惊雷,胡晓婷,温显斌.基于Shearlet变换与区域分割的遥感图像融合[J].光电子．激光,2015,26(12):2393-2399. 被引量：4
7孙刘杰,包观笑,汪祖辉,李毓彬.空间域与频率域相结合的抗图像处理全息水印[J].光电子．激光,2016,27(1):61-66. 被引量：7
8芦碧波,陈静,郑艳梅,王建龙.基于引导滤波的iCAM06色调映射算法[J].光学技术,2016,42(2):130-135. 被引量：6
9李美丽.基于多尺度变换的PCNN和FOA图像融合[J].光电子．激光,2016,27(7):767-772. 被引量：5
10汪玉美,陈代梅,赵根保.基于目标提取与拉普拉斯变换的红外和可见光图像融合算法[J].激光与光电子学进展,2017,54(1):98-106. 被引量：28

同被引文献11

1赵雪梅,吴军,陈睿星.RMFS-CNN:遥感图像分类深度学习新框架[J].中国图象图形学报,2021,26(2):297-304. 被引量：7
2徐科杰,邓培芳,黄鸿.HSRS-SC:面向遥感场景分类的高光谱图像数据集[J].中国图象图形学报,2021,26(8):1809-1822. 被引量：5
3潘尔婷,马泳,黄珺,樊凡,李皞,马佳义.跨数据集评估的高光谱图像分类[J].中国图象图形学报,2021,26(8):1969-1977. 被引量：4
4邓培芳,徐科杰,黄鸿.基于CNN-GCN双流网络的高分辨率遥感影像场景分类[J].遥感学报,2021,25(11):2270-2282. 被引量：9
5余甜微,郑恩让,沈钧戈,王凯.基于多级别跨层双线性融合的光学遥感图像场景分类[J].光子学报,2022,51(2):250-263. 被引量：6
6白坤,慕晓冬,陈雪冰,朱永清,尤轩昂.融合半监督学习的无监督遥感影像场景分类[J].测绘学报,2022,51(5):691-702. 被引量：6
7倪康,赵雨晴,陈志.稀疏二阶注意力机制驱动的多尺度卷积遥感图像场景分类网络[J].光子学报,2022,51(6):386-397. 被引量：3
8余东行,徐青,赵传,郭海涛,卢俊,林雨准,刘相云.注意力引导特征融合与联合学习的遥感影像场景分类[J].测绘学报,2023,52(4):624-637. 被引量：6
9张艺超,郑向涛,卢孝强.基于层级Transformer的高光谱图像分类方法[J].测绘学报,2023,52(7):1139-1147. 被引量：4
10辛紫麒,李忠伟,王雷全,许明明,胡亚斌,梁建.基于光谱-空间联合Transformer模型的黄河三角洲湿地高光谱影像分类[J].海洋科学,2023,47(5):90-101. 被引量：1

引证文献1

1薛洁,黄鸿,蒲春宇,杨鄞铭,李远,刘英旭.面向高光谱场景分类的空—谱模型蒸馏网络[J].中国图象图形学报,2024,29(8):2205-2219.

1郑丹丹,王毅.大数据集实时查询的缓存技术实现[J].金融电子化,2023(11):87-88.
2谢梅源,何耀平,张焰林.医院智能分诊系统训练数据自动标注方法研究[J].微型电脑应用,2023,39(6):42-45. 被引量：1
3王春阳,帅闻,肖博,黄思玲,王大森.基于环摆式双面抛光法加工预测模型的去除均匀性研究[J].光学学报,2023,43(9):134-144. 被引量：1
4Zhifang Liao,Yiqi Zhao,Shengzong Liu,Yan Zhang,Limin Liu,Jun Long.The Measurement of the Software Ecosystem’s Productivity with GitHub[J].Computer Systems Science & Engineering,2021,36(1):239-258.
5张晶,鞠佳良,任永功.基于双生成器网络的Data-Free知识蒸馏[J].计算机研究与发展,2023,60(7):1615-1627. 被引量：1
6Qi Zhang,Jingyu Xiao,Chunwei Tian,Jerry Chun‐Wei Lin,Shichao Zhang.A robust deformed convolutional neural network(CNN)for image denoising[J].CAAI Transactions on Intelligence Technology,2023,8(2):331-342. 被引量：11
7V.Joseph Raymond,R.Jeberson Retna Raj.Investigation of Android Malware with Machine Learning Classifiers using Enhanced PCA Algorithm[J].Computer Systems Science & Engineering,2023,44(3):2147-2163. 被引量：1

光电子．激光

2023年第6期

浏览历史

内容加载中请稍等...

基于transformer自适应特征向量融合的图像分类被引量：1

参考文献3

二级参考文献26

共引文献47

同被引文献11

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于transformer自适应特征向量融合的图像分类 被引量：1

参考文献3

二级参考文献26

共引文献47

同被引文献11

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于transformer自适应特征向量融合的图像分类被引量：1