基于局部选择Vision Transformer的遥感场景分类算法

Remote Sensing Scene Classification Based on Local Selection Vision Transformer

导出

摘要遥感场景分类旨在为航空图像指定特定的语义标签,是遥感图像解译中一个基础且重要的任务。现有的研究主要利用卷积神经网络(CNN)学习全局和局部特征,提高网络的判别性表达。然而基于CNN的方法的感受野在建模局部特征的远程依赖性方面存在局限性。近年来,Vision Transformer(ViT)在传统的分类任务中表现出了强大的性能。Transformer的自我注意力机制将每个Patch标记与分类标记连接起来,捕捉图像像素之间的上下文关系,考虑空间域中的全局信息。提出一个基于局部选择ViT的遥感场景分类网络。首先将输入图像分割成小块的Patch,将其展开转换成序列,并进行位置编码添加到序列中;然后将得到的序列输入编码器中;除此之外,为了学习到局部判别特征,在最后一层输入前加入局部选择模块,选择具有判别性的Token作为输入,得到最后用于分类的输出。实验结果表明,所提方法在两个大型遥感场景分类数据集(AID和NWPU)取得不错的效果。 Remote sensing scene classification aims to assign specific semantic labels to aerial images,which is a fundamental and important task in remote sensing image interpretation.Existing studies have used convolutional neural networks(CNN)to learn global and local features and improve the discriminative representation of networks.However,the perceptual wilderness of CNN-based approaches has limitations in modeling the remote dependence of local features.In recent years,Vision Transformer(ViT)has shown powerful performances in traditional classification tasks.Its selfattention mechanism connects each Patch with a classification token and captures the contextual relationship between image pixels by considering global information in the spatial domain.In this paper,we propose a remote sensing scene classification network based on local selection ViT,in which an input image is first segmented into small chunks of Patch that are unfolded and converted into sequences with position encoding;thereafter,the obtained sequences are fed into an encoder.In addition,a local selection module is added before the last layer of input in order to learn the local discriminative features,and Token with discriminative properties are selected as input to obtain the final classification output.The experimental results show that the proposed method achieves good results on two large remote sensing scene classification datasets(AID and NWPU).

作者杨凯卢孝强 Yang Kai;Lu Xiaoqiang(Key Laboratory of Spectral Imaging Technology,Xi’an Institute of Optics and Precision Mechanics,Chinese Academy of Sciences,Xi’an 710119,Shaanxi,China;University of Chinese Academy of Sciences,Beijing 100049,China)

机构地区中国科学院西安光学精密机械研究所光谱成像技术重点实验室中国科学院大学

出处《激光与光电子学进展》 CSCD 北大核心 2023年第22期319-325,共7页 Laser & Optoelectronics Progress

基金国家杰出青年科学基金(61925112)。

关键词遥感场景分类深度学习 Vision Transformer 局部特征 remote sensing scene classification deep learning Vision Transformer local feature

分类号 TP751.1 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献2

1Jinpu Lin,Florian Haberstroh,Stefan Karsch,Andreas Döpp.Applications of object detection networks in high-power laser systems and experiments[J].High Power Laser Science and Engineering,2023,11(1):52-60. 被引量：19
2Fuyuan Wu,Xiaohu Yang,Yanyun Ma,Qi Zhang,Zhe Zhang,Xiaohui Yuan,Hao Liu,Zhengdong Liu,Jiayong Zhong,Jian Zheng,Yutong Li,Jie Zhang.Machine-learning guided optimization of laser pulses for direct-drive implosions[J].High Power Laser Science and Engineering,2022,10(2):35-41. 被引量：8

共引文献22

1席晓峰,郭冰,符长波,吕冲,张国强.高功率激光驱动核反应研究进展与展望[J].原子能科学技术,2023,57(5):865-887. 被引量：2
2余永建,王越,李寰,周文超,舒风风,高明,吴一辉.融合通道层注意力机制的UNet的衍射极限荧光点检测和定位[J].激光与光电子学进展,2023,60(14):245-254. 被引量：1
3胡待方,仝秋红,柴国庆,王凯,穆雨薇,苏胜君.雨天车辆检测的两阶段渐进式图像去雨算法[J].激光与光电子学进展,2023,60(22):103-112.
4张敏,邓洋洋,李亚军,张苗辉.基于语义对齐与图节点交互的实例分割算法[J].激光与光电子学进展,2023,60(22):123-130.
5高小强,常侃,凌铭阳,银梦雨.多模态自适应特征融合的目标检测[J].激光与光电子学进展,2023,60(24):100-109.
6Andreas Döpp,Christoph Eberle,Sunny Howard,Faran Irshad,Jinpu Lin,Matthew Streeter.Data-driven science and machine learning methods in laser-plasma physics[J].High Power Laser Science and Engineering,2023,11(5):10-50. 被引量：7
7王美乔,徐泽鲲,吴福源,张杰.等容预压缩等离子体中的快点火热斑形成与燃烧波传播[J].物理学报,2024,73(5):247-256.
8景宁,赵俊鹏,张敏娟.面向等效时间采样的人工智能均衡器[J].激光与光电子学进展,2024,61(5):211-214.
9夏晓华,苏建功,王耀耀,刘洋,李明臻.基于DeepLabv3+的轻量化路面裂缝检测模型[J].激光与光电子学进展,2024,61(8):172-181. 被引量：1
10李佰强,潘光绪,李天倩,朱冬,白露,阳小明,刘培刚,文坤强.基于有界分类器的深度学习青铜器年代鉴别方法[J].激光与光电子学进展,2024,61(8):192-200.

1比尔·希利尔,杨滔,林旭辉(译).结构或:空间句法是否需要从根本上扩展其空间组构理论?[J].城市设计,2022(4):26-45. 被引量：1
2陈成琳,鲍春,曹杰,郝群.基于改进YOLOv3的遥感小目标检测网络[J].计算机仿真,2023,40(8):30-35. 被引量：3
3付琨,王佩瑾,冯瑛超,李俊希,何琪彬,肖思宁,刁文辉,孙显.遥感跨模态智能解译:模型、数据与应用[J].中国科学：信息科学,2023,53(8):1529-1559. 被引量：2
4陈文纯,王玥,李晓敏.基于空间句法下农村滨水生态景观空间的研究与优化--以潮安龙湖古寨为例[J].现代园艺,2024,47(1):22-24. 被引量：1
5康宇哲,冯桂林,张易诚,康逸云,沈炜.基于无锚解耦头的航空图像旋转目标检测方法研究[J].计算机时代,2023(12):85-88.
6李胜永,王超男,王孟.极轻量的航空影像港口船舶目标检测器[J].计算机工程与设计,2023,44(12):3606-3612.
7潘林朋,谢凤英,赵薇薇,周颖,刘畅,王艳.基于弱监督的遥感图像镶嵌质量盲评价[J].北京航空航天大学学报,2023,49(9):2518-2526.

激光与光电子学进展

2023年第22期

浏览历史

内容加载中请稍等...

基于局部选择Vision Transformer的遥感场景分类算法

参考文献2

共引文献22

相关作者

相关机构

相关主题

浏览历史