基于门控注意力和多尺度残差融合的双源遥感图像语义分割

Semantic Segmentation of Dual-Source Remote Sensing Images Based on Gated Attention and Multiscale Residual Fusion

导出

摘要遥感图像语义分割是基于地理对象进行遥感图像分析的关键和重要步骤。遥感影像数据与高程数据可形成有效的特征互补,进而提升像素级分割精度。以Swin Transformer为主干网络提取多尺度特征,融合自适应门控注意力机制和多尺度残差融合策略,提出双源遥感图像语义分割模型——STAM-SegNet。自适应门控注意力机制包含门控通道注意力机制和门控空间注意力机制。门控通道注意力通过竞争/合作的机制提升双源数据特征之间的相关性,有效提取双源数据的互补特征。门控空间注意力利用空间上下文信息动态地过滤掉部分高层语义特征,筛选出精确的细节特征。多尺度特征残差融合策略通过多尺度细化和残差结构充分捕获多尺度上下文信息,加强对阴影、边界等细节特征的关注,同时提升模型的训练速度。在Vaihingen和Potsdam数据集上进行实验,所提方法分别取得了89.66%和92.75%的平均F1-score,具有比DeepLabV3+、UperNet、DANet、TransUNet、Swin-UNet等网络更高的分割精度。 The semantic segmentation of remote sensing images is a crucial step in the analysis of geographic-object-based remote sensing images.Combining remote sensing image data with elevation data effectively enhances feature complementarity,thereby improving pixel-level segmentation accuracy.This study proposes a dual-source remote sensing image semantic segmentation model,STAM-SegNet,that leverages the Swin Transformer backbone network to extract multiscale features.The proposed model integrates an adaptive gating attention mechanism and a multiscale residual fusion strategy.The adaptive gated attention mechanism includes gated channel attention and gated spatial attention mechanisms.Gated channel attention enhances the correlation between dual-source data features through competition/cooperation mechanisms,effectively extracting complementary features of dual-source data.In contrast,gated spatial attention uses spatial contextual information to dynamically filter out high-level semantic features and select accurate detail features.The multiscale feature residual fusion strategy captures multiscale contextual information via multiscale refinement and residual structure,thereby emphasizing detailed features,such as shadows and boundaries,and improving the model’s training speed.Experiments conducted on the Vaihingen and Potsdam datasets demonstrate that the proposed model achieved an average F1-score of 89.66%and 92.75%,respectively,surpassing networks such as DeepLabV3+,UperNet,DANet,TransUNet,and Swin-UNet in terms of segmentation accuracy.

作者郭文杨虹刘畅 Guo Wen;Yang Hong;Liu Chang(School of science,Beijing Information Science and Technology University,Beijing 100029,China;Institute of Applied Mathematics,Beijing Information Science and Technology University,Beijing 100101,China)

机构地区北京信息科技大学理学院北京信息科技大学应用数学研究所

出处《激光与光电子学进展》 CSCD 北大核心 2024年第18期450-460,共11页 Laser & Optoelectronics Progress

基金国家自然科学基金(62171044) 北京市自然科学基金(4222104)。

关键词遥感图像解译语义分割双源遥感数据自适应门控注意力机制多尺度残差融合 remote sensing image interpretation semantic segmentation dual-source remote sensing data adaptive gating attention mechanism multiscale residual fusion

分类号 TP753 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

1彭琼.“双一流”背景下高校青年教师教学科研能力提升路径研究[J].科教导刊,2024(25):62-64.
2王大光.“一带一路”倡议下跨国会展合作的机制与模式研究[J].商展经济,2024(20):7-9.
3张珩,熊梅.基于遥感影像数据和POI数据的城市建设用地提取[J].河北省科学院学报,2024,41(5):60-66.
4李富华,吴陈.一种用于道路场景分割的轻量级特征融合网络[J].计算机与数字工程,2024,52(8):2329-2335.
5杨晓文,靳瑜昕,韩慧妍,况立群,无.融合编码器多尺度特征的RGB-D图像语义分割[J].计算机仿真,2024,41(9):205-212.
6王晓玲,朱开渲,余红玲,蔡志坚,王成.考虑时空相关性的大坝渗压组合深度学习预测模型[J].水力发电学报,2023,42(11):78-91. 被引量：1
7张国君,于小川,张猛,蒋海龙,张常兴,王大鹏,白翔宇,翟欣欣,赵玉妹,韩晨.基于边界感知网络的遥感影像输电线路通道隐患地物变化检测[J].电力设备管理,2024(17):175-178.
8聂岩,蒋鹏飞,边防,贾方圆.基于Unet和SVM耦合的遥感影像地物分类优化改进研究[J].新一代信息技术,2023,6(18):7-12.
9单伏顺.基于改进YOLOv7的安全头盔检测算法[J].智能计算机与应用,2024,14(10):227-230.
10宋建辉,胡强强,刘晓阳,赵亚威.基于注意力机制及多尺度特征融合的图像去雨[J].沈阳理工大学学报,2024,43(6):28-33.

激光与光电子学进展

2024年第18期

浏览历史

内容加载中请稍等...

基于门控注意力和多尺度残差融合的双源遥感图像语义分割

相关作者

相关机构

相关主题

浏览历史