High-resolution optical images change detection based on global information enhancement by pyramid semantic token
Abstract Complex backgrounds and spectral changes often cause small ground objects to be missed and geometric structures to be detected incompletely in change detection (CD) on high-resolution remote sensing images. To address these problems, this paper combines the strengths of convolutional neural networks (CNN) and Transformer networks and proposes a change detection network based on global information enhancement with pyramid semantic tokens (PST-GIENet). First, a ResNet18 network without its max-pooling layer extracts deep features from the multi-temporal images; these are combined into fused features, whose representational power is improved by a joint attention mechanism and a deep supervision strategy. Second, spatial pyramid pooling represents the image features as multi-scale semantic tokens, and a Transformer encoder and decoder then model the global context of the fused feature space. Finally, a layer-wise up-sampling decoder produces the final change map. To verify the effectiveness of the method, comparative experiments and analysis were carried out on three public CD datasets: LEVIR-CD, CDD, and WHU-CD. The quantitative results show that PST-GIENet achieves the best accuracy metrics on all three datasets, with F1 scores of 91.71%, 96.16%, and 94.08%, respectively. The visual results show that PST-GIENet effectively suppresses interference from complex backgrounds and spectral changes and significantly strengthens the network's ability to capture the edge structures and multi-scale changes of ground objects, giving the best visual results among the compared methods.
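The abstract does not give the details of the pyramid tokenization step, so the following is only a minimal sketch of the general idea of spatial pyramid pooling: a feature map is average-pooled over grids of several sizes, and each grid cell becomes one token whose length equals the number of channels. The function name `pyramid_tokens`, the bin sizes, the use of average pooling, and the toy feature map are all assumptions made for illustration; they are not taken from the paper, and PST-GIENet itself may differ.

```python
def pyramid_tokens(fmap, bin_sizes=(1, 2, 4)):
    """Pool a C x H x W feature map (nested lists) into multi-scale tokens.

    Hypothetical helper: each pyramid level splits the map into bins x bins
    cells and averages every channel inside each cell. Returns a list of
    tokens, where each token is a list of C channel averages.
    """
    channels = len(fmap)
    height, width = len(fmap[0]), len(fmap[0][0])
    tokens = []
    for bins in bin_sizes:
        for i in range(bins):
            r0 = (i * height) // bins               # floor of cell start row
            r1 = -((-(i + 1) * height) // bins)     # ceil of cell end row
            for j in range(bins):
                c0 = (j * width) // bins
                c1 = -((-(j + 1) * width) // bins)
                area = (r1 - r0) * (c1 - c0)
                token = [
                    sum(sum(fmap[c][r][c0:c1]) for r in range(r0, r1)) / area
                    for c in range(channels)
                ]
                tokens.append(token)
    return tokens


# Toy 2-channel, 4 x 4 feature map (illustrative values only).
fmap = [
    [[r * 4 + c for c in range(4)] for r in range(4)],  # channel 0: 0..15
    [[1.0] * 4 for _ in range(4)],                      # channel 1: constant
]
tokens = pyramid_tokens(fmap, bin_sizes=(1, 2))
print(len(tokens))  # 1 global token + 4 tokens from the 2 x 2 level
```

With pyramid levels of 1 x 1 and 2 x 2, the 4 x 4 map is summarized by five tokens instead of sixteen pixel positions. A short token sequence of this kind is what keeps global attention in a Transformer inexpensive, while the different grid sizes retain both coarse and finer spatial context.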
Authors PENG Daifeng; ZHAI Chenchen; ZHOU Dingwei; ZHANG Yongjun; GUAN Haiyan; ZANG Yufu (School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China; Technology Innovation Center for Integrated Applications in Remote Sensing and Navigation, Ministry of Natural Resources, Nanjing 210044, China; Key Laboratory of National Geographic Census and Monitoring, Ministry of Natural Resources, Wuhan 430079, China; Key Laboratory of Land Satellite Remote Sensing Application, Ministry of Natural Resources, Nanjing 210013, China; School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China)
Source Acta Geodaetica et Cartographica Sinica (《测绘学报》; indexed in EI, CSCD, and the Peking University Core Journals list), 2024, No. 6, pp. 1195-1211 (17 pages)
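Funding National Natural Science Foundation of China (42371449, 41801386); Open Fund of the Technology Innovation Center for Integrated Applications in Remote Sensing and Navigation, Ministry of Natural Resources (TICIARSN-2023-07); Open Fund of the Key Laboratory of National Geographic Census and Monitoring, Ministry of Natural Resources (2023NGCM02); Open Fund of the Key Laboratory of Land Satellite Remote Sensing Application, Ministry of Natural Resources (KLSMNR-G202308).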
Keywords high-resolution remote sensing images; change detection; pyramid semantic tokens; global dependency; attention mechanism