Edge-loss-enhanced ground-object segmentation of high-resolution remote sensing imagery (cited by: 3)

Segmentation of high-resolution remote sensing image by collaborating with edge loss enhancement


Abstract: Objective To address the low segmentation accuracy and blurred target boundaries that are common in semantic segmentation of high-resolution remote sensing imagery, an edge-loss-enhanced semantic segmentation method is proposed that jointly exploits boundary information and the multiscale features of the network. Method For a single high-resolution remote sensing image, a side output structure is first introduced into the VGG-16 (visual geometry group 16-layer net) network to extract rich feature details from the image. A deeply supervised short connection structure then combines the side outputs from deep to shallow layers to achieve multilevel, multiscale feature fusion. Finally, an edge loss enhancement structure is added to obtain clearer target boundaries and to improve the accuracy and completeness of the segmentation result. Result To verify the effectiveness of the proposed method, remote sensing images of planted greenhouses in northern China and of photovoltaic panel modules from Google Earth were manually annotated to build experimental datasets. On these two datasets, the proposed method was compared with several commonly used semantic segmentation methods. The experimental results show that the precision of the proposed method at a recall of 0 [...] and the contribution of each functional module to the final result is analyzed. Conclusion Compared with current state-of-the-art methods, the proposed edge-loss-enhanced ground-object segmentation method extracts target regions more accurately from the complex background of remote sensing imagery, and the extracted targets have clearer edges.

Objective Semantic analysis of remote sensing (RS) images has always been an important research topic in the computer vision community. It is widely used in related fields such as military surveillance, mapping and navigation, and urban planning. By exploring and analyzing the semantic information of RS images, researchers can readily obtain informative features for subsequent decision making. However, the richer, finer visual information in high-resolution RS images also places higher demands on image segmentation techniques. Traditional segmentation methods usually employ low-level visual features such as grayscale, color, spatial texture, and geometric shape to divide an image into several disjoint regions. Such features are generally called hand-crafted features; they are empirically defined and may carry little semantic meaning. Compared with traditional segmentation methods, semantic segmentation approaches based on deep convolutional neural networks (CNNs) can learn hierarchical visual features that represent images at different semantic levels. Typical CNN-based semantic segmentation approaches mainly focus on mitigating semantic ambiguity by providing rich information. However, RS images have higher background complexity than natural scene images. For example, they usually contain many types of geometric objects and cover large redundant background areas. Simply employing a single type of feature, even a CNN-based one, may not be sufficient in such cases. Take the task of extracting a single object category from RS images as an example. On the one hand, negative objects may look similar to the expected target. This redundant, noisy semantic information may confuse the network and ultimately degrade segmentation performance. On the other hand, CNN-based features are better at encoding context information than the fine details of an image, so CNN-based models have difficulty predicting object boundaries precisely. Therefore, to address these problems in high-resolution RS image segmentation, this paper proposes an edge loss enhanced network for semantic segmentation that comprehensively utilizes boundary information and hierarchical deep features.

Method The backbone of the proposed model is a fully convolutional network derived from the visual geometry group 16-layer net (VGG-16) structure by removing all fully connected layers and the fifth pooling layer. A side output structure is introduced for each convolutional layer of the backbone network to extract rich, informative features from the input image. The side output structure starts with a (1×1, 1) convolutional layer (a specific convolutional layer is denoted as (n×n, c), where n and c are the size and number of kernels, respectively), followed by an element-wise summation layer that accumulates the features at each scale. A (1×1, 1) convolutional layer is then used to concentrate the hybrid features. The side output structure makes full use of the features of each convolutional layer of the backbone and helps the network capture the fine details of the image. The side output features are then gradually aggregated from the deep layers to the shallow layers by a deeply supervised short connection structure, which strengthens the connections between features across scales. To this end, each side output feature is first encoded by a residual convolution unit and then passed, with the necessary upsampling, to the residual convolution unit of a nearby shallower stage. The short connection structure enables multilevel, multiscale fusion during feature encoding and proves effective in the experiments. Finally, for each fused side output feature, a (3×3, 128) convolutional layer first unifies the number of feature channels and then sends the result to two parallel branches: an edge loss enhancement branch and an ordinary segmentation branch. In each edge loss enhancement branch, a Laplace operator coupled with a residual convolution unit is adopted to obtain the target boundary. The detected boundary is supervised by a ground truth generated by directly computing the gradient of the existing semantic annotation of the training samples, so no additional manual edge labeling is required. Experimental results show that the edge loss enhancement branch helps refine the target boundary while maintaining the integrity of the target region.

Result First, two manually annotated datasets are organized to evaluate the effectiveness of the proposed method: RS images of planted greenhouses in northern China and RS images of photovoltaic panels collected from Google Earth. Visual and numerical comparisons are then conducted between the proposed method and several popular semantic segmentation methods. In addition, an ablation study illustrates the contribution of the essential components of the proposed architecture. The experimental results show that the proposed method outperforms the competing approaches on both datasets in terms of precision-recall curves and mean absolute error (MAE). The precision achieved by the proposed method remains above 0.8 for recall rates in the range of 0 to 0.9. The MAE achieved by the proposed method is 0.0791/0.0362 on the two datasets, which is the best of all evaluated results. The ablation study also clearly illustrates the effectiveness of each individual functional block. First, the baseline of the proposed architecture obtains a poor result, with an MAE of 0.2044 on the northern greenhouse dataset. The residual convolutional units then help reduce the MAE by 31%, and the value drops further to 0.0848 when the short connection structure is added to fuse the multiscale features of the network. Finally, the edge loss enhancement structure lowers the MAE to 0.0791, a decrease of 61% compared with the baseline model. The results indicate that all components are necessary to obtain a good ground-object segmentation result.

Conclusion In summary, compared with the competing methods, the proposed method extracts the target region more accurately from the complex background of RS images and yields a clearer target boundary.
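The abstract states that the edge ground truth is derived from the existing segmentation annotation without extra labeling, and that results are scored by MAE. The following is only a minimal sketch of those two ideas in plain Python, not the authors' implementation. It assumes a binary mask stored as nested lists, and it uses a discrete 4-neighbour Laplacian with replicated borders as a stand-in for the gradient computation, since the abstract does not specify the exact operator. The names `laplacian_edges`, `mean_absolute_error`, and `relative_reduction` are hypothetical.

```python
from typing import List

Mask = List[List[float]]


def laplacian_edges(mask: Mask) -> Mask:
    """Derive a binary edge map from a binary segmentation mask.

    Assumption: a 4-neighbour Laplacian stands in for the 'gradient of the
    annotation' mentioned in the abstract. A pixel is marked as edge when
    the Laplacian response is non-zero.
    """
    h, w = len(mask), len(mask[0])
    edges = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Replicate border pixels so the image border itself
            # does not produce spurious edge responses.
            up = mask[max(y - 1, 0)][x]
            down = mask[min(y + 1, h - 1)][x]
            left = mask[y][max(x - 1, 0)]
            right = mask[y][min(x + 1, w - 1)]
            response = up + down + left + right - 4.0 * mask[y][x]
            edges[y][x] = 1.0 if response != 0 else 0.0
    return edges


def mean_absolute_error(pred: Mask, target: Mask) -> float:
    """Average absolute per-pixel difference between prediction and target."""
    h, w = len(target), len(target[0])
    total = sum(
        abs(pred[y][x] - target[y][x]) for y in range(h) for x in range(w)
    )
    return total / (h * w)


def relative_reduction(baseline: float, improved: float) -> float:
    """Fractional decrease of a metric relative to its baseline value."""
    return (baseline - improved) / baseline


if __name__ == "__main__":
    # Toy 5x5 annotation with a 3x3 target in the centre.
    toy_mask = [
        [1.0 if 1 <= y <= 3 and 1 <= x <= 3 else 0.0 for x in range(5)]
        for y in range(5)
    ]
    toy_edges = laplacian_edges(toy_mask)
    print(sum(map(sum, toy_edges)))
    # Baseline and final MAE values are the ones quoted in the abstract.
    print(relative_reduction(0.2044, 0.0791))
```

With the two MAE values quoted in the abstract (0.2044 for the baseline and 0.0791 for the full model), `relative_reduction` gives roughly 0.61, which agrees with the reported 61% decrease.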

Authors: Chen Qin; Zhu Lei; Lyu Suidong; Wu Jin (School of Information Science and Engineering, Wuhan University of Science and Technology, Wuhan 430081, China; WISDRI Continuous Casting Technology Engineering Company Ltd., Wuhan 430223, China)

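Affiliations: School of Information Science and Engineering, Wuhan University of Science and Technology; WISDRI Continuous Casting Technology Engineering Company Ltd.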

Source: Journal of Image and Graphics (中国图象图形学报), CSCD, Peking University Core Journal, 2021, No. 3, pp. 674-685 (12 pages)

Funding: National Natural Science Foundation of China (61502358, 61502357).

Keywords: high-resolution remote sensing imagery; convolutional neural network (CNN); semantic segmentation; multi-feature fusion; edge loss reinforced network; mean absolute error (MAE)

Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology]
