加强类别关系的农作物遥感图像语义分割被引量：5

CRNet: class relation network for crop remote sensing image semantic segmentation

导出

摘要目的遥感图像处理技术在农作物规划、植被检测以及农用地监测等方面具有重要的作用。然而农作物遥感图像上存在类别不平衡的问题,部分样本中农作物类间相似度高、类内差异性大,使得农作物遥感图像的语义分割更具挑战性。为了解决这些问题,提出一种融合不同尺度类别关系的农作物遥感图像语义分割网络CRNet(class relation network)。方法该网络将ResNet-34作为编码器的主干网络提取图像特征,并采用特征金字塔结构融合高阶语义特征和低阶空间信息,增强网络对图像细节的处理能力。引入类别关系模块获取不同尺度的类别关系,利用一种新的类别特征加强注意力机制(class feature enhancement, CFE)结合通道注意力和加强位置信息的空间注意力,使得农作物类间的语义差异和农作物类内的相关性增大。在解码器中,将不同尺度的类别关系融合,增强了网络对不同尺度农作物特征的识别能力,从而提高了对农作物边界分割的精度。通过数据预处理、数据增强和类别平衡损失函数(class-balanced loss, CB loss)进一步缓解了农作物遥感图像中类别不平衡的问题。结果在Barley Remote Sensing数据集上进行的实验表明,CRNet网络的平均交并比(mean intersection over union, MIoU)和总体分类精度(overall accuracy, OA)分别达到68.89%和82.59%,性能在评价指标和可视化效果上均优于PSPNet(pyramid scene parsing network)、FPN(feature pyramid network)、LinkNet、DeepLabv3+、FarSeg(foreground-aware relation network)以及STLNet(statistical texture learning network)。结论 CRNet网络通过类别关系模块,在遥感图像复杂的地物背景中更加精准地区分相似的不同农作物,识别特征差异大的同种农作物,并融合多级特征使得提取出的目标边界更加清晰完整,提高了分割精度。 Objective Remote sensing based image processing technology plays an important role in crop planning, vegetation detection and agricultural land detection. The purpose of crop-relevant remote sensing image semantic segmentation is to classify the crop-relevant remote sensing image at pixel level and segment the image into regions with different semantic identification. The semantic segmentation of crop-relevant remote sensing image has been challenging in contrast to natural scene on the two aspects: 1) the number of samples of different categories varies greatly and the distribution is extremely unbalanced. For example, there are much more background-related samples with less samples remaining. The following overfitting and poor robustness problems are appeared for network training. 2) The similarity of appearance features of different crops is presented higher, which makes it difficult to distinguish similar appearance for the network, while the appearance features of the same crop are different, which could cause misclassify the same crop. We develop a semantic segmentation network called class relation network(CRNet) for crop-relevant remote sensing image, which integrates multiple scale class relations. Our experimental data is carried out on Barley Remote Sensing Dataset derived from the Tianchi Big Data Competition. Since the dataset consists of 4 large-size high-resolution remote sensing images, it cannot be as an input to a neural network. First, it is necessary to process the image and cut it into many sub-graphs of 512×512 pixels. Next, there are 11 750 sub-graphs in the dataset after cutting, including 9 413 images in the training set and 2 337 images in the test set. The ratio of the training set is about 4 ∶1 to the test set. Method Our CRNet is composed of three parts like variant of feature pyramid network encoder, category relation module and decoder. 1) In the encoder, ResNet-34 is used as the backbone network to extract the image features from bottom to top gradually, which can process image details better. Similar to the original feature pyramid structure(from top to bottom), horizontal links are used to fuse high-level semantic features and low-level spatial information. 2) The category relation module consists of three layers of paralleled structure. After the features of the three layers outputted by the encoder pass through the 1×1 convolution layer, the channel dimension is reduced to 5. The 1×1 convolutional layer here can be regarded as a classifier that maps global features into 5 channels, corresponding to the classification category, and each channel can represent features of a targeted category. Then, the feature map of each layer is input into the category feature enhancement(CFE) attention mechanism. The CFE attention module is segmented to channel-based and spatial-relevant. Assigned weights for each category is conducted by learning the correlation between the features of each channel. To clarify the features between different categories, the channel attention mechanism is focused on strengthening the strong-correlated features and suppressing the weak-correlated features. The channel information is encoded in the spatial dimension through global average pooling and global max pooling, and the global context information is modeled to obtain the global features of each channel. The spatial attention module enhances the location information of crops, such as the sites of crops in the farmland. Each location is connected with the horizontal or vertical direction in the feature image via learning the spatial information in the horizontal and vertical directions. The CFE attention module can obtain more distinct features in different categories. The feature differences are identified further between multiple crops. At the same time, more context information is improved for the feature of the same category, which aids to reduce the misclassification of the same crop. 3) In the decoder, the classification relations of different scales are fused and restored to the initial resolution, and the final classification is carried out by fully combining the feature information of each scale. In addition, we use data enhancement to reduce the proportion of background samples and expand the number of samples of other categories. To further alleviate the problem of class imbalance in crop-relevant remote sensing images, a class-balanced loss(CB loss) function is introduced. Result To verify the effectiveness of the CRNet, our training model is tested on Barley Remote Sensing dataset, and the mean intersection over union(MIoU) is 68.89%, and the overall accuracy(OA) is 82.59%. Our CRNet is increased by 7.42%, 4.86%, 4.57%, 4.36%, 4.05%, and 3.63% respectively in MIoU in contrast to the Linknet, pyramid scene parsing network(PSPNet), DeepLabv3+, foreground-aware relation network(FarSeg), statistical texture learning network(STLNet) and feature pyramid network(FPN), and our OA is improved by 4.35%, 2.6%, 3.01%, 2.5%, 2.45% and 1.85% of each. The number of parameters and inference speed of CRNet are reached to 21.98 MB and 68 frames/s. Compared to LinkNet and FPN, its number of parameters and inference speed are increased, which are 7.42% and 4.35% higher than LinkNet, 3.63% and 1.85% higher than FPN in MIoU and OA. Conclusion In the combination of multi-level features and the introduction of category relation module, our CRNet network can distinguish the similar crops more accurately. The same crops are sorted out in the complex ground object background of remote sensing image. The completed target boundary can be extracted more. The experiment shows that our CRNet has its priority for crop-relevant semantic segmentation methods.

作者董荣胜马雨琪刘意李凤英 Dong Rongsheng;Ma Yuqi;Liu Yi;Li Fengying(Guangxi Key Laboratory of Trusted Software,Guilin University of Electronic Technology,Guilin 541004,China)

机构地区桂林电子科技大学广西可信软件重点实验室

出处《中国图象图形学报》 CSCD 北大核心 2022年第11期3382-3394,共13页 Journal of Image and Graphics

基金国家自然科学基金项目(62062029,61762024) 广西自然科学基金项目(2017GXNSFDA198050)。

关键词农作物遥感图像语义分割类别关系模块注意力机制类别平衡损失函数(CB loss) Barley Remote Sensing数据集 crop remote sensing image semantic segmentation category relation module attention mechanism class-balanced loss(CB loss) Barley Remote Sensing dataset

分类号 TP751.1 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献4

1侯红英,高甜,李桃.图像分割方法综述[J].电脑知识与技术,2019,15(2Z):176-177. 被引量：16
2炼晨.遥感技术让农业生产更“智慧”[J].中国农资,2021(6):15-15. 被引量：2
3刘硕.阈值分割技术发展现状综述[J].科技创新与应用,2020(24):129-130. 被引量：25
4王秋萍,张志祥,朱旭芳.图像分割方法综述[J].信息记录材料,2019,20(7):12-14. 被引量：39

二级参考文献13

1罗希平,田捷,诸葛婴,王靖,戴汝为.图像分割方法综述[J].模式识别与人工智能,1999,12(3):300-312. 被引量：233
2王森,伍星,刘韬,张印辉.基于二进小波变换的多尺度图切割方法[J].计算机工程与应用,2015,51(13):9-14. 被引量：10
3姜枫,顾庆,郝慧珍,李娜,郭延文,陈道蓄.基于内容的图像分割方法综述[J].软件学报,2017,28(1):160-183. 被引量：132
4侯红英,高甜,李桃.图像分割方法综述[J].电脑知识与技术,2019,15(2Z):176-177. 被引量：16
5顾梅花,苏彬彬,王苗苗,王志磊.彩色图像灰度化算法综述[J].计算机应用研究,2019,36(5):1286-1292. 被引量：47
6李妍妍,田瑞娟,张弦弦.一种基于帧差法结合Kalman滤波的运动目标跟踪方法[J].兵工自动化,2019,38(4):24-27. 被引量：6
7朱学海,张帅,张东星,张阿平,罗陨飞.基于机器视觉与深度学习的船舶水尺智能识别技术研究与应用[J].检验检疫学刊,2019,29(2):101-104. 被引量：7
8陈华,孙宇晨.基于计算机视觉的交通流实时监控综述[J].微型电脑应用,2019,35(5):1-4. 被引量：6
9陈申渭,马汉杰,冯杰,许佳立.摄屏类图像重构算法[J].计算机系统应用,2019,28(5):110-118. 被引量：2
10金旭旸,贾鹤鸣.基于水波优化算法的多阈值图像分割方法[J].科技创新与生产力,2019(4):51-53. 被引量：1

共引文献74

1李振波,赵远洋,杨普,吴宇峰,李一鸣,郭若皓.基于机器视觉的鱼体长度测量研究综述[J].农业机械学报,2021,52(S01):207-218. 被引量：4
2李玉,张雪英,赵静.泰森多边形区域高斯连接函数的遥感影像分割[J].测绘科学,2022,47(10):142-152. 被引量：1
3张艳红,杨思,徐增波.图像分割技术在服装领域的应用[J].软件导刊,2020,19(4):238-241. 被引量：5
4何文斌,魏爱云,明五一,贾豪杰.基于机器视觉的水果品质检测综述[J].计算机工程与应用,2020,56(11):10-16. 被引量：25
5张冬梅,武杰,李丕丁.基于机器视觉的运动目标检测算法综述[J].智能计算机与应用,2020,10(3):192-195. 被引量：10
6刘硕.阈值分割技术发展现状综述[J].科技创新与应用,2020(24):129-130. 被引量：25
7傅玉川,余行.医学影像自动分割技术在放射治疗中的应用及发展趋势[J].中国医疗器械杂志,2020,44(5):420-424. 被引量：8
8罗琴,李丽.无需初始轮廓的分片常值图像分割模型[J].信息与电脑,2020,32(17):57-59.
9刘怡然,马亚州,张勇,张宏娇.基于计算机视觉的母猪运动规律分析[J].无线互联科技,2020,17(14):70-73.
10孙凯,姚旭峰,黄钢.基于机器学习的白细胞六分类研究[J].软件,2020,41(10):98-101. 被引量：7

同被引文献38

1洪洲.基于ArcGIS的矿山资源遥感监测符号库的建立[J].测绘与空间地理信息,2013,36(2):81-84. 被引量：2
2马彧,何英昊,姜绍君,谢印庆.物联网感知层课程建设探索[J].科技创新导报,2012,9(36):186-186. 被引量：4
3翟伟康,许自舟,张健.河北省近岸海域赤潮灾害特征分析[J].海洋环境科学,2016,35(2):243-246. 被引量：13
4王荣,王昭生,刘晓曼.多尺度多准则的遥感影像线状地物信息提取[J].测绘科学,2016,41(11):146-150. 被引量：8
5卓鑫.近十年福州沿海赤潮的基本特征研究[J].海洋预报,2018,35(4):34-40. 被引量：10
6田萱,王亮,丁琪.基于深度学习的图像语义分割方法综述[J].软件学报,2019,30(2):440-468. 被引量：225
7许晓路,程林,周正钦,韩剑,李梦齐,陈佳.基于移动作业的变电智能运检装置设计与应用[J].仪表技术与传感器,2019(6):115-117. 被引量：6
8陈顺,孟青青,李登峰.结合图像增强和改进Canny算子的遥感图像边缘检测[J].河南大学学报（自然科学版）,2020,50(5):623-630. 被引量：11
9吕小艳,竞霞,薛琳,徐海清,张超,黄健熙.遥感技术在烟草长势监测及估产中的应用进展[J].中国农学通报,2020,36(25):137-141. 被引量：12
10张阳,屠乃美,陈舜尧,谢会雅,傅雪平,邓浏平.基于Sentinel-2A数据的县域烤烟种植面积提取分析[J].烟草科技,2020,53(11):15-22. 被引量：9

引证文献5

1王振,姚玲洁,钟方强.图像语义分割在智慧农业中的应用[J].信息与电脑,2022,34(24):32-34. 被引量：1
2曾维军,赵昊,郑宏刚,廖丽君,葛兴燕,郭晓飞.基于“点-线-面-体”的遥感与物联网高原农业监测技术体系构建[J].数字技术与应用,2023,41(8):154-156. 被引量：1
3崔宾阁,方喜,路燕,黄玲,刘荣杰.RTDNet:面向高分辨率卫星影像的赤潮探测网络[J].中国图象图形学报,2023,28(12):3911-3921.
4郝戍峰,高宇,刘萍,李宇昂,张华栋,任鸿杰,田帅杰,寇文韬.融合轻量化ASPP和U-Net的遥感影像烤烟种植区域提取[J].航天返回与遥感,2024,45(4):139-149.
5蔡翔宇.基于改进DeepLabV3+模型的遥感图像语义分割[J].计算机科学与应用,2023,13(3):587-600.

二级引证文献2

1朱锦钊.基于深度学习的遥感图像语义分割技术研究与应用[J].价值工程,2023,42(34):109-111.
2杜娟娟,魏秋娟,武月莲,张立东,张凤起,唐丽华,麻建丽,赵陵南,孟和达来.基于物联网的智慧农业数据采集与管理系统设计[J].现代农业装备,2024,45(3):50-53. 被引量：1

1郝达慧,王池社,陈敏.基于深度学习的复杂场景下车牌定位与识别[J].现代计算机,2021,27(24):119-123. 被引量：3
2Evaldo Araújo de Oliveira,Augusto José Pereira Filho.Looking at the Statistical Texture Approach Applied to Weather Radar Rainfall Fields[J].Journal of Geographic Information System,2022,14(1):29-39.
3Liqiang LIN,Pengdi HUANG,Chi-Wing FU,Kai XU,Hao ZHANG,Hui HUANG.On learning the right attention point for feature enhancement[J].Science China(Information Sciences),2023,66(1):127-139.
4杨云,周瑶,陈佳宁.基于多尺度混合卷积网络的高光谱图像分类[J].液晶与显示,2023,38(3):368-377. 被引量：3
5岳有军,耿连欣,赵辉,王红君.基于ARD-PSPNet网络下的水下鱼类图像分割算法研究[J].光电子．激光,2022,33(11):1173-1182. 被引量：3
6袁军,陈欣悦,常时新,王乐,周自明.基于T1增强成像的人工智能算法在肛瘘内口诊断中的可行性研究[J].安徽医药,2023,27(3):447-452. 被引量：1
7张瑞欣,张正炳.基于改进DenseNet网络的脑部MRI图像分类模型[J].信息技术与信息化,2022(12):123-126. 被引量：1
8于志强,文永华,高明虎,杨曼.基于语义差异的汉-缅平行句对生成方法[J].云南民族大学学报（自然科学版）,2023,32(1):118-123.
9郭崇,齐天缘.基于孪生网络的天空图像辐射照度预测[J].信息技术与信息化,2023(1):150-153.
10彭豪,李晓明.利用金字塔空间注意力与特征推理的图像修复[J].计算机辅助设计与图形学学报,2023,35(1):87-98. 被引量：2

中国图象图形学报

2022年第11期

浏览历史

内容加载中请稍等...

加强类别关系的农作物遥感图像语义分割被引量：5

参考文献4

二级参考文献13

共引文献74

同被引文献38

引证文献5

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

加强类别关系的农作物遥感图像语义分割 被引量：5

参考文献4

二级参考文献13

共引文献74

同被引文献38

引证文献5

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

加强类别关系的农作物遥感图像语义分割被引量：5