Feature map slice for semantic segmentation
(Chinese title: 结合特征图切分的图像语义分割)

Cited by: 11


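Abstract (translated from the Chinese)

Objective: Semantic segmentation based on fully convolutional networks has become the mainstream research direction in the field. In this framework, however, repeated downsampling of the feature maps progressively lowers the resolution, so small targets are lost, edges become coarse, and segmentation results suffer. To solve or mitigate this problem, a semantic segmentation method based on feature map slicing is proposed.

Method: The method consists of two operations: slicing the middle-layer feature maps and extracting features from the slices. The slicing module divides a middle-layer feature map into several equal parts and upsamples each part to the size of the original feature map, which raises the resolution of every sliced region. Each sliced feature map then passes through a parameter-shared feature extraction module, whose multi-scale convolutions and attention mechanism exploit the contextual and discriminative information of each slice, directing attention to small objects in local regions and making them easier to discriminate. The extracted features are then fused with the original network output, so that middle-layer features are reused more efficiently, with clear improvements in locating small targets, refining segmentation edges, and strengthening the semantic discriminability of the network.

Result: Validation experiments on two urban road datasets, CamVid and GATECH, demonstrate the effectiveness of the method. The mean intersection-over-union reaches 66.3% on CamVid and 52.6% on GATECH.

Conclusion: The feature-map-slicing method makes better use of the spatial distribution of image regions, strengthens the network's ability to assign semantic categories at different spatial positions and its attention to small objects, provides more effective contextual and global information, improves the discrimination of small objects, and improves overall segmentation performance.

Abstract (English)

Objective: Deep convolutional neural networks have recently shown outstanding performance in object recognition and have become the first choice for dense classification problems such as semantic segmentation. Methods based on fully convolutional networks are now the main research direction in image semantic segmentation. However, the repeated downsampling operations in these methods, such as pooling or strided convolution, significantly reduce the initial image resolution, which results in poor object delineation, loss of small targets, and weak segmentation output. Although several recent studies have addressed this problem, how to handle it effectively remains an open question that deserves further attention. This study proposes a feature map slice module for semantic segmentation to address it.

Method: The proposed method has two parts: slicing of the middle-layer feature map and a corresponding feature extraction network. The feature map slice module operates on the middle-layer feature map. The feature map is sliced into several small cubes, and each cube is then upsampled to the resolution of the original feature map, which enlarges the small targets in the local area. Through this module, each cube corresponds to a subregion of the original feature map.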
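The slice-and-upsample step described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `slice_and_upsample`, the 2 x 2 grid, the `(C, H, W)` layout, and the use of nearest-neighbour upsampling are all assumptions made for illustration, since the abstract does not state the grid size or the interpolation method.

```python
import numpy as np


def slice_and_upsample(feature_map, grid=2):
    """Split a (C, H, W) feature map into grid x grid equal tiles and
    upsample each tile back to (C, H, W).

    Hypothetical helper: nearest-neighbour upsampling and the default
    grid size are assumptions, not details taken from the paper.
    """
    c, h, w = feature_map.shape
    if h % grid or w % grid:
        raise ValueError("H and W must be divisible by grid")
    th, tw = h // grid, w // grid
    tiles = []
    for i in range(grid):
        for j in range(grid):
            # One subregion ("cube") of the original feature map.
            tile = feature_map[:, i * th:(i + 1) * th, j * tw:(j + 1) * tw]
            # Nearest-neighbour upsampling: repeat rows and columns grid times.
            tiles.append(tile.repeat(grid, axis=1).repeat(grid, axis=2))
    return tiles


# Toy input: 2 channels, 4 x 4 spatial size.
fmap = np.arange(2 * 4 * 4, dtype=np.float32).reshape(2, 4, 4)
tiles = slice_and_upsample(fmap, grid=2)
print(len(tiles), tiles[0].shape)
```

Each returned tile has the spatial size of the input, so objects inside a subregion occupy `grid` times as many rows and columns as before.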
After upsampling, the objects in these subregions are enlarged. Small objects that are difficult to detect in the entire feature map can therefore be treated as relatively large objects within their subregions, and feature extraction should focus on them. A weight-shared feature extraction network is thus designed for the sliced feature maps. It applies several convolutions with different kernel sizes to extract feature information at different scales. For each input of the network, the dimension is halved to save memory, and dilated convolution is adopted to enlarge the receptive field. The feature maps produced by the different convolutions are then concatenated, and a channel-attention operation is applied. Because the network combines multi-scale convolution with an attention mechanism, it can extract semantic category information from each subregion that passes through it and can effectively provide the contextual, global, and discriminative information of each slice. Accordingly, the network can focus on small objects in local areas and improve their discriminability.

Each cube passes through the feature extraction network, and the extracted features are assembled at their corresponding positions into a single mosaic feature map. The original network output is upsampled and fused with the mosaic feature map by an element-wise max operation. In this way, the middle-layer features are reused efficiently. To exploit middle-layer feature information further, the module is introduced at multiple scales, which enhances the extraction of small-target characteristics and spatial information in local areas.
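The assemble-and-fuse step can be sketched as follows. This is a rough illustration under stated assumptions, not the published network: `slice_extract_fuse` and `upsample_nearest` are hypothetical names, the `extract` argument stands in for the weight-shared feature extraction network (an identity function is used in the example), and the sketch assumes that the network output has the same `(C, H, W)` shape as the middle-layer feature map and that `extract` preserves shape.

```python
import numpy as np


def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a (C, H, W) array by an integer factor."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)


def slice_extract_fuse(feature_map, network_output, extract, grid=2):
    """Slice, extract per tile, assemble a mosaic, and fuse by element-wise max.

    `extract` is a placeholder for the shared feature extraction network;
    passing the same callable to every tile mimics weight sharing.
    """
    c, h, w = feature_map.shape
    if h % grid or w % grid:
        raise ValueError("H and W must be divisible by grid")
    th, tw = h // grid, w // grid
    rows = []
    for i in range(grid):
        row = []
        for j in range(grid):
            tile = feature_map[:, i * th:(i + 1) * th, j * tw:(j + 1) * tw]
            row.append(extract(upsample_nearest(tile, grid)))
        # Place the tiles of one grid row side by side (along width).
        rows.append(np.concatenate(row, axis=2))
    # Stack the grid rows (along height): mosaic is (C, grid*H, grid*W).
    mosaic = np.concatenate(rows, axis=1)
    # Bring the original output to the mosaic resolution, then fuse.
    return np.maximum(mosaic, upsample_nearest(network_output, grid))


fmap = np.array([[[1.0, 2.0], [3.0, 4.0]]])  # (1, 2, 2) toy feature map
output = np.full((1, 2, 2), 2.5)             # toy network output
fused = slice_extract_fuse(fmap, output, extract=lambda t: t, grid=2)
print(fused.shape)
```

Element-wise max keeps, at every position, whichever of the two branches responds more strongly, so the fusion adds no parameters of its own.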
The multi-scale design also exploits semantic information at different scales and clearly improves small-target feature extraction, segmentation edge refinement, and network discrimination.

Result: The proposed method is verified on two urban scene-understanding datasets, CamVid and GATECH. Both datasets contain many common urban scene objects, such as buildings, cars, and cyclists. Several ablation experiments are conducted on the two datasets, and mean intersection-over-union scores of 66.3% and 52.6% are obtained on CamVid and GATECH, respectively.

Conclusion: The proposed method exploits the spatial distribution information of images, enhances the network's ability to determine the semantic categories at different spatial locations, pays closer attention to small target objects, and provides effective contextual and global information. Because different resolutions provide rich scale information, the method is extended to multiple resolutions of the network. It thereby makes use of middle-layer feature information, improves the network's ability to discriminate small target objects, and enhances the overall segmentation performance of the network.
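For readers unfamiliar with the metric reported in the Result paragraph, the sketch below computes mean intersection-over-union in its usual form: per-class IoU from a confusion matrix, averaged over the classes that occur. The function name `mean_iou` and the toy labels are illustrative assumptions, and the abstract does not describe the paper's exact evaluation protocol (for example, how unlabeled pixels are treated).

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes that appear in pred or target."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # confusion[t, p] counts pixels of true class t predicted as class p.
    confusion = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)
    intersection = np.diag(confusion)
    # union = predicted pixels + true pixels - pixels counted in both.
    union = confusion.sum(axis=0) + confusion.sum(axis=1) - intersection
    valid = union > 0  # skip classes absent from both pred and target
    return float((intersection[valid] / union[valid]).mean())


target = [[0, 0], [1, 1]]
pred = [[0, 1], [1, 1]]
score = mean_iou(pred, target, num_classes=2)
print(round(score, 4))
```

In this toy case class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12.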

Authors: Cao Fengmei (曹峰梅); Tian Haijie (田海杰); Fu Jun (付君); Liu Jing (刘静) (School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China)

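Affiliations: School of Optics and Photonics, Beijing Institute of Technology; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences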

Source: Journal of Image and Graphics (《中国图象图形学报》); indexed in CSCD and the Peking University Core Journals list (北大核心); 2019, No. 3, pp. 464-473 (10 pages)

Funding: National Natural Science Foundation of China (61472422)

Keywords: deep learning; fully convolutional neural networks; semantic segmentation; scene parsing; feature slice; multiple scales; feature reuse

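Classification codes: TP391.41 [Automation and Computer Technology: Computer Application Technology]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]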

