摘要
本文针对场景中目标多样性和尺度不统一等现象造成的边缘分割错误、特征不连续问题,提出了一种交叉特征融合和RASPP驱动的场景分割方法.该方法以交叉特征融合的方式合并编码器输出的多尺度特征,在融合高层语义信息时使用复合卷积注意力模块进行处理,避免上采样操作造成的特征信息丢失以及引入噪声的影响,细化目标边缘分割效果.同时提出了深度可分离残差卷积,在此基础上设计并实现了结合残差的金字塔池化模块——RASPP,对交叉融合后的特征进行处理,获得不同尺度的上下文信息,增强特征语义表达.最后,将RASPP模块处理后的特征进行合并,提升分割效果.在Cityscapes和CamVid数据集上的实验结果表明,本文提出方法相比现有方法具有更好的表现,并且对场景中的目标边缘有更好的分割效果.
This study proposes a cross feature fusion and RASPP-driven scene segmentation method to address the edge segmentation errors and feature discontinuity caused by target diversity and scale inconsistency in the scenes.This method combines the multi-scale features output by the encoder in the way of cross feature fusion and employs the compound convolution attention module to process high-level semantic information fusion.As a result,this avoids the feature information loss caused by the upsampling operation and the influence of noise and refines the segmentation effect of target edges.Meanwhile,this study proposes a depthwise separable convolution combining residual connections.Based on this,a pyramid pooling module RASPP combining residuals is designed and implemented to process the features after cross fusion,obtain contextual information at different scales,and enhance feature semantic expression.Finally,the features processed by the RASPP module are merged to improve the segmentation effect.The experimental results on the Cityscapes and CamVid datasets show that the proposed method outperforms existing methods and has better segmentation performance on target edges in the scenes.
作者
朱新杰
熊风光
谢帅康
宋宁栋
李文清
ZHU Xin-Jie;XIONG Feng-Guang;XIE Shuai-Kang;SONG Ning-Dong;LI Wen-Qing(School of Data Science and Technology,North University of China,Taiyuan 030051,China;Shanxi Province’s Vision Information Processing and Intelligent Robot Engineering Research Center,North University of China,Taiyuan 030051,China;Shanxi Key Laboratory of Machine Vision and Virtual Reality,North University of China,Taiyuan 030051,China)
出处
《计算机系统应用》
2024年第1期76-86,共11页
Computer Systems & Applications
基金
国家自然科学基金(62272426)
山西省回国留学人员科研基金(2020-113)
山西省科技成果转化引导专项基金(202104021301055)
山西省科技重大专项计划“揭榜挂帅”基金(202201150401021)
山西省自然科学基金(202203021212138,202303021211153,202203021222027)。
关键词
语义分割
交叉特征融合
金字塔池化
注意力机制
深度可分离卷积
semantic segmentation
cross feature fusion
pyramid pooling
attention mechanism
depthwise separable convolution