期刊文献+

基于改进SwiftNet的堆场图像实时分割网络

Real-Time Segmentation Network of Yard Images Based on Improved SwiftNet
下载PDF
导出
摘要 在堆场环境下,实时图像语义分割可以提供直观的场景类别信息。为节约工控机等边缘设备的硬件资源以及为多源信息融合提供图像语义类别信息,提出一种轻量化的实时语义分割网络模型。首先提出基于空间注意力引导的上采样融合模块,通过引入空间注意力和残差注意力结构设计一种轻量化的解码器,在上采样过程中还原空间细节,抑制冗余信息,进而融合不同来源的特征图;其次提出一种轻量化的级联空洞空间金字塔模块,利用级联的空洞卷积单元增大网络感受野,有效提取多尺度特征;最后使用通道分离、通道混洗、通道池化等操作,降低多尺度聚合过程中的计算开销。在公开数据集Camvid上,该模型的平均交并比(MIoU)为70.1%,推理速度为146.3帧/s,分割精度和推理速度优于ENet、ICNet等模型,消融实验结果也证明了所提各模块的有效性;在实际堆场图像数据集上,该模型的MIoU为93.5%,推理速度为123.8帧/s,证明模型结构具有良好的泛化性能。 In a storage yard environment,real-time image semantic segmentation can provide intuitive scene category information.To save the limited hardware resources of edge equipment,such as industrial computers,and provide image semantic category information for multi-source information fusion,this study proposes a lightweight real-time semantic segmentation network model.First,an upsampling fusion module based on spatial attention guidance is proposed.By introducing a spatial attention and residual attention structure,a lightweight decoder is designed to restore spatial details in the upsampling restoration process,suppress redundant information,and fuse feature maps from different sources.Second,a lightweight cascaded atrous space pyramid module is proposed,which uses cascaded atrous convolution elements to enhance the network receptive field and effectively extract multi-scale features.Simultaneously,the calculation cost of multi-scale polymerization is reduced by channel splitting,channel shufflement,and channel pooling.On the publicly available Camvid dataset,the Mean Intersection over Union(MIoU)of the model is 70.1%,inference speed is 146.3 frame/s,and the segmentation accuracy and inference speed are better than those of models such as ENet and ICNet.The ablation experiment results also prove the effectiveness of the proposed modules.In the actual storage yard image dataset,the MIoU of the model is 93.5%,and the inference speed is 123.8 frame/s,proving that the model structure has good generalization performance.
作者 陈晓玉 沈晨 沈阅 孔德明 CHEN Xiaoyu;SHEN Chen;SHEN Yue;KONG Deming(School of Information Science and Engineering,Yanshan University,Qinhuangdao 066004,Hebei,China;Hebei Yandayanruan Information System Technology Company,Qinhuangdao 066000,Hebei,China;School of Electrical Engineering,Yanshan University,Qinhuangdao 066004,Hebei,China)
出处 《计算机工程》 CAS CSCD 北大核心 2024年第6期296-303,共8页 Computer Engineering
基金 国家自然科学基金(62173289) 航空科学基金(20200016099002)。
关键词 实时语义分割 注意力机制 空洞卷积 感受野 堆场图像 real-time semantic segmentation attention mechanism atrous convolution receptive field yard image
  • 相关文献

参考文献4

二级参考文献29

  • 1苏金玲,王朝晖.基于Graph Cut和超像素的自然场景显著对象分割方法[J].苏州大学学报(自然科学版),2012,28(2):27-33. 被引量:7
  • 2汪海洋,潘德炉,夏德深.二维Otsu自适应阈值选取算法的快速实现[J].自动化学报,2007,33(9):968-971. 被引量:134
  • 3Lee C, Hun S, Ketter T A, et al. Unsupervised connectivitybased thresholding segmentation of midsagittal brain MR images[J]. Computers in Biology and Medicine, 1998, 28(3): 309~338.
  • 4McInerney T, Terzopoulos D. Deformable models in medical image analysis: A survey [J]. Medical Image Analysis, 1996, 1(2): 91~108.
  • 5Orphanoudakis S C, Tziritas G, Haris K. A hybrid algorithm for the segmentation of 2D/3D images [A]. In: Proceedings of International Conference on Information Processing in Medical Imaging, Brest, 1995. 385~386.
  • 6Pohle R, Toennies K D. Segmentation of medical images using adaptive region growing [A]. In: Proceedings of SPIE,Boston, Massachusetts, 2001, 4322: 1337~1346.
  • 7Pohle R, Tonnies K D. A new approach for model-based adaptive region growing in medical image analysis [A]. In:Proceedings of the 9th International Conference on Computer Analysis and Patterns, Warsaw, 2001. 238~246.
  • 8Zheng L, Jin J, Hugues T. Unseeded region growing for 3D image segmentation [J]. Journal of Research and Practice in Information Technology, 2001, 2:31~37.
  • 9Law T Y, Heng P A. Automated extraction of bronchus from3D CT images of lung based on genetic algorithm and 3D region growing [A]. In: Proceedings of SPIE, San Jose, California,2000, 3979:906~916.
  • 10Perona P, Malik J. Scale-space and edge detection using anisotropic diffusion [J]. IEEE Transactions on Pattern Analysis Machine Intelligence, 1990, 12 (7): 629~ 639.

共引文献319

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部