Real-Time Segmentation Network of Yard Images Based on Improved SwiftNet

Abstract: In a storage yard environment, real-time semantic image segmentation provides intuitive scene category information. To conserve the limited hardware resources of edge devices such as industrial computers, and to supply image semantic category information for multi-source information fusion, this study proposes a lightweight real-time semantic segmentation network. First, an upsampling fusion module guided by spatial attention is proposed: by introducing spatial attention and a residual attention structure, a lightweight decoder is designed that restores spatial details during upsampling, suppresses redundant information, and fuses feature maps from different sources. Second, a lightweight cascaded atrous spatial pyramid module is proposed, which uses cascaded atrous convolution units to enlarge the network's receptive field and extract multi-scale features effectively. Finally, channel splitting, channel shuffling, and channel pooling are used to reduce the computational cost of multi-scale aggregation. On the public CamVid dataset, the model achieves a mean Intersection over Union (MIoU) of 70.1% at an inference speed of 146.3 frame/s, outperforming models such as ENet and ICNet in both segmentation accuracy and inference speed; ablation experiments also confirm the effectiveness of each proposed module. On a real storage yard image dataset, the model achieves an MIoU of 93.5% at an inference speed of 123.8 frame/s, indicating that the model structure generalizes well.
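For readers unfamiliar with the metric, the MIoU values quoted in the abstract are conventionally obtained by computing, for each class, the ratio of correctly labeled pixels to the union of predicted and ground-truth pixels of that class, and then averaging over classes. The following is a minimal pure-Python sketch of that standard definition, not the authors' evaluation code. The function names, the flattened label lists, and the treatment of out-of-range ground-truth labels (for example 255) as "ignore" are all assumptions made for illustration.

```python
from typing import List, Sequence


def confusion_matrix(pred: Sequence[int], target: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count pixels; rows are ground-truth classes, columns are predictions.

    Assumes every predicted label lies in [0, num_classes).
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        # Assumption: ground-truth labels outside the class range
        # (e.g. 255) mark pixels excluded from the evaluation.
        if 0 <= t < num_classes:
            matrix[t][p] += 1
    return matrix


def mean_iou(pred: Sequence[int], target: Sequence[int],
             num_classes: int) -> float:
    """Average per-class IoU = TP / (TP + FP + FN) over the classes present."""
    matrix = confusion_matrix(pred, target, num_classes)
    ious = []
    for c in range(num_classes):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                                   # missed pixels of class c
        fp = sum(matrix[r][c] for r in range(num_classes)) - tp    # pixels wrongly labeled c
        union = tp + fp + fn
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and one ignored pixel (label 255).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 255]
print(round(mean_iou(pred, target, 3), 4))  # 0.7222
```

In the toy example the per-class IoUs are 1/2, 2/3, and 1, so the mean is 13/18. Real evaluation pipelines usually accumulate the confusion matrix over the whole test set before taking the ratio, rather than averaging per-image scores.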

Authors: CHEN Xiaoyu; SHEN Chen; SHEN Yue; KONG Deming

Affiliations: School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, Hebei, China; Hebei Yandayanruan Information System Technology Company, Qinhuangdao 066000, Hebei, China; School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, Hebei, China

Source: Computer Engineering (《计算机工程》), 2024, No. 6, pp. 296-303 (8 pages); indexed in CAS, CSCD, and the Peking University Core Journals list (北大核心).

Funding: National Natural Science Foundation of China (62173289); Aeronautical Science Foundation of China (20200016099002).

Keywords: real-time semantic segmentation; attention mechanism; atrous convolution; receptive field; yard image

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
