Abstract
Semantic segmentation of street view images is one of the main research tasks in autonomous driving, with direct relevance to path planning and pedestrian safety. At present, it mainly suffers from inaccurate segmentation of small target objects and model overfitting. To address these issues, a street view image semantic segmentation model combining a generative adversarial network with a hybrid attention mechanism is proposed. Specifically, a multi-scale hybrid attention module is proposed to enhance contextual semantic information, improve feature representation ability, and increase adaptability to multi-scale targets. Meanwhile, to reduce overfitting, a BN layer is introduced and combined with a DCGAN network to construct a generative adversarial segmentation model; training is jointly constrained by a discriminative loss and a segmentation loss to enhance model stability and improve segmentation accuracy. Experimental results show that, compared with DeepLabV3+, the proposed model improves segmentation accuracy on the Cityscapes dataset by 2.4 percentage points, reaching an mIoU of 73.4%.
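The joint objective described in the abstract, in which a segmentation loss and a discriminative (adversarial) loss together constrain training, can be sketched as below. The function names, the λ weighting term, and the toy inputs are illustrative assumptions for exposition, not the paper's implementation:

```python
import math

def segmentation_loss(probs, labels):
    # Pixel-wise cross-entropy: `probs` is a list of per-pixel class
    # probability distributions, `labels` the ground-truth class indices.
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)

def adversarial_loss(d_on_pred):
    # Generator-side discriminative loss: push the discriminator's
    # scores on the predicted segmentation map toward "real" (1).
    return -sum(math.log(d) for d in d_on_pred) / len(d_on_pred)

def total_loss(probs, labels, d_on_pred, lam=0.1):
    # Joint objective: segmentation loss plus a lambda-weighted
    # adversarial term, as in GAN-constrained segmentation training.
    return segmentation_loss(probs, labels) + lam * adversarial_loss(d_on_pred)
```

In practice the segmentation network plays the generator role and the weight λ balances pixel-accurate prediction against fooling the discriminator; here a small λ keeps the adversarial term a regularizer rather than the dominant signal.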
Authors
WU Bingjian, GAO Lin, LI Yanzhi, WU Zhixue, LI Siyuan, LI Qian (College of Blockchain Industry, Chengdu University of Information Technology, Chengdu 610225, China)
Source
Software Guide (《软件导刊》), 2024, No. 11, pp. 187-192 (6 pages)
Funding
Sichuan Provincial Science and Technology Program (2020YFS0316).
Keywords
street view semantic segmentation
generative adversarial network
hybrid attention mechanism
hybrid loss function