期刊文献+

基于CNN和Transformer混合网络模型的车道线检测

Lane Line Detection Based on CNN and Transformer Hybrid Network
下载PDF
导出
摘要 车道线检测技术在自动驾驶系统中发挥着重要作用,目前基于深度学习的车道线检测方法通常在主干网络提取特征之后分别获取车道线关键点的置信度以及这些点相对车道线起始点的偏移。但由于车道线是细长结构,现有的主干网络无法有效提取这种结构特征,偏移网络也难以回归车道线上关键点相对起始点的偏移。鉴于注意力机制在提取空间结构特征、表征长距离图像序列间依赖关系方面的优越性能,在基于点的车道线检测方法的基础上提出了一种基于卷积神经网络(convolutional neural network,CNN)和Transformer的混合网络(CNN-Transformer hybrid network,CTNet)模型,该模型通过特征金字塔和增强的坐标注意力机制提高特征的表征能力,使用基于视觉Transformer的偏移网络回归关键点的偏移量,因此,CTNet能够提取细长车道线特征、捕获长距离点间的偏移,有效提升车道线检测的精度。实验对比了CTNet和6种常用车道线检测算法在数据集TuSimple和CULane上的效果,在TuSimple上CTNet各项精度指标均优于现有方法,在CULane数据集的9种不同车道场景中,CTNet在6个场景中取得了最佳精度。 Lane detection technology plays a crucial role in autonomous driving systems.Currently,deep learning-based methods for lane detection typically involve extracting fea-tures from a backbone network,followed by confidence estimation of key points on the lane lines and their offsets relative to a starting point.However,existing backbone networks struggle to effectively capture features of elongated lanes,and offset networks face challenges in regressing the offsets of key points along the lane line.In this paper,we propose a hybrid network model called CTNet(CNN-Transformer hybrid network)based on a pointbased lane detection approach.CTNet enhances feature representation through a feature pyramid network and an augmented coordinate attention mechanism.Additionally,it employs a vision transformer-based offset network to regress crucial offsets.Consequently,CTNet extracts elongated lane line features,captures long-range offsets between points,and significantly improves the accuracy of lane detection.Experiments conducted on the TuSimple and CULane datasets demonstrate that CTNet outperforms six commonly used lane detection algorithms across various accuracy metrics.Specifically,CTNet achieves superior results on TuSimple across all evaluation metrics.Furthermore,when tested across nine different lane scenarios in the CULane dataset,CTNet achieves the highest accuracy in six scenarios.
作者 唐洪 邓锋 张恺 聂学方 李光辉 TANG Hong;DENG Feng;ZHANG Kai;NIE Xuefang;LI Guanghui(School of Information and Software Engineering,East China Jiaotong University,Nanchang 330013,Jiangxi,China;Jiangxi Transportation Institute Co.,Ltd.,Nanchang 330038,Jiangxi,China)
出处 《应用科学学报》 CAS CSCD 北大核心 2024年第5期871-883,共13页 Journal of Applied Sciences
基金 国家自然科学基金(No.52062016) 江西省03专项(No.20203ABC03W07) 江西省自然科学基金面上项目(No.20212BAB202009) 江西省自然科学基金(No.20212BAB202004) 江西省教育厅科学基金(No.GJJ190319)资助。
关键词 车道线检测 视觉Transformer 坐标注意力 特征金字塔网络 lane line detection visual Transformer coordinate attention(CA) feature pyramid network(FPN)
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部