Anchor Frame Calibration and Spatial Position Information Compensation for Street Scene Video Instance Segmentation

Abstract: Street scene video instance segmentation is one of the key problems in self-driving research, because it supplies the decision basis for a vehicle's environment perception and path planning in street scenes. Existing methods have two weaknesses. Applying a single receptive field to anchor frames of several aspect ratios leaves edge features under-extracted, and the high levels of the feature pyramid lack detailed spatial position information. To address both, this paper proposes a network for anchor frame calibration and spatial position information compensation for video instance segmentation (AS-VIS). First, an anchor frame calibration module is added to the three branches of the prediction head, so that each anchor frame is sampled with a receptive field type matched to its aspect ratio; this addresses the insufficient extraction of target edges. Second, a multi-receptive-field downsampling module fuses the features sampled under the different receptive fields, which reduces the information lost during downsampling. Finally, the multi-receptive-field downsampling module embeds the activation feature maps of target regions from the low levels of the feature pyramid into the high levels, compensating for the detailed spatial position information that the high-level features lack. A street scene video dataset is extracted from the Youtube-VIS benchmark, with 329 videos in the training set and 53 videos in the validation set. In a quantitative comparison with YolactEdge on detection and segmentation accuracy, anchor frame calibration improves average precision by 8.63% and 5.09% respectively, the spatial position information compensation feature pyramid improves it by 7.76% and 4.75%, and the full AS-VIS network improves it by 9.26% and 6.46%. The proposed method performs instance-level detection, tracking, and segmentation simultaneously on street scene video sequences and provides an effective theoretical basis for environment perception in self-driving vehicles.
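The idea behind anchor frame calibration, sampling each anchor frame with a receptive field shaped like the anchor itself, can be illustrated with a small sketch. This is not the authors' module: the abstract gives no kernel shapes, so the three aspect ratios, the window sizes in `WINDOW_FOR_RATIO`, and the function names `window_mean` and `calibrated_samples` are all hypothetical, and plain mean pooling stands in for whatever learned sampling the network actually uses.

```python
from typing import Dict, List, Tuple

Grid = List[List[float]]

# Hypothetical mapping from anchor aspect ratio (width / height) to a sampling
# window given as (rows, cols). These sizes are illustrative assumptions only;
# the abstract does not state the shapes used in AS-VIS.
WINDOW_FOR_RATIO: Dict[float, Tuple[int, int]] = {
    1.0: (3, 3),  # square anchor -> square receptive field
    2.0: (3, 5),  # wide anchor   -> wide receptive field
    0.5: (5, 3),  # tall anchor   -> tall receptive field
}


def window_mean(feature: Grid, row: int, col: int, window: Tuple[int, int]) -> float:
    """Average the values in a window centred on (row, col), clipped at the borders."""
    half_r, half_c = window[0] // 2, window[1] // 2
    rows, cols = len(feature), len(feature[0])
    values = [
        feature[r][c]
        for r in range(max(0, row - half_r), min(rows, row + half_r + 1))
        for c in range(max(0, col - half_c), min(cols, col + half_c + 1))
    ]
    return sum(values) / len(values)


def calibrated_samples(feature: Grid, row: int, col: int) -> Dict[float, float]:
    """Sample one location once per anchor aspect ratio, each with its own window."""
    return {
        ratio: window_mean(feature, row, col, window)
        for ratio, window in WINDOW_FOR_RATIO.items()
    }


# A 5x5 toy feature map containing a single horizontal edge on the middle row.
toy_feature: Grid = [[1.0 if r == 2 else 0.0 for _ in range(5)] for r in range(5)]
samples = calibrated_samples(toy_feature, 2, 2)
```

On this toy map the wide window keeps the same response to the horizontal edge as the square one while covering more of its length, and the tall window dilutes it. That is the intuition for matching the sampling shape to the anchor shape instead of using one receptive field for every anchor.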

Authors: ZHANG Yin-hui; ZHAO Chong-ren; HE Zi-fen; YANG Hong-kuan; HUANG Ying (Department of Mechanical and Electrical Engineering, Kunming University of Science and Technology, Kunming, Yunnan 650500, China)

Affiliation: Department of Mechanical and Electrical Engineering, Kunming University of Science and Technology

Source: Acta Electronica Sinica (电子学报), 2024, No. 1, pp. 94-106 (13 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (No. 62061022, No. 62171206).

Keywords: street scene; video instance segmentation; anchor frame calibration; spatial information compensation; self-driving vehicle

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]
