端到端自动驾驶系统研究综述

Survey of end-to-end autonomous driving systems

导出

摘要近年深度学习技术助力端到端自动驾驶框架的发展和进步,涌现出一系列创新研究议题与应用部署方案。本文首先以经典的模块化系统切入,对自动驾驶感知—预测—规划—决策4大功能模块进行简要概述,分析传统的模块化和多任务方法的局限性;其次从输入—输出模态到系统架构角度对当前新兴的端到端自动驾驶框架进行广泛地调研,详细描述弱解释性端到端与模块化联合端到端两大主流范式,深入探究现有研究工作存在的不足和弊端;之后简单介绍了端到端自动驾驶系统的开环—闭环评估方法及适用场景;最后总结了端到端自动驾驶系统的研究工作,并从数据挖掘和架构设计角度展望领域潜在挑战和亟待解决的关键问题。 Deep learning technologies have accelerated the development and advancement of end-to-end autonomous driving frameworks in recent years,sparking the emergence of numerous cutting-edge research topics and application deployment solutions.The“divide and conquer”architecture design concept,which aims to construct multiple independent but related module components,integrate them into the developed software system in a specific semantic or geometric order,and ultimately deploy these components to the actual vehicle,is the foundation for the majority of the autonomous driving systems currently in use,also known as modular systems.However,a well-developed modular design typically comprises thousands of components,placing a considerable burden on the graphics memory and processing capacity of automotive CPUs.Furthermore,the intrinsic mistakes of each stacked module during prediction will rise with the number of stacked modules,and upstream flaws cannot be fixed in downstream modules,presenting a major risk to vehicle safety.A multitask architecture based on the“task parallelism”principle aims to efficiently infer multiple tasks in parallel by designing various decoded heads with a shared backbone network to reduce computational consumption.However,the optimization goals for various tasks may not be consistent,and sharing features mindlessly can even degrade the overall performance of the system.In contrast to the previous two system architectures,the end-to-end technology paradigm eliminates information bottlenecks and cumulative errors due to the integration of numerous intermediate components based on rule interfaces,allowing the network to continually optimize toward a unified objective.A large model can be used to generate low-level control signals or vehicle motion planning based on inputs such as sensor data and vehicle status.With sensors serving as inputs,the early end-to-end design based on imitation and reinforcement learning directly outputs the final control commands for steering,braking,and acceleration.However,no explicit representation of driving scenarios in this completely“black box”network,which is also referred to as weakly interpretable end-to-end methods,is available.Thus,understanding the reasoning behind the decision or prediction of a vehicle is difficult for humans,making debugging,validation,and optimization challenging.Even worse,once the model malfunctions or unexpected situations occur,accurately detecting,avoiding,and repairing problems in a timely manner becomes difficult,all of which are crucial for maintaining the safe operation of intelligent vehicles.The component decoupling approach facilitates the development and optimization of individual modules in the conventional modular system,thereby guaranteeing steady representation performance and strong interpretability for each submodule.Unfortunately,this method falls short of achieving unified goals at the optimization level,that is,integrating optimization and learning toward the ultimate planning goal.A modular joint end-to-end autonomous driving architecture,which preserves the modular driving system while allowing the differentiability of each module,is a workable solution to ensure that every module has sufficient interpretability and overall automatic optimization capabilities.The basic idea behind this technology lies in the creation of a unique neural network that connects all independent modules and enables the gradients from the planning modules to be fed back down to the initial sensor input for end-to-end execution.In other words,this kind of approach merely modifies the submodule connection mechanism while maintaining the classic modular technology stack;that is,this approach substitutes a new implicit interface for the previous explicit interfaces,which were rule-based and required manual creation.Modular joint end-to-end procedures offer a certain interpretability because of the distinct separation between modules.The explicit end-to-end system is a relative decoupling based on overall design and exhibits some degree of logic in its sequential functioning from perception to prediction,and then to planning modules during decision inference.The model can be intentionally adjusted when it encounters unknown and uncontrollable results by understanding the operational logic underlying the explicit solution.Furthermore,visualization methods,such as internal features or intermediate results of specific tasks or modules,can be utilized to analyze the decision-making operation mechanism,which can prevent potential risks caused by black box models and ensure the safe and efficient driving of intelligent vehicles.Therefore,this article conducts comprehensive analysis and research on the emerging field of end-to-end autonomous driving with promising development prospects,which summarizes the main technical routes and representative research methods around the development path of end-to-end driving systems.More specifically,this article,which begins with the classic modular system,analyzes the shortcomings of conventional modular and multitasking approaches while providing a brief introduction to the four functional modules of the autonomous driving system.These modules primarily include perception,prediction,planning,and decision making.Subsequently,extensive research on the emerging end-to-end autonomous driving frameworks is conducted from the perspective of input-output modality to system architecture,describing in detail the two dominant paradigms and delving into the shortcomings and drawbacks of existing research work.The existing end-to-end architecture can be categorized into two categories based on interpretable performance:weakly interpretable end-to-end,which is explored from the aspects of imitation learning,reinforcement learning,and interpretability;or modular joint end-to-end,which is progressively investigated from bird’s-eye view representation,to joint perception prediction,and ultimately,planning-oriented end-to-end methods.Afterward,a thorough discussion of the end-to-end driving system assessment is provided for closed-and open-loop evaluations,along with the corresponding situations.Finally,the research works on end-to-end autonomous driving systems are summarized,and the potential challenges and key problems that still need to be addressed are discussed from the perspectives of data mining and architecture design.

作者陈妍妍田大新林椿眄殷鸿博 Chen Yanyan;Tian Daxin;Lin Chunmian;Yin Hongbo(School of Transportation Science and Engineering,Beihang University,Beijing 102200,China)

机构地区北京航空航天大学交通科学与工程学院

出处《中国图象图形学报》 CSCD 北大核心 2024年第11期3216-3237,共22页 Journal of Image and Graphics

基金国家自然科学基金项目(U20A20155,62173012,52202391)。

关键词人工智能(AI) 自动驾驶模块式系统端到端系统数据驱动可解释性 artificial intelligence(AI) autonomous driving modular driving system end-to-end system data driven interpretability

分类号 U495 [交通运输工程—交通运输规划与管理]

引文网络
相关文献

参考文献3

1潘峰,鲍泓.强化学习的自动驾驶控制技术研究进展[J].中国图象图形学报,2021,26(1):28-35. 被引量：15
2李熙莹,叶芝桧,韦世奎,陈泽,陈小彤,田永鸿,党建武,付树军,赵耀.基于图像的自动驾驶3D目标检测综述——基准、制约因素和误差分析[J].中国图象图形学报,2023,28(6):1709-1740. 被引量：7
3李升波,刘畅,殷玉明,段京良,王建强,李克强.汽车端到端自动驾驶系统的关键技术与发展趋势[J].人工智能,2023(5):1-16. 被引量：11

二级参考文献35

1孟醒:滴滴自动驾驶的自我进化[J].智能网联汽车,2021(3):42-44. 被引量：1
2郭文佳.从载人测试到取消安全员无人驾驶出租车渐行渐近[J].智能网联汽车,2021(1):21-25. 被引量：1
3陈念航.挑战特斯拉FSD,百度Apollo推出领航辅助驾驶ANP[J].企业观察家,2020(12):66-67. 被引量：2
4徐友春,王荣本,李兵,李斌.世界智能车辆近况综述[J].汽车工程,2001,23(5):289-295. 被引量：64
5李德毅.脑认知的形式化——从研发机器驾驶脑谈开去[J].科技导报,2015,33(24):125-125. 被引量：10
6李克强,戴一凡,李升波,边明远.智能网联汽车(ICV)技术的发展现状及趋势[J].汽车安全与节能学报,2017,8(1):1-14. 被引量：427
7韩向敏,鲍泓,梁军,潘峰,玄祖兴.一种基于深度强化学习的自适应巡航控制算法[J].计算机工程,2018,44(7):32-35. 被引量：13
8赵华卿,方志军,高永彬.三维目标检测中的先验方向角估计[J].传感器与微系统,2019,38(6):35-38. 被引量：2
9李升波,关阳,侯廉,高洪波,段京良,梁爽,汪玉,成波,李克强,任伟,李骏.深度神经网络的关键技术及其在自动驾驶领域的应用[J].汽车安全与节能学报,2019,10(2):119-145. 被引量：29
10赵邢,梁浩然,梁荣华.结合目标检测与双目视觉的三维车辆姿态检测[J].计算机辅助设计与图形学学报,2019,31(9):1518-1527. 被引量：8

共引文献29

1卢立阳,朱丽丽,刘楠,刘博.基于云边端协同的高速公路云控系统能力验证研究[J].公路交通科技,2022,39(S01):154-160. 被引量：1
2耿俊香,姜静,魏胜楠,段昶.CIDDPG的多智能体通信优化方法研究[J].沈阳理工大学学报,2021,40(4):29-34. 被引量：1
3王仕雄,许王勇,张本松.自动驾驶实验车故障诊断实验综述报告[J].延安职业技术学院学报,2021,35(5):98-101.
4杨钦宁,佘浩平,庞羽佳.基于改进Mask R-CNN的卫星目标部位检测方法[J].计算机测量与控制,2021,29(11):12-17. 被引量：2
5李慕梓,戴连君,万盛,张天舒,徐桂花.残疾人信息融合模式探索[J].人口与发展,2022,28(3):156-160. 被引量：2
6陈皓炜,贾新春,孙小明,侯鹏飞.SCR脱硝系统的强化学习复合串级控制[J].动力工程学报,2022,42(5):421-428. 被引量：11
7李岩,唐睢睢.面向高速公路运管的车路协同云控平台架构设计[J].汽车实用技术,2022,47(17):56-61. 被引量：3
8聂梓润,徐野,哈乐.基于强化学习虚拟链路驾驶行为仿真环境研究[J].工业控制计算机,2022,35(11):128-130.
9武子豪,张司雨,董仕鹏,毛亮.机器学习在纳米材料风险评估中的应用[J].生态毒理学报,2022,17(5):139-151. 被引量：2
10华一丁,孙航,张淼.自动驾驶测试场景标准化工作思考与展望[J].中国汽车,2023(7):3-6.

1沈国麟,张锦涛.模拟效果:新一代人工智能技术与国际传播效能提升[J].新媒体与社会,2024(2):14-25. 被引量：1
2张青琳,李然,张瑞婷.数字时代首都中小学生跨学科主题活动实践探索[J].中小学信息技术教育,2024(12):38-39.
3岳圣淞.制度性权力场域下的大国博弈与中国国际话语权的提升[J].亚太安全与海洋研究,2024(6):20-35.
4金圣悠.天天不见了(大班)[J].幼儿教育,2024(34):39-41.
5陆源,孙梅.5G共建共享背景下的承载网安全策略及演进方案[J].山东通信技术,2024,44(3):10-13.
6梁利华.近十年社会治理研究的脉络与展望[J].高等学校文科学术文摘,2024,41(10):145-146.
7郭子浩.高中语文古诗词群文阅读议题探究[J].文教资料,2024(13):132-135.
8张丽.公共图书馆文旅融合研究的主要方向和未来热点[J].当代图书馆,2024(4):59-65.
9谢非.面向智能设备的模组端侧大模型优化技术[J].中国宽带,2024,20(2):106-108.
10苏杭.医院财务会计转型为管理会计的策略[J].财经界,2024(18):114-116.

中国图象图形学报

2024年第11期

浏览历史

内容加载中请稍等...

端到端自动驾驶系统研究综述

参考文献3

二级参考文献35

共引文献29

相关作者

相关机构

相关主题

浏览历史