期刊文献+

基于深度强化学习的智能车辆行为决策研究 被引量:2

Intelligent Vehicles Behavior Decision-making Based on Deep Reinforcement Learning
下载PDF
导出
摘要 自动驾驶车辆决策系统直接影响车辆综合行驶性能,是实现自动驾驶技术需要解决的关键难题之一。基于深度强化学习算法DDPG(deep deterministic policy gradient),针对此问题提出了一种端到端驾驶行为决策模型。首先,结合驾驶员模型选取自车、道路、干扰车辆等共64维度状态空间信息作为输入数据集对决策模型进行训练,决策模型输出合理的驾驶行为以及控制量,为解决训练测试中的奖励和控制量突变问题,改进DDPG决策模型对决策控制效果进行优化,并在TORCS(the open racing car simulator)平台进行仿真实验验证。结果表明:所提出的决策模型可以根据车辆和环境实时状态信息输出合理的驾驶行为以及控制量,与DDPG模型相比,改进的模型具有更好的控制精度,且车辆横向速度显著减小,车辆舒适性以及车辆稳定性明显改善。 Autonomous driving vehicle decision-making system has direct influence on driving performance.It is one of the key challenges to be addressed to realize fully autonomous driving.To solve this problem,a driving decision-making system based on deep reinforcement learning algorithm deep deterministic policy gradient(DDPG)was proposed.Firstly,a total of 64 dimensions of state spaces information such as ego vehicle information,road information and obstacle vehicle information on the basis of a driver model were selected as input variables of the constructed model.Then the decision-making was trained and outputs reasonable driving behaviors and control variable values.Finally,aiming at the problems of reward value and control variable values saltation,the DDPG decision model was improved to optimize decision control effect.To verify the performance of the proposed decision making model,simulation experiments were conducted on the open racing car simulator(TORCS)platform.The results show that the proposed decision-making model can output reasonable driving behaviors and accurate control quantities based on real-time state information of vehicles and environment.Compared with the DDPG model,the improved decision-making model has better control accuracy,significantly reduces vehicle lateral speed,improves vehicle comfort and stability.
作者 周恒恒 高松 王鹏伟 崔凯晨 张宇龙 ZHOU Heng-heng;GAO Song;WANG Peng-wei;CUI Kai-chen;ZHANG Yu-long(School of Transportation and Vehicle Engineering,Shandong University of Technology,Zibo 255000,China)
出处 《科学技术与工程》 北大核心 2024年第12期5194-5203,共10页 Science Technology and Engineering
基金 国家自然科学基金(52102465)。
关键词 自动驾驶 行为决策 深度强化学习 深度确定性策略梯度算法 autonomous driving behavior decision-making deep reinforcement learning deep deterministic policy gradient
  • 相关文献

参考文献7

二级参考文献130

共引文献256

同被引文献12

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部