
Anti-Collision Control Strategy for Unmanned Vehicles Based on the DDPG Algorithm (基于DDPG算法的无人车辆防碰撞控制策略). Cited by: 9
Abstract: Reinforcement learning is now widely used in autonomous driving, but improving the stability of unmanned vehicles while completing both path tracking and obstacle avoidance under different operating conditions remains a difficult problem. To meet these path-tracking and obstacle-avoidance requirements, this paper proposes an anti-collision control strategy for unmanned vehicles based on the deep deterministic policy gradient (DDPG) algorithm. First, the inputs and outputs of the control system are derived from the principle of the DDPG algorithm and the vehicle control model, and a lane-change trajectory planning method based on the sine function is proposed to improve the vehicle's obstacle-avoidance capability. Second, a neural network controller is designed from these inputs and outputs and its policy exploration scheme is studied, and a reward shaping scheme based on a logarithmic function is proposed to address the sparse-reward problem. Finally, simulation experiments show that the DDPG-based control strategy controls the vehicle more safely and stably in path-tracking and obstacle-avoidance tasks, with higher control accuracy.
Authors: LAI Jin-ping (赖金萍); LI Hao (李浩); SHI Ying (石英); XU La-mei (徐腊梅); YAN Hao (闫浩). Affiliations: School of Automation, Wuhan University of Technology, Wuhan 430070, China; Tianjin Port Information Technology Development Co., Ltd., Tianjin 300456, China.
Source: Journal of Wuhan University of Technology (《武汉理工大学学报》), indexed in CAS, 2021, No. 10, pp. 68-76 (9 pages).
Funding: National Natural Science Foundation of China (51805388).
Keywords: unmanned vehicle; reinforcement learning; DDPG; path tracking; anti-collision
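
The abstract names two design elements without giving their formulas: a sine-based lane-change trajectory and a logarithmic reward shaping scheme. The sketch below illustrates one common form of each and is not taken from the paper. The function names, the parameters `length`, `width`, and `scale`, the specific trajectory shape, and the choice of lateral error as the reward input are all assumptions made for illustration.

```python
import math


def sine_lane_change(x, length, width):
    """Lateral offset at longitudinal distance x (assumed form, not the paper's).

    The offset rises smoothly from 0 to `width` over `length`, and its slope
    with respect to x is zero at both ends of the maneuver.
    """
    s = min(max(x / length, 0.0), 1.0)  # normalized progress, clamped to [0, 1]
    return width * (s - math.sin(2.0 * math.pi * s) / (2.0 * math.pi))


def log_shaped_reward(lateral_error, scale=1.0):
    """Dense reward (assumed form): zero at zero error, falling off
    logarithmically as the absolute tracking error grows."""
    return -math.log1p(scale * abs(lateral_error))


if __name__ == "__main__":
    # Hypothetical maneuver: 3.5 m lane width covered over 50 m of travel.
    for x in (0.0, 12.5, 25.0, 37.5, 50.0):
        y_ref = sine_lane_change(x, length=50.0, width=3.5)
        # Reward for a vehicle that has not yet left the original lane (y = 0).
        print(x, y_ref, log_shaped_reward(y_ref - 0.0))
```

A shaped reward of this kind returns a signal at every step instead of only at collision or task completion, which is the usual motivation for reward shaping when rewards are sparse.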