Research on autonomous vehicle U-turn problem based on hierarchical reinforcement learning
Cited by: 1
Abstract The U-turn is one of the tasks studied in autonomous driving, and most solutions designed for standard urban roads cannot be applied on non-standard roads. To address this, the paper establishes a vehicle U-turn dynamics model and designs a multi-scale convolutional neural network that extracts feature maps as the agent's input. To handle the sparse-reward problem in the U-turn task, it also proposes a hierarchical proximal policy optimization algorithm that combines hierarchical reinforcement learning with proximal policy optimization. In experiments on simple and complex scenarios, the algorithm learns a policy faster than the other algorithms compared and achieves a higher U-turn success rate.
Authors Cao Jie (曹洁); Shao Zixuan (邵紫旋); Hou Liang (侯亮) (Dept. of Computer & Communication, Lanzhou University of Technology, Lanzhou 730050, China)
Source Application Research of Computers (《计算机应用研究》), CSCD, Peking University Core Journal, 2022, No. 10, pp. 3008-3012, 3045 (6 pages)
Keywords hierarchical reinforcement learning; car U-turn; sparse rewards; proximal policy optimization
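
The abstract names proximal policy optimization as the base learner but gives no detail of the hierarchical variant. For readers unfamiliar with the base method, the following is a minimal sketch of the standard PPO clipped surrogate objective only. It is not the paper's hierarchical algorithm; the function name `ppo_clip_objective` and the default `clip_eps=0.2` are illustrative assumptions.

```python
def ppo_clip_objective(ratios, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    ratios:     new-policy / old-policy probability ratios, one per sample
    advantages: advantage estimates for the same samples
    clip_eps:   clipping range (0.2 is an assumed, commonly used default)
    """
    if len(ratios) != len(advantages) or not ratios:
        raise ValueError("ratios and advantages must be non-empty and equal length")
    total = 0.0
    for r, a in zip(ratios, advantages):
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped_r = min(max(r, 1.0 - clip_eps), 1.0 + clip_eps)
        # Taking the minimum removes the incentive to push the ratio
        # beyond the clipping range.
        total += min(r * a, clipped_r * a)
    return total / len(ratios)
```

With a positive advantage, a ratio above `1 + clip_eps` earns no extra objective value, which keeps each policy update close to the previous policy.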
