Application of an Improved DDPG Algorithm to Autonomous Driving (cited by: 24)

Self-Driving Via Improved DDPG Algorithm
Abstract: Deep Deterministic Policy Gradient (DDPG), a classic deep reinforcement learning algorithm, is well suited to continuous control problems and has been applied to autonomous driving. DDPG does not filter policy actions, so a high proportion of its policies are illegal, which lowers training efficiency and slows convergence. To address this, the paper proposes a DDPG algorithm based on failure experience correction. The algorithm splits the experience pool into a success pool and a failure pool, selects failure data for training according to driving performance, converts the single output of the policy network into throttle and brake control values, and improves the exploration strategy with normally distributed noise. Simulation experiments on the TORCS platform show that, compared with DDPG and DQN (Deep Q-learning Network), the proposed algorithm markedly improves training efficiency and reduces illegal driving policies to zero.
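The three mechanisms named in the abstract (separate success and failure pools, a single actor output mapped to throttle and brake, and Gaussian exploration noise) can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual formulas, so everything here is an assumption made for illustration: the class and function names, the `failure_fraction` sampling knob, the `[-1, 1]` action range, and the sign-based pedal split are hypothetical and are not the authors' implementation.

```python
import random
from collections import deque


class SplitReplayBuffer:
    """Two experience pools: one for successful and one for failed transitions."""

    def __init__(self, capacity=10000, seed=None):
        self.success = deque(maxlen=capacity)
        self.failure = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def add(self, transition, failed):
        # Route each transition to the pool matching its outcome.
        (self.failure if failed else self.success).append(transition)

    def sample(self, batch_size, failure_fraction):
        # failure_fraction is a hypothetical knob: the share of the batch
        # drawn from the failure pool, e.g. raised when driving is poor.
        n_fail = min(len(self.failure), round(batch_size * failure_fraction))
        n_succ = min(len(self.success), batch_size - n_fail)
        batch = self.rng.sample(list(self.failure), n_fail)
        batch += self.rng.sample(list(self.success), n_succ)
        self.rng.shuffle(batch)
        return batch


def split_pedals(a):
    """Map one actor output a in [-1, 1] to (throttle, brake).

    Assumed mapping: positive values accelerate, negative values brake,
    so throttle and brake are never applied at the same time.
    """
    a = max(-1.0, min(1.0, a))
    return (a, 0.0) if a >= 0 else (0.0, -a)


def explore(a, sigma, rng):
    """Add zero-mean Gaussian noise to an action and clip to [-1, 1]."""
    return max(-1.0, min(1.0, a + rng.gauss(0.0, sigma)))
```

Under this assumed mapping, an output of `0.6` means 60% throttle with no brake and `-0.25` means 25% brake with no throttle, which is one way a single output can rule out the simultaneous throttle-and-brake actions that count as illegal driving policies.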
Authors: ZHANG Bin; HE Ming; CHEN Xiliang; WU Chunxiao; LIU Bin; ZHOU Bo (College of Command and Control Engineering, The Army Engineering University of PLA, Nanjing 210002, China; Institute of Network Information, Academy of Systems Engineering, Academy of Military Sciences, Beijing 100071, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2019, No. 10, pp. 264-270 (7 pages)
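Funding: National Key Research and Development Program of China (No. 2016YFC0800606, No. 2016YFC0800310); Key Consulting Project of the Chinese Academy of Engineering (No. 2017-XZ-05); Natural Science Foundation of Jiangsu Province (No. BK20150721, No. BK20161469); China Postdoctoral Science Foundation (No. 2015M582786, No. 2016T91017); Key Research and Development Program of Jiangsu Province (No. BE2015728, No. BE2016904); Jiangsu Province Science and Technology Infrastructure Construction Program (No. BM2014391)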
Keywords: deep reinforcement learning; self-driving; DDPG algorithm; experience pool dividing; TORCS