期刊文献+

基于深度确定性策略梯度的智能车汇流模型 被引量:4

Traffic Merging Model for Intelligent Vehicle Based on Deep Deterministic Policy Gradient
下载PDF
导出
摘要 采用离散动作空间描述速度变化的智能车汇流模型不能满足实际车流汇入场景的应用要求,而深度确定性策略梯度(DDPG)结合策略梯度和函数近似方法,采用与深度Q网络(DQN)相同的网络结构,并使用连续动作空间对问题进行描述,更适合描述智能车速度变化。为此,提出一种基于DDPG算法的智能车汇流模型,将汇流问题转化为序列决策问题进行求解。实验结果表明,与基于DQN的模型相比,该模型的收敛速度较快,稳定性和成功率较高,更适合智能车汇入车辆场景的应用。 Traffic merging models for intelligent vehicle that use discrete action space to describe changing speed cannot meet the application requirements of actual traffic merging scenarios.Deep Deterministic Policy Gradient(DDPG),which integrates policy gradient with function approximation methods and adopts the same network structure as Deep Q-Network(DQN),uses continuous action space for problem description.So DDPG is more suitable for describing the changing speed of intelligent vehicles.On this basis,this paper proposes a traffic merging model for intelligent vehicles based on the DDPG algorithm,reducing the traffic merging problem to a sequence decision problem to be resolved.Experimental results show that compared with DQN-based models,the proposed model has a faster convergence speed,higher reliability and a higher success rate,which means it is more applicable to traffic merging scenarios of intelligent vehicle.
作者 吴思凡 杜煜 徐世杰 杨硕 杜晨 WU Sifan;DU Yu;XU Shijie;YANG Shuo;DU Chen(Smart City College,Beijing Union University,Beijing 100101,China;College of Robotics,Beijing Union University,Beijing 100101,China;Beijing Key Laboratory of Information Service Engineering,Beijing Union University,Beijing 100101,China)
出处 《计算机工程》 CAS CSCD 北大核心 2020年第1期87-92,共6页 Computer Engineering
基金 国家自然科学基金(91420202)
关键词 智能车 汇流 深度确定性策略梯度 深度Q网络 连续动作空间 intelligent vehicle traffic merging Deep Deterministic Policy Gradient(DDPG) Deep Q-Network(DQN) continuous action space
  • 相关文献

参考文献6

二级参考文献80

共引文献531

同被引文献46

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部