基于行人安全的交通信号灯智能控制算法研究

Research on Intelligent Control Algorithm of Traffic Light Based on Pedestrian Safety

下载PDF

导出

摘要提出了一种基于深度确定性策略梯度(DDPG,deep deterministic policy gradient)的行人安全智能交通信号控制算法;通过对交叉口数据的实时观测,综合考虑行人安全与车辆通行效率,智能地调控交通信号周期时长,相位顺序以及相位持续时间,实现交叉路口安全高效的智能控制;同时,采用优先经验回放提高采样效率,加速了算法收敛;由于行人安全与车辆通行效率存在相互矛盾,研究中通过精确地设计强化学习的奖励函数,折中考虑行人违规引起的与车辆的冲突量和车辆通行的速度,引导交通信号灯学习路口行人的行为,学习最佳的配时方案;仿真结果表明在动态环境下,该算法在行人与车辆冲突量,车辆的平均速度、等待时间和队列长度均优于现有的固定配时方案和其他的智能配时方案。 An intelligent traffic signal control algorithm based on Deep Deterministic Policy Gradient(DDPG)with Pedestrian Safeis proposed.Through the real-time observation of intersection data,the pedestrian safety and vehicle traffic efficiency are comprehensively considered,and the cycle duration,phase sequence and phase duration of traffic signals are intelligently controlled,safe and efficient intelligent control of intersections is realized.Meanwhile,priority empirical replay is adopted to improve sampling efficiency and accelerate algorithm convergence.Due to the contradiction between pedestrian safety and vehicle traffic efficiency,the reward function of reinforcement learning isaccurately designed,the pedestrian-vehicle conflicts caused by pedestrian violations and the speed of vehicles is considerd,traffic light isguided to learn pedestrian behaviors at intersections,and the best timing scheme is learned.The simulation results show that in the dynamic environment,the algorithm in terms of the number of collisions between pedestrians and vehicles,the average speed of vehicles,waiting time and queue length are better than the existing fixed timing schemes and other intelligent timing schemes.

作者张乾隆胡智群肖海林 ZHANG Qianlong;HU Zhiqun;XIAO Hailin(School of Computer and Information Engineering,Hubei University,Wuhan 430062,China)

机构地区湖北大学计算机与信息工程学院

出处《计算机测量与控制》 2022年第4期114-120,共7页 Computer Measurement &Control

基金国家自然科学基金(61901163)。

关键词交通信号灯动态配时强化学习行人安全车辆效率优先经验回放 traffic signal light dynamic timing reinforcement learning pedestrian safety vehicle efficiency prioritized experience replay

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1颜文胜,吕红兵.基于Deep Q Networks的交通指示灯控制方法[J].计算机测量与控制,2021,29(6):93-97. 被引量：2
2戈军,周莲英.基于SARSA(λ)的实时交通信号控制模型[J].计算机工程与应用,2015,51(24):244-248. 被引量：8

二级参考文献24

1Zhu F,Ning J,Ren Y,et al.Optimization of image processing in video-based traffic monitoring[J].Elektron Elektrotech,2012,18(8):91-96.
2Baskar L D,Schutter B D,Hellendoorn H.Traffic management for automated highway systems using model-based predictive control[J].IEEE Transactions on Intelligent Transportation Systems,2012,3(2):838-847.
3Sutton R S,Barto A G.Reinforcement learning:an introduction[M].Cambridge:MIT Press,1998.
4Mase K,Yamamoto H.Advanced traffic control methods for network management[J].IEEE Communications Magazine,1990,28(10):82-88.
5Baskar L D,Schutter B D,Hellendoorn J,et al.Traffic control and intelligent vehicle highway systems:a survey[J].IET Intelligent Transport Systems,2011,5(1):38-52.
6Zegeye S,Schutter B D,Hellendoorn J,et al.A predictive traffic controller for sustainable mobility using parameterized control policies[J].IEEE Transactions on Intelligent Transportation Systems,2012,13(3):1420-1429.
7Chin Y K,Wei Y K,Wei L K,et al.Q-learning traffic signal optimization within multiple intersections traffic network[C]//Proceedings of the 6th UKSIM/AMSS European Symposium on Computer Modeling and Simulation(EMS’12),2012:343-348.
8Prashanth L A,Bhatnagar S.Reinforcement learning with function approximation for traffic signal control[J].IEEE Transactions on Intelligent Transportation Systems,2011,12(2):412-421.
9Wiewiora E.Potential-based shaping and Q-value initialization are equivalent[J].Journal of Artificial Intelligence Research,2003,19:205-208.
10Martin M.On-line support vector machine regression[C]//Proceedings of the European Conference on Machine Learning(ECML’02),2002:173-198.

共引文献8

1胡文伟,胡建强,李湛,周剑峰.基于强化学习算法的自适应配对交易模型[J].管理科学,2017,30(2):148-160. 被引量：17
2夏新海.交互协调强化学习下的城市交通信号配时决策[J].计算机工程与应用,2018,54(11):265-270. 被引量：3
3臧兆祥,李昭,王俊英,但志平.基于平均奖赏强化学习算法的零阶分类元系统[J].计算机工程与应用,2016,52(21):14-20. 被引量：1
4夏新海,许伦辉.引入谈判博弈的Q-学习下的城市交通信号协调配时决策[J].科学技术与工程,2018,18(33):108-116. 被引量：4
5刘智臣.一种基于5G云化机器视觉的交通控制系统研究[J].长江信息通信,2021,34(4):78-80. 被引量：2
6徐建闽,席嘉鹏.基于Q-强化学习的干道交叉口信号配时模型[J].广西大学学报（自然科学版）,2021,46(4):1036-1044. 被引量：1
7张国有,宋世峰.基于D3QN的交通灯控制优化[J].计算机与现代化,2023(7):30-35.
8陈靖宇,徐志林.VANET随机部署环境下基于改进型共享最近邻密度峰聚类的快速分簇算法[J].计算机测量与控制,2023,31(9):174-182.

1本刊综合.交通信号灯的那些事[J].发明与创新（高中生）,2022(4):14-15.
2王晨晨,崔文旭,赵韦婷,吴琦,孙万众,吴俊华.基于模糊神经网络的交通信号配时系统[J].电子技术（上海）,2021,50(8):20-22. 被引量：1
3刘骞.关于市政道路工程交叉口设计的研究[J].门窗,2022(7):96-98.
4吴志海,涂宇,刘晓熙,李玲.基于计划行为的行人违章过街行为意向研究[J].中国高新科技,2022(2):122-123.
5oneobject(设计).对女性友好的未来个人电动出行伴侣[J].工业设计,2022(3):19-19.
6张鑫辰,张军,刘元盛,路铭,谢龙洋.改进深度Q网络的无人车换道决策算法研究[J].计算机工程与应用,2022,58(7):266-275. 被引量：1
7幸有.你是未落定的昨日[J].花火（A版）,2022(3):66-72.
8宁雪辉,段元梅.基于单片机的十字路口交通信号控制系统设计[J].无线互联科技,2022,19(2):45-46. 被引量：2
9王浩聪,付主木,孙昊琛,陶发展,宋书中.改进深度Q学习的燃料电池混合动力汽车能量管理[J].河南科技大学学报（自然科学版）,2022,43(4):34-40. 被引量：4
10魏淑平.漫步云端智取未来[J].新教育（海南）,2021(31):15-16.

计算机测量与控制

2022年第4期

浏览历史

内容加载中请稍等...

基于行人安全的交通信号灯智能控制算法研究

参考文献2

二级参考文献24

共引文献8

相关作者

相关机构

相关主题

浏览历史