期刊文献+

引入谈判博弈的Q-学习下的城市交通信号协调配时决策 被引量:4

Urban Traffic Signal Timing Decision Based on Q-learning with Negotiation Game Mechanism
下载PDF
导出
摘要 由于城市交通路网中交叉口间交通信号决策是相互影响的,并且车联网技术使得交叉口交通信号配时agent间能进行直接交互,此决策问题可用博弈框架来描述。建立了城市路网中相邻交叉口间交通流关联模型,通过嵌入谈判博弈模型来设计Q-学习方法,此方法中利用谈判参考点来进行配时行为的选择。仿真实验分析表明,相对于无协调的Q-学习算法,谈判博弈Q-学习取得更好的控制效果和稳定性能。谈判博弈Q-学习在处理交通拥挤及干扰交通流时,能根据交通条件灵活地改变交通信号配时决策,具有较强的适应能力。 Because the traffic signal decision between intersection in urban traffic network is interactional,and internet of vehicles can make the intersection traffic signal agent interact directly,this decision problem can be described by the game framework.A traffic flow correlation model between adjacent intersections in urban traffic network was established,and Q-learning method was designed by embedding negotiation game model where negotiation reference point was used to choose timing behavior.The simulation experiment shows that the negotiation game Q-learning achieves better control effect and stability performance compared with the uncoordinated Q-learning algorithm.When dealing with disturbing and congested traffic flow,negotiation game Q-learning has the flexibility to change the traffic signals according to the traffic conditions and necessity.
作者 夏新海 许伦辉 XIA Xin-hai;XU Lun-Hui(Department of Port and Shipping Management,Guangzhou Maritime University,Guangzhou 510725,China;School of Civil Engineering and Transportation,South China University of Technology,Guangzhou 510640,China)
出处 《科学技术与工程》 北大核心 2018年第33期108-116,共9页 Science Technology and Engineering
基金 广东省自然基金(2016A030310104) 广东省科技计划(2015B010129017)资助
关键词 谈判博弈 Q-学习 交通信号 配时决策 negotiation game Q-learning traffic signal timing decision
  • 相关文献

参考文献5

二级参考文献53

  • 1段后利,李志恒,张毅,胡坚明.交通控制子区动态划分模型[J].吉林大学学报(工学版),2009,39(S2):13-18. 被引量:12
  • 2沈国江,孙优贤.城市交通干线递阶模糊控制及其神经网络实现[J].系统工程理论与实践,2004,24(4):99-105. 被引量:40
  • 3王飞跃.平行系统方法与复杂系统的管理和控制[J].控制与决策,2004,19(5):485-489. 被引量:330
  • 4王正武,罗大庸,黄中祥,张航.线控系统协调优化模型及其改进粒子群算法研究[J].系统工程理论与实践,2007,27(10):165-171. 被引量:8
  • 5Diakaki C. Integrated control of traffic flow in corridor road networks[D]. Chania: Technical University of Crete, 1999.
  • 6Diakaki C, Papageorgiou M, Aboudolas K. A multivariable regulator approach to traffic responsive network-wide signal control[J]. Control Engineering Practice, 2002, 10(1): 183-195.
  • 7Diakaki C, Dinopoulou V, Aboudolas K, et al. Extensions and new applications of the trafficresponsive urban control strategy: Coordinated signal control for urban networks [J]. Transportation Research Record, 2003, 1856:202-211.
  • 8Lin S, De Schutter B, Xi Y, et al. An efficient model- based method for coordinated control of urban traffic networks[C]. Proc of the 2010 IEEE Int Conf on Networking, Sensing and Control. Chicago: IEEE Press, 2010: 8-13.
  • 9Lin S, De Schutter B, Xi Y, et al. Efficient network- wide model-based predictive control for urban traffic networks[J]. Transportation Research Part C, 2012, 24(9): 122-140.
  • 10Lin S, De Schutter B, Xi Y, et al. Fast model predictive control for urban road networks via MILP[J]. IEEE Trans on Intelligent Transportation System, 2011, 12(3): 846- 856.

共引文献90

同被引文献23

引证文献4

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部