期刊文献+

基于切换模型的两交叉口信号灯Q学习协调控制 被引量:4

Based on Switching-model Cooperative Control for Two Intersections Applying Q-Learning
下载PDF
导出
摘要 为了解决两交叉口信号灯协调控制,在建立两交叉口切换模型的基础上,提出了组合相位的概念,将两交叉口形式上转换为单交叉口的形式,采用解决单个智能体和环境交互的Q学习算法实现了两交叉口之间的协调控制。应用Paramics微观交通仿真软件进行了算法仿真,仿真结果验证了该方法的有效性。 In order to realize the cooperative control for two adjacent intersections, the signal control for two intersections is put forward based on the method, which the traffic signal control for an isolated intersection has been resolved by Q-Learning. According to the switching-model for two intersections, the combination phase of two intersections is advanced in order to transform two intersections to a single one. So the cooperation control for two intersections can be resolved by general Q-Learning. It is illustrated by Paramics simulation software that the method is effective and feasible.
出处 《北京工业大学学报》 EI CAS CSCD 北大核心 2007年第11期1148-1152,1157,共6页 Journal of Beijing University of Technology
基金 北京工业大学科技创新平台建设资助项目(05002011200605) 北京市人才强教计划中青年骨干教师资助项目(05002011200606)
关键词 Q学习算法 两交叉口切换模型 组合相位 两交叉口协调控制 Q-learning algorithm the switching-model for two intersections combination-phase cooperation control for two intersections
  • 相关文献

参考文献7

二级参考文献7

  • 1Peng J,博士学位论文,1993年
  • 2Sutton R S. Introduction: The challenge of reinforcement learning[J]. Machine Learning, 1992, 8: 225-227
  • 3LIN Long_Ji. Self_improving reactive agents based on reinforcement learning, planning and teaching[J]. Machine Learning, 1992, 8: 69-97
  • 4Watkins C J C H. Technical notes:Q_learning[J]. Machine Learning, 1992, 8: 55-68
  • 5He Guoguang,Noeth G. Urban traffic control system-A general analysis from the point of view of control theory[A]. Transportation Systems: Theory and Application of Advanced Technology[C]. Oxford:PERGAMON Press,1997. 518-521
  • 6俞星星,阎平凡.强化学习系统及其基于可靠度最优的学习算法[J].信息与控制,1997,26(5):332-339. 被引量:3
  • 7马寿峰,贺国光,刘豹.一种通用的城市道路交通流微观仿真系统的研究[J].系统工程学报,1998,13(4):8-15. 被引量:35

共引文献116

同被引文献31

  • 1范立权,陈阳舟,李振龙.基于混杂模糊切换的快速路区域协调控制研究[J].交通信息与安全,2009,27(S1):44-48. 被引量:2
  • 2赵晓华,陈阳舟.基于混杂系统理论的单交叉口信号灯控制[J].北京工业大学学报,2004,30(4):412-416. 被引量:5
  • 3卢燕俊,戴华平.城市交通网络的混杂Petri网建模[J].浙江大学学报(工学版),2007,41(6):930-934. 被引量:11
  • 4张辉,杨玉珍.基于分布式Q学习的区域交通协调控制研究[J].系统仿真学报,2006(10).
  • 5齐驰,侯忠生.信号灯区域自组织控制[J].
  • 6Júlvez J,Boel R K.A continuous Petri net approach for model predictive control of traffic systems[J].IEEE Transactions on Systems,Man and Cybernetics,Part A:Systems and Humans,2010,40(4):686-697.
  • 7Dotoli M,Fanti M P,Iacobellis G.Validation of an urban traffic network model using colored timed Petri nets[C].IEEE International Conference on Systems,Man and Cybernetics.2005,1347-1352.
  • 8Young Woo Kim,Tatsuya Kato.Traffic network control based on hybrid dynamical system modeling and mixed integer nonlinear progaming with convexity analysis[J].IEEE Transactions on Systems,Man and Cybernetics,Part A:Systems and Humans,2008,38(2):346-357.
  • 9Noortje Groot,Bart De Schutter.Model-based traffic and emission control using PWA models-a mixed-logical dynamic approach[C].2011 14th International IEEE Conference on Intelligent Transportation Systems.Washington,2011:2142-2147.
  • 10R R Negenborn,De Schutter.Multi-agent model predictive control for transportation networks with continuous and discrete elements[C].Proceedings of the 11th IFAC Symposium on Control in Transportation Systems.Delft,2006:609-614.

引证文献4

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部