面向交通信号的两层递阶控制解决方案被引量：1

Two-layer hierarchical control solutions for traffic signal

下载PDF

导出

摘要针对现有交通信号控制系统的诸多不足,提出了一种用于交通信号控制的两层递阶多Agent系统解决方案。通过将交通网络进行区域划分,利用底层Agent控制各交叉口,顶层Agent控制区域,从而实现两层递阶控制。底层Agent采用经典Q学习同步学习最优策略,顶层Agent利用Tile Coding非凡的连续空间处理能力,实现Q学习的动作值函数逼近方法。仿真实验结果表明,该分层递阶控制不但提高了交通信号控制系统效率,而且也为大规模应用提供了很好的可伸缩解决方案。 In view of the existing deficiencies of traffic signal control system, this paper proposes two-layer hierarchical multi-Agent system solution for traffic signal control. Through regional division of the traffic network, it uses the bottom level Agent to control the intersection, the top level Agent to control areas, so as to achieve the two-layer hierarchical con-trol. The bottom level Agent uses the classical Q-learning to synchronize the optimal strategy, the top level Agent utilizes the special continuous space processing ability of Tile Coding to achieve Q learning of action value function approxima-tion method. The simulation test results show that, the hierarchical control not only improves the efficiency of traffic signal control system, but also provides a good scalable solution for large-scale applications.

作者戈军周莲英

机构地区宿迁学院计算机科学系江苏大学计算机科学与通信工程学院

出处《计算机工程与应用》 CSCD 北大核心 2015年第20期246-252,共7页 Computer Engineering and Applications

基金江苏省宿迁市科技创新专项基金(No.Z201211) 宿迁学院重点科研基金(No.2013KY15)

关键词多AGENT系统递阶控制交通信号 Q-学习 Tile Coding multi-Agent systems hierarchical control traffic signals Q-learning Tile Coding

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献21

1Li H G,Li Z,Robert T,et al.A real-time transportation prediction system[J].Applied Intelligence,2013,39(4):793-804.
2Chen B,Cheng H H.A review of the applications of agent technology in traffic and transportation systems[J].IEEE Transactions on Intelligent Transportation Systems,2010,11(2):485-497.
3Chen B,Cheng H H,Palen J.Integrating mobile agent technology with multi-agent systems for distributed traffic detection and management systems[J].Transportation Research Part C:Emerging Technologies,2009,17(1):1-10.
4Bazzan A.Opportunities for multiagent systems and multiagent reinforcement learning in traffic control[J].Autonomous Agents and Multi-Agent Systems,2009,18(3):342-375.
5Roozemond D A.Using intelligent agents for pro-active,real-time urban intersection control[J].European Journal of Operational Research,2001,131(2):293-301.
6Cai C Q,Yang Z S.Study on urban traffic management based on multi-agent system[C]//Proceedings of the 6th International Conference on Machine Learning and Cybernetics,Hong Kong,China:IEEE,2007:25-29.
7Chen C,Li Z J.A hierarchical networked urban traffic signal control system based on multi-agent[C]//Proceedings of the 9th IEEE International Conference on Networking,Sensing and Control(ICNSC).New York:IEEE,2012:28-33.
8Srinivasan D,Choy M C,Cheu R L.Neural networks for realtime traffic signal control[J].IEEE Transactions on Intelligent Transportation Systems,2006,7(3):261-272.
9Gregoire P,Desjardins C,Laumonier J,et al.Urban traffic control based on learning agents[C]//Proceedings of Intelligent Transportation Systems Conference.New York:IEEE,2007:916-921.
10Weiring M A.Multi-agent reinforcement learning for traffic light control[C]//Proceedings of the 7th International Conference on Machine Learning(ICML2000).San Francisco:Morgan Kaufmann Publishers Incorporation,2000:1151-1158.

二级参考文献17

1Sutton R S,Barto A G.Reinforcement Learning:An Introduction[M].MIT Press,1998.
2Busoniu L,Babuska R,DeSchutter B,et al.Reimforcement Leaming and Dynamic Programming Using Function Approximators[M].Boca Raton,FL:CRC Press,2010.
3Grondman I,Busoniu L,et al.A Survey of Actor-Critic Reinforcement Learning:Standard and Natural Policy Gradients[J].IEEE Transactions on Systems,Man,and Cybernetics—Part C:Applications and Reviews,2012,42(6):1291-1307.
4Barto A G,Sutton R S,Anderson C W.Neuronlike Adaptive Element That Can Solve Difficult Learning Control Problems[J].IEEE Trans Syst Man Cybem,1983,13:834-846.
5Konda V R,Tsitsiklis J N.Actor-Critic Algorithms[C]// Proceedings of Advances in Neural Information Processing Systems.2000.
6Rosenstein M T,Barto A G.Supervised Learning Combined with an Actor-Critic Architecture[J].CMPSCI Technical Report 02-41.October 2002.
7Peters J,Schaal S.Natural actor-critic[J].Neurocomputing,2008,71(7-9):1180-1190.
8Bathnagar S,Sutton R S,Ghavamzadeh M,et al.Natural actor critic algorithms[J].Automatica,2009,45 (11):2471-2482.
9Vamvoudakis K G,Lewis F L.Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem[J].Automatica,2010,46(5):878-888.
10Grondman I,Vaandrager M,Busoniu L,et al.Efficient Model Learning Methods for Actor-Critic Control[J].IEEE Transactions on Systems Man and Cybernetics Part B-Cybernetics,2012,42(3):591-602.

共引文献27

1徐琳琳,张树美,赵俊莉.构建并行卷积神经网络的表情识别算法[J].中国图象图形学报,2019,24(2):227-236. 被引量：51
2钟珊,刘全,傅启明,章宗长,朱斐,龚声蓉.一种近似模型表示的启发式Dyna优化算法[J].计算机研究与发展,2015,52(12):2764-2775. 被引量：4
3王准,何元烈.基于混合价值计算的云存储缓存替换方案[J].计算机工程与设计,2017,38(6):1651-1656. 被引量：4
4刘全,翟建伟,钟珊,章宗长,周倩,章鹏.一种基于视觉注意力机制的深度循环Q网络模型[J].计算机学报,2017,40(6):1353-1366. 被引量：20
5马技,李晶皎,李珍妮.基于视觉注意机制深度强化学习的行人检测方法[J].中国科技论文,2017,12(14):1570-1577. 被引量：10
6刘全,翟建伟,章宗长,钟珊,周倩,章鹏,徐进.深度强化学习综述[J].计算机学报,2018,41(1):1-27. 被引量：470
7唐丽丽,朱海军,朱斐.一种基于核的在线策略梯度算法[J].新疆大学学报（自然科学版）,2018,35(2):209-216.
8吴宏杰,杨茹,傅启明,陈建平,陆卫忠.基于强化学习的HP模型优化方法研究[J].计算机工程与应用,2019,55(12):132-139. 被引量：1
9刘全,闫岩,朱斐,吴文,张琳琳.一种带探索噪音的深度循环Q网络[J].计算机学报,2019,42(7):1588-1604. 被引量：11
10朱斐,吴文,伏玉琛,刘全.基于双深度网络的安全深度强化学习方法[J].计算机学报,2019,42(8):1812-1826. 被引量：26

同被引文献10

1朱亚华,刘秉瀚.城市平面交叉路口微观仿真软件设计[J].福州大学学报（自然科学版）,2008,36(1):64-68. 被引量：1
2李茜,李铁柱.公交专用道绿波信号设置及仿真模拟分析[J].佛山科学技术学院学报（自然科学版）,2009,27(1):38-42. 被引量：2
3王静波.交通控制信号优化模型的仿真研究[J].计算机仿真,2011,28(4):353-357. 被引量：6
4张剑,董力耘.考虑预期效应和交通灯影响的城市道路交通元胞自动机模型[J].上海大学学报（自然科学版）,2011,17(5):642-647. 被引量：4
5吕庆,方勇纯,任逍.加速抑制随机初态误差影响的迭代学习控制[J].自动化学报,2014,40(7):1295-1302. 被引量：19
6周昊,阮太元,刘智勇.定周期单路口绿信比的迭代学习控制方法[J].五邑大学学报（自然科学版）,2015,29(4):57-61. 被引量：1
7池荣虎,侯忠生,黄彪.间歇过程最优迭代学习控制的发展:从基于模型到数据驱动[J].自动化学报,2017,43(6):917-932. 被引量：25
8李娣娜,黄同,薛娓娓.一种十字路口交通灯智能控制系统的设计[J].科技资讯,2016,14(22):1-1. 被引量：7
9张自荷.基于VISSIM仿真的平面信号交叉口交通组织优化[J].山东工业技术,2018(21):143-143. 被引量：7
10徐文龙,郭杜杜,马倩雯,巴合达吾列提.热阿汗.平交口信控配时方案的优化与验证[J].西部交通科技,2015(2):62-65. 被引量：3

引证文献1

1姚斐,宋芳.交通信号灯迭代学习控制方法[J].软件导刊,2020,19(8):95-99. 被引量：1

二级引证文献1

1周艳玲,张云翔,曹晶.基于Netlogo的智能交通灯可控系统的研究与仿真[J].榆林学院学报,2022,32(6):71-75. 被引量：1

1谢超,高大启.一种基于傅里叶变换的RBF神经网络函数逼近方法[J].计算机工程与科学,2005,27(2):47-49. 被引量：3
2沈志华,赵英凯,王晓荣.全自主移动机器人分层递阶控制研究[J].微计算机信息,2006(03Z):190-191.
3陶永华.自动化技术专题讲座(Ⅳ)──智能控制技术与应用[J].基础自动化,1997,4(3):55-60. 被引量：2
4王科俊,王克成,金鸿章,李国斌.智能控制发展的新趋势——多种智能控制方法的综合[J].黑龙江自动化技术与应用,1996,15(4):5-8. 被引量：1
5蒋良孝,李超群.基于BP神经网络的函数逼近方法及其MATLAB实现[J].微型机与应用,2004,23(1):52-53. 被引量：49
6朱从民,黄玉美,上官望义.移动机器人Java Agent控制系统设计[J].计算机工程与应用,2009,45(5):74-77. 被引量：2
7李旭,张为公,董晓马.驾驶机器人关键技术的研究[J].华中科技大学学报（自然科学版）,2004,32(S1):204-206.
8刘勇.协同学习中分组策略与同步学习方法的研究[J].昌吉学院学报,2009(1):98-101. 被引量：7
9许强.分层递阶控制技术在集控站中的应用[J].商品与质量（理论研究）,2011(6):237-237.
10席先杰.“混合学习”教学模式的高职静态网页设计课程实践[J].福建电脑,2014,30(9):147-150.

计算机工程与应用

2015年第20期

浏览历史

内容加载中请稍等...

面向交通信号的两层递阶控制解决方案被引量：1

参考文献21

二级参考文献17

共引文献27

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

面向交通信号的两层递阶控制解决方案 被引量：1

参考文献21

二级参考文献17

共引文献27

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

面向交通信号的两层递阶控制解决方案被引量：1