期刊文献+

GameTraffic:基于交通调度历史数据挖掘的路口最优调度及道路改造预测 被引量:1

GameTraffic:Optimal-traffic-scheduling and road-reconstruction based on the mining of historical traffic-scheduling data
原文传递
导出
摘要 最大化车流量和最小化平均等待时间是交通路口调度的目标.交通调度中各路口与其它路口间存在博弈关系,相邻路口间为使其自身利益最大化而存在策略间的相互协调.我们基于博弈论对交通系统进行建模,基于博弈均衡的增强学习算法对交通调度历史数据进行挖掘分析,学习得到交通路口的最优调度策略,并进行道路改造预测.展示了交通路口最优调度及道路改造预测系统GameTraffic,旨在为智能交通管理及决策提供一种科学的依据. The target of traffic intersection scheduling is to maximize the flow rates and minimize the average waiting time of all concerned vehicles.Game relationships exist among intersections in the traffic scheduling,and there is mutual coordination among strategies for the maximal profits of neighboring intersections.We model the traffic system based on the game theory,and then learn the optimal scheduling strategies by mining the historical traffic-scheduling data based on the reinforcement learning algorithm.In this paper,we demonstrate the system for traffic-scheduling and road-reconstruction,called GameTraffic,in order to provide a scientific basis for intelligent traffic management and corresponding decision support.
出处 《云南大学学报(自然科学版)》 CAS CSCD 北大核心 2010年第S1期345-349,354,共6页 Journal of Yunnan University(Natural Sciences Edition)
基金 国家自然科学基金资助项目(60763007 60933001) 云南省应用基础研究资助项目(2008CD083) 云南省教育厅科研资助项目(08Y0023) 云南大学科研资助项目(2009F32Q) 云南大学中青年骨干教师培养计划
关键词 交通调度 博弈论 增强学习 最优调度 道路改造预测 traffic scheduling game theory reinforcement learning optimal scheduling road-reconstruction prediction
  • 相关文献

参考文献1

二级参考文献13

  • 1NAGEL K. Traffic networks [ M].Handbook of Graphs and Networks ,2002,248-272.
  • 2LIU W Y, LI J, YUE K, et al. An approach for solving fuzzy games [ J ]. Int' l Journal of Uncertainty, Fuzziness, and Knowledge - Based Systems, 2006,14 (3) : 277 -292.
  • 3MINSKY J F. Theory of neural - analog reinforcement systems and its application to the brain - model problem [ D ]. America : Princeton University, 1954.
  • 4WATKINS C. Learning with delayed rewards [ D]. England : Cambridge University, 1989.
  • 5LITTMAN M, BOYAN J. A distributed reinforcement learning scheme for network routing [ C ]//Proc. of the 1st Int' l workshop on Application of Neural Networks to Telecommunication, 1993:45-51.
  • 6CRITES R H, BARTO A G. Improving elevator performance using reinforcement learning[ C ]//Proc. of NIPS' 1996 : 1 017-1 023.
  • 7HU J L, MICHAEL P M. Nash Q - learning for general -sum stochastic games [ J ]. Machine Learning Research,2003,4:1 039-1 069.
  • 8WIERING M A. Multi - agent reinforcement learning for traffic light control [ C ]//Proc. of ICML' 2000: 1 151-1 155.
  • 9KAEBLING L P, LITTMAN M L, MOORE A W. Reinforcement learning : a survey [ J ]. Artificial Intelligence Research, 1996,4:237-285.
  • 10HU J L, MICHAEL P M. Multiagent reinforcement learning:theoretical framework and an algorithm [ C]//Proc of ICML' 1998:242-250.

共引文献2

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部