GameTraffic: Optimal-traffic-scheduling and road-reconstruction based on the mining of historical traffic-scheduling data

Cited by: 1

Abstract: The goal of traffic intersection scheduling is to maximize traffic flow and minimize the average waiting time of vehicles. In traffic scheduling, intersections stand in a game relationship with one another: neighboring intersections coordinate their strategies, each seeking to maximize its own payoff. We model the traffic system using game theory and mine historical traffic-scheduling data with a reinforcement learning algorithm based on game equilibrium, in order to learn the optimal scheduling strategies for intersections and to predict road reconstruction. We demonstrate GameTraffic, a system for optimal intersection scheduling and road-reconstruction prediction, which aims to provide a scientific basis for intelligent traffic management and decision support.
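
The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea, not the GameTraffic method. It treats two neighboring intersections as players in a one-shot game, estimates each joint action's payoff by averaging historical records, and lists the pure-strategy Nash equilibria. The phase names, the record layout `(phase_a, phase_b, reward_a, reward_b)`, the reward values, and both function names are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import product

# Hypothetical signal phases; the paper's actual action set is not given.
PHASES = ("NS_GREEN", "EW_GREEN")


def estimate_payoffs(records):
    """Average the observed rewards of both intersections per joint action."""
    totals = defaultdict(lambda: [0.0, 0.0, 0])
    for phase_a, phase_b, reward_a, reward_b in records:
        entry = totals[(phase_a, phase_b)]
        entry[0] += reward_a
        entry[1] += reward_b
        entry[2] += 1
    return {joint: (sum_a / n, sum_b / n)
            for joint, (sum_a, sum_b, n) in totals.items()}


def pure_nash_equilibria(payoffs, actions=PHASES):
    """Return joint actions where neither intersection gains by deviating alone.

    Assumes every joint action appears at least once in the history.
    """
    equilibria = []
    for a, b in product(actions, repeat=2):
        u_a, u_b = payoffs[(a, b)]
        a_is_best = all(payoffs[(alt, b)][0] <= u_a for alt in actions)
        b_is_best = all(payoffs[(a, alt)][1] <= u_b for alt in actions)
        if a_is_best and b_is_best:
            equilibria.append((a, b))
    return equilibria


# Made-up history: rewards stand in for something like vehicles served
# minus a waiting-time penalty.
history = [
    ("NS_GREEN", "NS_GREEN", 10, 10),
    ("NS_GREEN", "NS_GREEN", 12, 8),
    ("NS_GREEN", "EW_GREEN", 4, 6),
    ("EW_GREEN", "NS_GREEN", 5, 3),
    ("EW_GREEN", "EW_GREEN", 7, 7),
]

payoffs = estimate_payoffs(history)
equilibria = pure_nash_equilibria(payoffs)
print(equilibria)
```

With this made-up history, both coordinated phase pairs are equilibria, which mirrors the mutual coordination between neighboring intersections that the abstract describes. A full equilibrium-based reinforcement learning method would additionally track state and long-run value, which this one-shot sketch omits.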

Authors: Yue Kun (岳昆), Han Ge (韩格), Liu Weiyi (刘惟一), Zhou Liping (周丽萍)

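Affiliations: Department of Computer Science and Engineering, School of Information, Yunnan University; Editorial Office of the Journal of Yunnan University (Natural Sciences Edition)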

Source: Journal of Yunnan University (Natural Sciences Edition), 2010, No. S1, pp. 345-349, 354 (6 pages). Indexed in: CAS, CSCD, Peking University Core Journals.

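Funding: National Natural Science Foundation of China (60763007, 60933001); Yunnan Province Applied Basic Research Program (2008CD083); Research Program of the Yunnan Provincial Department of Education (08Y0023); Yunnan University Research Program (2009F32Q); Yunnan University Training Program for Young and Middle-aged Key Teachers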

Keywords: traffic scheduling; game theory; reinforcement learning; optimal scheduling; road-reconstruction prediction

Classification: TP311.13 [Automation and Computer Technology: Computer Software and Theory]

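References (1)

1. HAN Ge, YUE Kun, LIU Weiyi. A game-theory-based method for learning optimal scheduling strategies of traffic systems [J]. Journal of Yunnan University (Natural Sciences Edition), 2010, 32(1): 36-42. Cited by: 3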
