This article addresses the autonomy of joint radio resource management (JRRM) between heterogeneous radio access technologies(RATs) owned by multiple operators. By modeling the inter-operator competition as a gene...This article addresses the autonomy of joint radio resource management (JRRM) between heterogeneous radio access technologies(RATs) owned by multiple operators. By modeling the inter-operator competition as a general-sum Markov game, correlated-Q learning(CE-Q) is introduced to generate the operators' pricing and admission policies at the correlated equilibrium autonomically. The heterogeneity in terms of coverage, service suitability, and cell capacity amongst different RATs are considered in the input state space, which is generalized using multi-layer feed-forward neural networks for less memory requirement. Simulation results indicate that the proposed algorithm can produce rational JRRM polices for each network under different load conditions through the autonomic learning process. Such policies guide the traffic toward an optimized distribution and improved resource utilization, which results in the highest network profits and lowest blocking probability compared to other self-learning algorithms.展开更多
基金This work is supported by the National Natural Science Foundation of China (60632030);the Integrated Project of the 6th Framework Program of the European Commission (IST-2005-027714);the Hi-Tech Research and Development Program of China (2006AA01Z276) ;the China-European Union Science and Technology Cooperation Foundation of Ministry of Science and Technology of China (0516).
文摘This article addresses the autonomy of joint radio resource management (JRRM) between heterogeneous radio access technologies(RATs) owned by multiple operators. By modeling the inter-operator competition as a general-sum Markov game, correlated-Q learning(CE-Q) is introduced to generate the operators' pricing and admission policies at the correlated equilibrium autonomically. The heterogeneity in terms of coverage, service suitability, and cell capacity amongst different RATs are considered in the input state space, which is generalized using multi-layer feed-forward neural networks for less memory requirement. Simulation results indicate that the proposed algorithm can produce rational JRRM polices for each network under different load conditions through the autonomic learning process. Such policies guide the traffic toward an optimized distribution and improved resource utilization, which results in the highest network profits and lowest blocking probability compared to other self-learning algorithms.