期刊文献+

基于强化学习的异构无线网络资源管理算法 被引量:5

Heterogeneous Wireless Network Resource Management Algorithm Based on Reinforcement Learning
下载PDF
导出
摘要 为了充分利用各种无线网络的资源,需要实现异构网络的融合,而异构网络的融合又面临接入控制与资源分配的问题。为此,提出一种基于强化学习的异构无线网络资源管理算法,该算法引入D2D(device-to-device)通信模式,并可以根据终端不同的业务类型、终端移动性及网络负载条件等状态,选择合适的网络接入方式。同时,为降低存储需求,采用神经网络技术解决连续状态空间问题。仿真结果表明,该算法具有高效的在线学习能力,能够有效地提升网络的频谱效用,降低阻塞率,从而实现自主的无线资源管理。 In order to make full use of the resources of all kinds of wireless network, the integration of heterogeneous network is necessary. However, when it comes to the heterogeneous network integration, the problems of call request access control and resource management emerge. A reinforcement-learning-based algorithm was presented for heterogeneous wireless network resource management. D2D (device-to-device) communication was introduced into the proposed algorithm and the appropriate network for access could be selected according to different traffic types, terminal mobility, network load status and so on. Meanwhile, to reduce the storage requirement, the neural network technology was used to solve the problem of continuous state space. Simulation results show that the proposed algorithm has an efficient learning ability to achieve autonomous radio resource management, which effectively improves the spectrum utility and reduces the blocking probability.
出处 《电信科学》 北大核心 2015年第8期99-106,共8页 Telecommunications Science
基金 福建省中青年教师教育科研项目A类资助项目(No.JA14233) 国家自然科学基金青年科学基金资助项目(No.61202013) 福建省自然科学基金资助项目(No.2015J01670)~~
关键词 异构无线网络 接入控制 资源管理 强化学习 Q学习 heterogeneous wireless network, access control, resource management, reinforcement learning, Q-learning
  • 相关文献

参考文献21

  • 1Luo J, Mukerjee R, Dillinger M, et al. Investigation of radio resource scheduling in WLANs coupled with 3G cellular network. IEEE Communications Magazine, 2003, 41(6): 108-115.
  • 23GPP TR 25.881 v5.0.0. Improvement of RRM across RNS and RNS/BSS (Release5). http://www.3gpp.org, 2001.
  • 3Song Q, Jamalipour A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques~ IEEE Wireless Communications, 2005, 12(3): 42~48.
  • 4Zhang Y, Chela J, Zhang P. Autonomic joint radio resource management in B3G environment using reinforcement learning. Proceedings of the 6th Annual Wireless Telecommunications Symposium (WTS 2007), Pomona, California, USA, 2007.
  • 5Barto A G. Reinforcement Learning: an Introduction. Cambridge: MIT Press, 1998.
  • 6Kaelbling L P, Littman M L, Moore A W. Reinforcement learning: a survey. Journal of Artificial Intelligence Research, 1996 (4): 237-285.
  • 7Nie J, Haykin S. A Q-learning-based dynamic channel assignment technique for mobile eommunieation systems. IEEE Transactions on Vehicular Technology, 1999, 48(5): 1676~1687.
  • 8Senouci S M, Beylot A L, Pujolle G. Call admission control in eellular networks: a reinforcement learning solution. International Journal of Network Management, 2004, 14(2): 89~103.
  • 9Haddad M, Altman Z, Elayoubi S E, et a/. A nash-stackelberg fuzzy Q-learning decision approach in heterogeneous cognitive networks. Proceedings of Global Telecommunications Conference (GLOBECOM 2010), Miami, Florida, USA, 2010:1~6.
  • 10Simsek M, Czylwik A. Decentralized Q-learning of LTE-femtoeells for interference reduction in heterogeneous networks using cooperation. Proceedings of 2012 International ITG Workshop on Smart Antennas (WSA), Dresden, Germany, 2012:86~91.

二级参考文献10

  • 1Song Q and Jamalipour A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques[J]. IEEE Wireless Commun., 2005, 12(3): 42-48.
  • 23GPP TR 25.881 v5.0.0. Improvement of RRM across RNS and RNS/BSS (Release 5) [OL]. http://www.3gpp.org, Dec. 2001.
  • 3IST-2003-507995 Project E2R (End-to-End Reconfigurability) [OL]. http://e2r.motlabs.com, Jan. 2004.
  • 4Agusti R, Salient O, and Perez-Romero J, et al.. A fuzzyneural based approach for joint radio resource management in a beyond 3G framework[C]. First Int. Conf. on Quality of Service in Heterogeneous Wired/Wireless Networks, Barcelona, Mar. 2004: 216-224.
  • 5Luo J, Mohyeldin E, and Dillinger M, et al.. Performance analysis of joint radio resource management for reconfigurable terminals with multi-class circuit-switched services[C]. Wireless World Research Forum 12th Meeting, Toronto, Nov. 2004: 138-150.
  • 6Zhang Y, Zhang K, and Ji Y, et al.. Adaptive threshold joint load control in an end-to-end reconfigurable systemiC]. IST Mobile and Wireless Summit 2006, Mykonos, Jun. 2006: 332-337.
  • 7Kaelbling L P, Littman M L, and Wang X, et al..Reinforcement learning: a survey[J]. Journal of Artificial Intelligence Research, 1996, 4(2): 237-285.
  • 8Nie J and Haykin S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans. on Vehicular Technology, 1999, 48(5): 1676- 1687.
  • 9Watkins C J C H and Dayan P. Q-learning[J]. Machine Learning, 1992, 8(3): 279-292.
  • 10Radunovic B, Le Boudec J Y. Rate performance objectives of multihop wireless networks[J]. IEEE Trans. on Mobile Computing, 2004, 3(4): 334-349.

共引文献8

同被引文献17

引证文献5

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部