
A Q-learning Based Autonomic Joint Radio Resource Management Algorithm (基于Q学习的自主联合无线资源管理算法)

Cited by: 9
Abstract: A Q-learning based Joint Radio Resource Management (JRRM) algorithm is proposed for autonomic resource optimization in a B3G system with heterogeneous Radio Access Technologies (RATs). Through "trial-and-error" interactions with the radio environment, the JRRM controller learns to allocate the proper RAT and service bandwidth to each session. To reduce the memory requirement, a backpropagation neural network is adopted to generalize the large input state space. Simulation results show that the proposed algorithm not only realizes the autonomy of JRRM through online learning, but also achieves a good trade-off between spectrum utility and blocking probability.
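The abstract names the technique but not its mechanics. The following is a minimal Python sketch of that combination: Q-learning whose value function is approximated by a small backpropagation-trained neural network instead of a lookup table, with epsilon-greedy exploration standing in for the "trial-and-error" interaction. The state layout, the joint (RAT, bandwidth) action set, the network size, and the reward handling are all illustrative assumptions, not the paper's actual formulation.

```python
# Hedged sketch of a Q-learning JRRM controller with a backpropagation
# neural network as Q-function approximator. Dimensions and names are
# illustrative assumptions, not the authors' design.
import numpy as np

N_RATS = 2            # e.g. one cellular RAT and one WLAN (assumption)
N_BW_LEVELS = 3       # discrete bandwidth choices per session (assumption)
N_ACTIONS = N_RATS * N_BW_LEVELS
STATE_DIM = 4         # e.g. per-RAT loads plus session descriptors (assumption)
HIDDEN = 16

rng = np.random.default_rng(0)

class QNetwork:
    """One-hidden-layer MLP trained by plain backpropagation.

    Maps a continuous JRRM state to one Q-value per joint (RAT, bandwidth)
    action, replacing the lookup table to cut memory requirements."""
    def __init__(self, lr=0.01):
        self.W1 = rng.normal(0, 0.1, (STATE_DIM, HIDDEN))
        self.b1 = np.zeros(HIDDEN)
        self.W2 = rng.normal(0, 0.1, (HIDDEN, N_ACTIONS))
        self.b2 = np.zeros(N_ACTIONS)
        self.lr = lr

    def forward(self, s):
        self.h = np.tanh(s @ self.W1 + self.b1)   # hidden activations, cached
        return self.h @ self.W2 + self.b2          # one Q-value per action

    def update(self, s, a, target):
        # Backpropagate the TD error for the taken action only.
        q = self.forward(s)
        err = q[a] - target
        one_hot = np.eye(N_ACTIONS)[a]
        dh = self.W2[:, a] * err * (1 - self.h ** 2)  # tanh derivative
        self.W2 -= self.lr * np.outer(self.h, one_hot) * err
        self.b2 -= self.lr * one_hot * err
        self.W1 -= self.lr * np.outer(s, dh)
        self.b1 -= self.lr * dh

def select_action(net, s, epsilon=0.1):
    """Epsilon-greedy 'trial-and-error' exploration over joint actions."""
    if rng.random() < epsilon:
        return int(rng.integers(N_ACTIONS))
    return int(np.argmax(net.forward(s)))

def q_learning_step(net, s, a, r, s_next, gamma=0.9, terminal=False):
    """Standard Q-learning target: r + gamma * max_a' Q(s', a')."""
    target = r if terminal else r + gamma * np.max(net.forward(s_next))
    net.update(s, a, target)

# Illustrative single transition: admit a session, observe a reward that
# would trade spectrum utility against blocking, and update online.
net = QNetwork()
s = rng.random(STATE_DIM)
a = select_action(net, s)
q_learning_step(net, s, a, r=1.0, s_next=rng.random(STATE_DIM))
```

The target in q_learning_step is the standard Watkins-Dayan update (reference 9 below), Q(s,a) <- Q(s,a) + alpha [r + gamma max_a' Q(s',a') - Q(s,a)], here realized as a regression target for the network; the network's only job is to generalize across states a table could not store, which is the memory argument the abstract makes.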
Source: Journal of Electronics & Information Technology (《电子与信息学报》), 2008, No. 3: 676-680 (5 pages). Indexed: EI, CSCD, Peking University Core (北大核心).
Funding: EU FP6 End-to-End Reconfigurability project (IST-2005-027714), Key Project of the National Natural Science Foundation of China (60502035), National 863 Program (2006AA01Z276), and the MOST China-EU Science and Technology Cooperation Project (0516).
Keywords: Radio Access Technology (RAT), Joint admission control, Bandwidth allocation, Q-learning, Neural network

References (10)

1. Song Q and Jamalipour A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques[J]. IEEE Wireless Commun., 2005, 12(3): 42-48.
2. 3GPP TR 25.881 v5.0.0. Improvement of RRM across RNS and RNS/BSS (Release 5)[OL]. http://www.3gpp.org, Dec. 2001.
3. IST-2003-507995 Project E2R (End-to-End Reconfigurability)[OL]. http://e2r.motlabs.com, Jan. 2004.
4. Agusti R, Sallent O, Perez-Romero J, et al. A fuzzy-neural based approach for joint radio resource management in a beyond 3G framework[C]. First Int. Conf. on Quality of Service in Heterogeneous Wired/Wireless Networks, Barcelona, Mar. 2004: 216-224.
5. Luo J, Mohyeldin E, Dillinger M, et al. Performance analysis of joint radio resource management for reconfigurable terminals with multi-class circuit-switched services[C]. Wireless World Research Forum 12th Meeting, Toronto, Nov. 2004: 138-150.
6. Zhang Y, Zhang K, Ji Y, et al. Adaptive threshold joint load control in an end-to-end reconfigurable system[C]. IST Mobile and Wireless Summit 2006, Mykonos, Jun. 2006: 332-337.
7. Kaelbling L P, Littman M L, and Moore A W. Reinforcement learning: a survey[J]. Journal of Artificial Intelligence Research, 1996, 4: 237-285.
8. Nie J and Haykin S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans. on Vehicular Technology, 1999, 48(5): 1676-1687.
9. Watkins C J C H and Dayan P. Q-learning[J]. Machine Learning, 1992, 8(3): 279-292.
10. Radunovic B and Le Boudec J Y. Rate performance objectives of multihop wireless networks[J]. IEEE Trans. on Mobile Computing, 2004, 3(4): 334-349.

Co-cited References (98; first 10 shown)

1. Yang Guang, Yu Kai, Zhang Kui, and Zhang Ping. Joint radio resource management based on IEEE 802.11e MAC[J]. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2005, 17(6): 662-666. Cited by 1.
2. Luo Qiang. Research on end-to-end reconfiguration technology[J]. Telecommunications Science (电信科学), 2006, 22(12): 40-45. Cited by 2.
3. Haykin S. Cognitive radio: brain-empowered wireless communications[J]. IEEE Journal on Selected Areas in Communications, 2005, 23(2): 201-220.
4. Kaelbling L P, Littman M L, and Moore A W. Reinforcement learning: a survey[J]. Journal of Artificial Intelligence Research, 1996, 4: 237-285.
5. Nie J and Haykin S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans. on Vehicular Technology, 1999, 48: 1676-1687.
6. Watkins C J C H. Learning from delayed rewards[D]. Cambridge: University of Cambridge, 1989.
7. Watkins C J C H and Dayan P. Q-learning[J]. Machine Learning, 1992, 8: 279-292.
8. Ngai D C K and Yung N H C. Double action Q-learning for obstacle avoidance in a dynamically changing environment[C]. Proceedings of the 2005 IEEE Intelligent Vehicles Symposium, Las Vegas, 2005.
9. Ngai D C K and Yung N H C. Performance evaluation of double action Q-learning in moving obstacle avoidance problem[C]. Proceedings of the 2005 IEEE International Conference on Systems, Man, and Cybernetics, Hawaii, 2005.
10. Abd-Almageed W, El-Osery A I, and Smith C E. Estimating time-varying densities using a stochastic learning automaton[J]. Soft Computing - A Fusion of Foundations, Methodologies and Applications, 2006, 10(11): 1007-1020.

Citing Literature (9)

Secondary Citing Literature (25)
