基于DAQL算法的动态频谱接入方案被引量：3

Dynamic spectrum access based on DAQL

下载PDF

导出

摘要针对传统的动态频谱接入方案一般没有考虑自主性,不具备普适性这一缺点,提出了一种基于双动作Q学习算法DAQL(double action Q-learning)的频谱接入方案,该方案将DAQL引入到多授权用户存在的环境下频谱接入问题中,用以降低接入未知频谱环境时的冲突概率。仿真结果表明,提出的方案与随机接入方案相比,不但有更小的冲突概率,而且能动态适应环境的变化,适合认知无线电的需要。 As for the traditional dynamic spectrum access schemes that are not fit to all kinds of instances and do not consider the ability of self-determination, a scheme based on Double action Q-learning （DAQL） was proposed to solve the problem of dynamic spectrum access. This scheme could reduce the collision probability when accessing unknown spectrum by introducing DAQL into the scenario where there were many licensed users. Simulation results show that this scheme has lower collision probability than random access scheme, and can be adaptive to the change of environment so that it can satisfy the need of cognitive radio.

作者吴启晖刘琼俐

机构地区解放军理工大学通信工程学院

出处《解放军理工大学学报（自然科学版）》 EI 2008年第6期607-611,共5页 Journal of PLA University of Science and Technology(Natural Science Edition)

基金国家863计划资助项目(2007AA01Z267) 国家973计划资助项目(2009CB3020402)

关键词强化学习 Q学习双动作Q学习算法冲突概率 reinforcement learning Q-learning DAQL （double action Q-learning） collision probability

分类号 TN929.5 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献9

1HAYKIN S. Cognitive radio: brain-empowered wireless communications[J]. IEEE Journal on Selected Areas in Communications, 2005,23 (2) : 201-220.
2KAELBLING L P, LITTMAN M L, MOORE A W. Reinforcement learning: a survey[J].Journal of Artificial Intelligence Research, 1996 (4) :237-285.
3NIE J, HAYKIN S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans on Vehicular Technology, 1999, 48: 1676-1687.
4张永靖,冯志勇,张平.基于Q学习的自主联合无线资源管理算法[J].电子与信息学报,2008,30(3):676-680. 被引量：9
5WATKINS C J C H. Learning from delayed rewards[D]. Cambridge: Cambridge University, 1989.
6WATKINS C J C H, DAYAN P. Q-learning[J]. Machine Learning, 1992,8:279-292.
7NGAI D C K, YUNG N H C. Double action Q-learning for obstacle avoidance in a dynamically changing environment[C]. Las Vegas: Proceedings of the 2005 IEEE Intelligent Vehicles Symposium, 2005.
8NGAI D C K, YUNG N H C. Performance evaluation of double action Q-Learning in moving obstacle avoidance problem[C]. Hawaii:Proceedings of the 2005 IEEE International Conference on Systems, Man, and Cybernetics, 2005.
9WAEL ABD-ALMAGEED, ALY I EL-OSERY, CHRISTOPHER E S. Estimating time-varying densities using a stochastic learning automaton [J]. Soft Computing-A Fusion of Foundations, Methodologies and Applications, 2006,10(11):1007-1020.

二级参考文献10

1Song Q and Jamalipour A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques[J]. IEEE Wireless Commun., 2005, 12(3): 42-48.
23GPP TR 25.881 v5.0.0. Improvement of RRM across RNS and RNS/BSS (Release 5) [OL]. http://www.3gpp.org, Dec. 2001.
3IST-2003-507995 Project E2R (End-to-End Reconfigurability) [OL]. http://e2r.motlabs.com, Jan. 2004.
4Agusti R, Salient O, and Perez-Romero J, et al.. A fuzzyneural based approach for joint radio resource management in a beyond 3G framework[C]. First Int. Conf. on Quality of Service in Heterogeneous Wired/Wireless Networks, Barcelona, Mar. 2004: 216-224.
5Luo J, Mohyeldin E, and Dillinger M, et al.. Performance analysis of joint radio resource management for reconfigurable terminals with multi-class circuit-switched services[C]. Wireless World Research Forum 12th Meeting, Toronto, Nov. 2004: 138-150.
6Zhang Y, Zhang K, and Ji Y, et al.. Adaptive threshold joint load control in an end-to-end reconfigurable systemiC]. IST Mobile and Wireless Summit 2006, Mykonos, Jun. 2006: 332-337.
7Kaelbling L P, Littman M L, and Wang X, et al..Reinforcement learning: a survey[J]. Journal of Artificial Intelligence Research, 1996, 4(2): 237-285.
8Nie J and Haykin S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans. on Vehicular Technology, 1999, 48(5): 1676- 1687.
9Watkins C J C H and Dayan P. Q-learning[J]. Machine Learning, 1992, 8(3): 279-292.
10Radunovic B, Le Boudec J Y. Rate performance objectives of multihop wireless networks[J]. IEEE Trans. on Mobile Computing, 2004, 3(4): 334-349.

共引文献8

1李默,徐友云,蔡跃明.基于Q-Learning的认知无线电系统感知管理算法[J].电子与信息学报,2010,32(3):623-628. 被引量：3
2吴爱军,李屹.异构无线网络中支持端到端重配置的资源管理技术[J].信息化研究,2010,36(8):5-7. 被引量：1
3赵彦清,朱琦.基于Q学习的异构网络选择新算法[J].计算机应用,2011,31(6):1461-1464. 被引量：4
4江虹,伍春,刘勇.基于强化学习的频谱决策与传输算法[J].系统仿真学报,2013,25(3):565-570. 被引量：1
5赵彪,李鸥,栾红志.Q学习算法在机会频谱接入信道选择中的应用[J].信号处理,2014,30(3):298-305. 被引量：4
6冯陈伟,袁江南.基于强化学习的异构无线网络资源管理算法[J].电信科学,2015,31(8):99-106. 被引量：5
7冯陈伟,张璘.一种基于Q学习的网络接入控制算法[J].计算机工程,2015,41(10):99-104. 被引量：5
8刘惠茹,马琳,徐玉滨.基于Q学习的CDMA/WLAN异构网络接入控制算法[J].通信技术,2016,49(8):1017-1022.

同被引文献21

1杨光,余凯,章魁,张平.基于IEEE 802.11e MAC的联合无线资源管理[J].重庆邮电学院学报（自然科学版）,2005,17(6):662-666. 被引量：1
2WU GANG, HAVINGA P J M, MIZUNO M. Wireless Internet over heterogeneous wireless networks [ C]// Global Tel Conference. San Antonio: IEEE Press, 2001:1759 - 1765.
33GPP TR 25. 881 v5.0.0. Improvement of RRM across RNS and RNS/BSS ( Release 5) [ EB/OL]. [ 2010 - 09 - 01 ]. http://www. 3 gpp. org.
4SONG QINGYANG, JAMALIPOUR A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques [ J]. IEEE Transactions on Wireless Communications, 2005, 12(3): 42-48.
5SUTTON R S, BARTO A G. Reinforcement learning: An introduction[ J]. IEEE Transactions on Neural Networks, 2005, 16( 1):285 - 286.
6MATARIEN M J. Reinforcement learning in the multirobot domain [J]. Autonomous Robot, 1997, 4(1): 76-79.
7SAKER L, JEMAA S B, ELAYOUBI S E. Q-learning for joint access decision in heterogeneous networks [ C]// WCNC'09: Proceedings of the 2009 IEEE conference on Wireless Communications & Networking Conference. Piscataway: IEEE Press, 2009:1904-1908.
8NIE J, HAYKIN S. A Q-learning-based dynamic channel assignment technique for mobile communication systems [ J]. IEEE Transactions on Vehicular Technology, 1999, 48(5) : 1676 - 1687.
9ALEXANDRI E, MARTINEZ G, ZEGHLACHE D. Adaptive joint call admission control and access network selection for multimedia wireless systems [ C]// The 5th International Symposium on Wireless Personal Multimedia Communications. Honolulu, USA: IEEE, 2002:1390 - 1394.
10张永靖,冯志勇,张平.基于Q学习的自主联合无线资源管理算法[J].电子与信息学报,2008,30(3):676-680. 被引量：9

引证文献3

1赵彦清,朱琦.基于Q学习的异构网络选择新算法[J].计算机应用,2011,31(6):1461-1464. 被引量：4
2李志宏.通信工程中有线传输技术的应用改进[J].电子技术与软件工程,2018(12):23-23. 被引量：6
3张云景,王昊,王帅,孟斌.基于改进SARSA算法的航空器滑行路径规划[J].郑州航空工业管理学院学报,2024,42(1):43-48.

二级引证文献10

1陈雯,丰文斌,廖小飞,余海翔.异构无线网络接入控制演示平台的实现[J].东华大学学报（自然科学版）,2015,41(3):335-340.
2刘惠茹,马琳,徐玉滨.基于Q学习的CDMA/WLAN异构网络接入控制算法[J].通信技术,2016,49(8):1017-1022.
3黄海燕.通信工程中有线传输技术的改进措施运用[J].通信电源技术,2019,36(1):221-222. 被引量：7
4曾康铭.通信工程中有线传输技术的应用与优化[J].风景名胜,2019,0(11):306-306.
5沈洋.通信工程中的有线传输技术应用[J].集成电路应用,2020,37(6):112-113. 被引量：1
6龙烁宇.高效互联网传输技术的应用策略分析[J].中国新通信,2021,23(14):15-16.
7何灿.通信工程中有线传输技术的应用与优化[J].通信电源技术,2022,39(4):90-92.
8丁雨,李晨凯,韩会梅,卢为党,任元红,高原,曹江.基于5G无人机通信的多智能体异构网络选择方法[J].电信科学,2022,38(8):28-36. 被引量：6
9胡瑜洪,王德光,何家汉,张志恒.离散事件系统最优监督控制算法[J].计算机应用,2023,43(7):2271-2279.
10黄勇.通信工程中有线传输技术的优化策略探讨[J].中国新通信,2019,0(5):1-2. 被引量：1

1黄影,严定宇,李男.动态频谱接入的Q学习优化算法[J].西安电子科技大学学报,2015,42(6):179-183. 被引量：1
2徐玉滨,陈佳美,马琳.基于Q学习的WLAN/WIMAX接入控制网络选择策略[J].华南理工大学学报（自然科学版）,2013,41(8):41-46. 被引量：1
3钱进,郭士增,王孝.基于Q学习异构网络干扰协调算法[J].现代电子技术,2016,39(23):13-16. 被引量：1
4杨秀清,陈禹,李正富.Femtocell双层网络中基于Q-learning的子信道分配方案[J].电子与信息学报,2017,39(3):598-604. 被引量：1
5张雅男,乔瑞娟.基于ZigBee的认知路由协议研究[J].电子世界,2014(6):94-95.
6叶邦彦,赵学智,裴胜伟,华蕊.基于动态适应视频编码技术的机械设备远程监控系统[J].Journal of Shanghai University(English Edition),2004,8(A01):227-230.
7李默,徐友云,蔡跃明.基于Q-Learning的认知无线电系统感知管理算法[J].电子与信息学报,2010,32(3):623-628. 被引量：3
8江虹,伍春,刘勇.基于强化学习的频谱决策与传输算法[J].系统仿真学报,2013,25(3):565-570. 被引量：1
9段勇,陈腾峰.基于强化学习的多机器人避碰算法研究[J].信息技术,2012,36(6):100-103. 被引量：2
10赵彪,李鸥,栾红志.Q学习算法在机会频谱接入信道选择中的应用[J].信号处理,2014,30(3):298-305. 被引量：4

解放军理工大学学报（自然科学版）

2008年第6期

浏览历史

内容加载中请稍等...

基于DAQL算法的动态频谱接入方案被引量：3

参考文献9

二级参考文献10

共引文献8

同被引文献21

引证文献3

二级引证文献10

相关作者

相关机构

相关主题

浏览历史

基于DAQL算法的动态频谱接入方案 被引量：3

参考文献9

二级参考文献10

共引文献8

同被引文献21

引证文献3

二级引证文献10

相关作者

相关机构

相关主题

浏览历史

基于DAQL算法的动态频谱接入方案被引量：3