期刊文献+

Q学习算法在机会频谱接入信道选择中的应用 被引量:4

Application of Q-Learning Algorithm in Channel Selection for Opportunistic Spectrum Access
下载PDF
导出
摘要 针对"先听后传"的机会频谱接入中认知用户的信道选择问题,本文提出了一种基于Q学习的信道选择算法。在非理想感知的条件下,通过建立认知用户的信道选择模型并设计恰当的奖励函数,使智能体能够与未知环境不断交互和学习,进而选择长期累积回报最大的信道接入。在学习过程中,本文引入了Boltzmann实验策略,运用模拟退火思想实现了资源探索与资源利用之间的折衷。仿真结果表明,所提算法能够在未知环境先验知识条件下可以快速选择性能较好的信道接入,有效提高认知用户的接入吞吐量和系统的平均容量。 Considering the problem of channel selection for opportunistic spectrum access (OSA),a Q-Learning based channel selection scheme was proposed for OSA in this paper.A secondary user detected the channels licensed to some primary users periodically before it decided whether to transmit in the OSA system.Under imperfect sensing circumstances,the construction of channel selection model of the secondary user and the designation of an appropriate reward function play a significant role in the continuous interaction and learning between the agent and unknown environment,thus selecting the channel with the maximum cumulative reward.During the learning stage,a Boltzmann learning rule using simulated annealing ideas was employed to realize the tradeoff between channel exploration and exploitation.As the simulation results show,the proposed algorithm can get access to suitable channel,and raise the average system capacity and throughput of the secondary user effectively in the absence of prior knowledge on the channel environment.
出处 《信号处理》 CSCD 北大核心 2014年第3期298-305,共8页 Journal of Signal Processing
基金 国家科技重大专项(2008ZX03006) 国家863计划(2012AA711)
关键词 认知无线电 机会频谱接入 信道选择 Q学习 cognitive radio opportunistic spectrum access channel selection Q learning
  • 相关文献

参考文献13

  • 1Choi K W, Hossain E. Opportunistic access to Spectrum holes between packet bursts; a learning-based approach[J]. IEEE Transactions on Wireless Communications, Aug. 2011, W( 8) ;2497-2509.
  • 2MitolaJ. Cognitive radio for flexible mobile multimedia communications[J]. Mobile Networks and Applications, May 2001 ,6(5) ;435-441.
  • 3Zhao Q, Tong L, Swami A, et al. Decentralized cogni?tive MAC for opportunistic spectrum access in ad hoc net?works; a POMDP framework[J]. IEEEJournal on Se?lected Areas in Communications, Mar. 2007, 25 ( 3 ) ; 589-600.
  • 4Chen Y, Zhao Q, Swami A.Joint design and separation principle for opportunistic spectrum access in the presence of sensing errors[J]. IEEE Transactions on Information Theory, May 2008,54(5) ;2053-2071.
  • 5Senthuran S, Anpalagan A, Das O. Throughput Analy?sis of Opportunistic Access Strategies in Hybrid Under?lay-Overlay Cognitive Radio Networks[J]. IEEE Trans?actions on Wireless Communications,Jun. 2012, 11 (6) ;2024-2035.
  • 6Li B W, Yang P L, WangJ L, et al. Optimal Action Point for Dynamic Spectrum Utilization Under Rayleigh Fading[J]. Ad-hoc & Sensor Wireless Networks,Jan. 2013,17(1-2) :1-32.
  • 7Wu L, Wang W, and Zhang Z. A POMDP-based optimal spectrum sensing and access scheme for cognitive radio networks with hardware limitation[J]. 2012 IEEE Wire?less Communications and Networking Conference (WC?NC), Shanghai, China, 2012, pp. 1281-1286.
  • 8Wang Y, Ren P, Zhou S U. A POMDP based distributed adaptive opportunistic spectrum access strategy for cogni?tive ad hoc networks[J]. IEICE Transactions on Commu?nications,Jun. 2011,94(6) :1621-1624.
  • 9张永靖,冯志勇,张平.基于Q学习的自主联合无线资源管理算法[J].电子与信息学报,2008,30(3):676-680. 被引量:9
  • 10Reddy Y B. Detecting primary signals for efficient utiliza?tion of spectrum using Q-leaming[CJ //Information Tech?nology: New Generations, 2008. ITNG 2008, Fifth Inter?national Conference on. IEEE, 2008: 360-365.

二级参考文献25

  • 1Song Q and Jamalipour A. Network selection in an integrated wireless LAN and UMTS environment using mathematical modeling and computing techniques[J]. IEEE Wireless Commun., 2005, 12(3): 42-48.
  • 23GPP TR 25.881 v5.0.0. Improvement of RRM across RNS and RNS/BSS (Release 5) [OL]. http://www.3gpp.org, Dec. 2001.
  • 3IST-2003-507995 Project E2R (End-to-End Reconfigurability) [OL]. http://e2r.motlabs.com, Jan. 2004.
  • 4Agusti R, Salient O, and Perez-Romero J, et al.. A fuzzyneural based approach for joint radio resource management in a beyond 3G framework[C]. First Int. Conf. on Quality of Service in Heterogeneous Wired/Wireless Networks, Barcelona, Mar. 2004: 216-224.
  • 5Luo J, Mohyeldin E, and Dillinger M, et al.. Performance analysis of joint radio resource management for reconfigurable terminals with multi-class circuit-switched services[C]. Wireless World Research Forum 12th Meeting, Toronto, Nov. 2004: 138-150.
  • 6Zhang Y, Zhang K, and Ji Y, et al.. Adaptive threshold joint load control in an end-to-end reconfigurable systemiC]. IST Mobile and Wireless Summit 2006, Mykonos, Jun. 2006: 332-337.
  • 7Kaelbling L P, Littman M L, and Wang X, et al..Reinforcement learning: a survey[J]. Journal of Artificial Intelligence Research, 1996, 4(2): 237-285.
  • 8Nie J and Haykin S. A Q-learning-based dynamic channel assignment technique for mobile communication systems[J]. IEEE Trans. on Vehicular Technology, 1999, 48(5): 1676- 1687.
  • 9Watkins C J C H and Dayan P. Q-learning[J]. Machine Learning, 1992, 8(3): 279-292.
  • 10Radunovic B, Le Boudec J Y. Rate performance objectives of multihop wireless networks[J]. IEEE Trans. on Mobile Computing, 2004, 3(4): 334-349.

共引文献16

同被引文献12

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部