期刊文献+

基于强化学习的动态频谱分配研究 被引量:1

Dynamic spectrum allocation research based on reinforcement learning
下载PDF
导出
摘要 首先介绍了认知无线电技术产生的背景,以及强化学习的发展和应用于认知领域的优势;接着对强化学习的基本原理及其2个常见的模型Q-Learning和POMDP作了介绍,并对其模型定义、思想、所要描述的问题和使用的场景都做了较详细的阐述;然后针对这个方向最近几年的顶级会议和期刊论文,分析了其主要内容;通过最近几年的学术、会议论文中所述的研究现状及成果,说明强化学习的主要特点是能够准确、快速学习到最优策略,能够模拟真实环境,自适应性强,提高频谱感知、分配效率,从而最大化系统吞吐量,这些优势充分证明了强化学习将是认知领域里一种很有前景的技术。 This essay briefly sketches the background and characteristic of cognitive radio and reinforcement learning tech- nology. It reviews the main research direction of the field of cognitive radio for dynamic spectrum allocation ( DSA), inclu- ding the introduction of the two common models in Reinforcement Learning: Q-learning and partially observable markov de- cision process (POMDP). And we analyze the research contents and developments for DSA on the basis of the two models in recent years. Finally, we deduce a conclusion and forecast the development trend of this field in the future.
作者 杜江 刘毅
出处 《数字通信》 2012年第4期34-38,共5页 Digital Communications and Networks
关键词 认知无线电 动态频谱分配 强化学习 Q学习 部分感知 马尔科夫决策过程 cognitive radio dynamic spectrum allocation reinforcement Learning Q-Learning partial perception POM- DP
  • 相关文献

参考文献13

  • 1王军,李少谦.认知无线电:原理、技术与发展趋势[J].中兴通讯技术,2007,13(3):1-4. 被引量:39
  • 2高阳,陈世福,陆鑫.强化学习研究综述[J].自动化学报,2004,30(1):86-100. 被引量:262
  • 3KAELBLING L P, LITTMAN M L, MOORE A W. Rein- forcement learning : a survey [ J ]. Journal of Artificial In- telligence Research, 1996,4 : 237-285.
  • 4LI Mo. A Q-Learning based sensing task selection scheme for cognitive radio networks [ C ] //Wireless Communica- tions & Signal Processing. Nanjing: WCSP : 2009 : 1-5.
  • 5WATKINS C J C H, PETER D. Q-Learning[J]. Machine Learning, 1992 (8) ,279-292.
  • 6LOVEJOY W S. A survey of algorithrnlc methods for par- tially observed Markov decision process [ J ]. Annals of Operations Research, 1991 (28) : 47-65:
  • 7KAELBLING L P, MICHAEL L L, ANTHONY R C. Planning and acting in partially observable stochastic do- mains[ J]. Artificial Intelligence, 1998 ( 101 ) :99-134.
  • 8Performance analysis of reinforcement learning for achie- ving context awareness and intelligence in mobile cogni- tive radio networks [ C ]//Advanced hfforrnation Networ- king and Applications. Biopolis : IEEE, 2011 : 1-8.
  • 9YAO Yanjun, FENG Zhiyong. Centralized channel andpower allocation for cognitive radio networks: a q-learning solution [ C ] JJ Future Network&Mobile Summit. Flor- ence : IEEE, 2010 : 1-$.
  • 10TENG Yinglei, ZHANG Yong, NIU Fang, et al. Rein- forcement learning based auction algorithm for dynamic spectrum access in cognitive radio networks[ J ]. Vehicular Technology Conference Fall, 2010 (72) : 1-5.

二级参考文献5

共引文献298

同被引文献11

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部