期刊文献+

基于分层Q学习的联合抗干扰算法 被引量:5

Joint Anti-jamming Algorithm Based on Hierarchical Q Learning
下载PDF
导出
摘要 针对智能干扰威胁下的跨层抗干扰通信问题,提出一种基于分层Q学习的联合抗干扰学习算法。根据用户与干扰机之间的路由信道选择问题构建分层Stackelberg博弈模型,干扰机选择最佳干扰信道实施干扰,用户与干扰机进行路由信道博弈,选择最佳路由及信道实现通信。仿真结果表明,与固定路由-随机信道选择算法、随机路由-最佳信道选择算法和随机路由-随机信道选择算法相比,该算法具有更好的抗干扰性能。 Aiming at the cross-layer anti-jamming communication problem under the threat of intelligent interference,a joint anti-jamming learning algorithm based on hierarchical Q learning is proposed.The problem of routing selection and channel allocation between users and intelligent jammer is modeled as a hierarchical Stackelberg game.In the routing-chanel game between users and jammers,the intelligent jammer chooses the best channel for jamming,while users select the best route and channels for communication.Simulation results show that compared with the Fixed-routing-random-channel Selection Algorithm(FRSA),Random-routing-optimal-channel Selection Algorithm(ROSA) and Random-routing-random-channel Selection Algorithm(RRSA),the proposed algorithm has better anti-jamming capacity.
作者 韩晨 牛英滔 HAN Chen;NIU Yingtao(College of Communications Engineering,Army Engineering University,Nanjing 210000,China;Nanjing Telecommunication Technology Institute,Nanjing 210008,China)
出处 《计算机工程》 CAS CSCD 北大核心 2019年第5期279-284,共6页 Computer Engineering
基金 江苏省自然科学基金(BK20151450)
关键词 通信抗干扰 STACKELBERG博弈 信道分配 路由选择 Q学习 跨层设计 communication anti-jamming Stackelberg game channel allocation routing selection Q learning cross-layer design
  • 相关文献

参考文献4

二级参考文献43

  • 1田厚平,郭亚军,王学军.一类基于进化博弈的多主多从Stackelberg对策算法[J].系统工程学报,2005,20(3):303-307. 被引量:10
  • 2Choi K W, Hossain E. Opportunistic access to Spectrum holes between packet bursts; a learning-based approach[J]. IEEE Transactions on Wireless Communications, Aug. 2011, W( 8) ;2497-2509.
  • 3MitolaJ. Cognitive radio for flexible mobile multimedia communications[J]. Mobile Networks and Applications, May 2001 ,6(5) ;435-441.
  • 4Zhao Q, Tong L, Swami A, et al. Decentralized cogni?tive MAC for opportunistic spectrum access in ad hoc net?works; a POMDP framework[J]. IEEEJournal on Se?lected Areas in Communications, Mar. 2007, 25 ( 3 ) ; 589-600.
  • 5Chen Y, Zhao Q, Swami A.Joint design and separation principle for opportunistic spectrum access in the presence of sensing errors[J]. IEEE Transactions on Information Theory, May 2008,54(5) ;2053-2071.
  • 6Senthuran S, Anpalagan A, Das O. Throughput Analy?sis of Opportunistic Access Strategies in Hybrid Under?lay-Overlay Cognitive Radio Networks[J]. IEEE Trans?actions on Wireless Communications,Jun. 2012, 11 (6) ;2024-2035.
  • 7Li B W, Yang P L, WangJ L, et al. Optimal Action Point for Dynamic Spectrum Utilization Under Rayleigh Fading[J]. Ad-hoc & Sensor Wireless Networks,Jan. 2013,17(1-2) :1-32.
  • 8Wu L, Wang W, and Zhang Z. A POMDP-based optimal spectrum sensing and access scheme for cognitive radio networks with hardware limitation[J]. 2012 IEEE Wire?less Communications and Networking Conference (WC?NC), Shanghai, China, 2012, pp. 1281-1286.
  • 9Wang Y, Ren P, Zhou S U. A POMDP based distributed adaptive opportunistic spectrum access strategy for cogni?tive ad hoc networks[J]. IEICE Transactions on Commu?nications,Jun. 2011,94(6) :1621-1624.
  • 10Reddy Y B. Detecting primary signals for efficient utiliza?tion of spectrum using Q-leaming[CJ //Information Tech?nology: New Generations, 2008. ITNG 2008, Fifth Inter?national Conference on. IEEE, 2008: 360-365.

共引文献48

同被引文献22

引证文献5

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部