基于近端策略优化算法的频谱切换问题研究

Proximal Policy Optimization Method for Spectrum Handoff in Cognitive Radio Networks

下载PDF

导出

摘要针对认知无线网络(Cognitive Radio Network,CRN)中的频谱切换问题,提出了一种基于强化学习的近端策略优化(Proximal Policy Optimization,PPO)方法。首先,将频谱切换问题建模为马尔可夫决策过程,设计了一种基于用户体验质量(Quality of Experience,QoE)的回报函数。其次,通过训练算法模型使长期回报最大化,从而实现了最优频谱切换。最后,通过仿真实验对提出的切换方法进行验证。结果表明,基于PPO的频谱切换方法能够实现更高效和更稳定的切换,提高了认知用户的可用传输速率和数据交付成功率,缩短了数据交付时间。 A PPO(Proximal Policy Optimization) method based on reinforcement learning is proposed to solve the spectrum handoff problem in cognitive radio networks. Firstly, the spectrum handoff problem is transformed into a Markov decision process. Then, a novel kind of return function based on the QoE(Quality of Experience) is designed. Optimal spectrum handoff is achieved by training the model to maximize the long-term return. Finally, the proposed handoff method is compared with other methods by simulation. The results indicate that the spectrum handoff method based on PPO can achieve more efficient and stable handoff. It can improve the available date rate of secondary users, shorten the data delivery time, and improve the success rate of data delivery.

作者李淑丰邵尉谢然于玉江 LI Shufeng;SHAO Wei;XIE Ran;YU Yujiang(Army Engineering University of PLA,Nanjing Jiangsu 210000,China;Unit 31107 of PLA,Nanjing Jiangsu 210000,China;Jiangsu Ecological Environment Monitoring Center,Nanjing Jiangsu 210000,China)

机构地区陆军工程大学 [ 江苏省生态环境监控中心

出处《通信技术》 2021年第8期1917-1924,共8页 Communications Technology

基金江苏省自然科学基金项目(No.BK20160080)。

关键词认知无线电频谱切换强化学习近端策略优化(PPO) cognitive radio spectrum handoff reinforcement learning PPO(Proximal Policy Optimization)

分类号 TN929 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献1

1Xiangwei Zhou,Mingxuan Sun,Geoffrey Ye Li,Biing-Hwang (Fred) Juang.Intelligent Wireless Communications Enabled by Cognitive Radio and Machine Learning[J].China Communications,2018,15(12):16-48. 被引量：11

二级参考文献2

1Jinlong Wang,Guoru Ding,Qihui Wu,Liang Shen,Fei Song.Spatial-temporal spectrum hole discovery:a hybrid spectrum sensing and geolocation database framework[J].Chinese Science Bulletin,2014,59(16):1896-1902. 被引量：8
2Tianqi Wang,Chao-Kai Wen,Hanqing Wang,Feifei Gao,Tao Jiang,Shi Jin.Deep Learning for Wireless Physical Layer: Opportunities and Challenges[J].China Communications,2017,14(11):92-111. 被引量：59

共引文献10

1Lin Zhang,Ying-Chang Liang,Dusit Niyato.6G Visions:Mobile Ultra-Broadband,Super Internet-of-Things,and Artificial Intelligence[J].China Communications,2019,16(8):1-14. 被引量：59
2卢光跃,施聪,吕少卿,周亮.基于LSTM神经网络的频谱感知算法[J].信号处理,2019,35(12):2070-2076. 被引量：8
3Wei Liang,Soon Xin Ng,Jia Shi,Lixin Li,Dawei Wang.Energy Efficient Transmission in Underlay CR-NOMA Networks Enabled by Reinforcement Learning[J].China Communications,2020,17(12):66-79. 被引量：2
4卢光跃,赵彬辰,施聪,吕少卿.基于均值辅助的LSTM网络频谱感知算法[J].信号处理,2021,37(3):409-416. 被引量：4
5岳文静,崔恒瑞,陈志.基于卷积神经网络的自适应频谱感知模型[J].计算机技术与发展,2021,31(5):62-66. 被引量：1
6Haoxiang Sun,Changxing Chen,Yunfei Ling,Mu Yang.Cooperative Perception Optimization Based on Self-Checking Machine Learning[J].Computers, Materials & Continua,2020(2):747-761.
7Rui Ma,Yuji Komatsuzaki,Mouhacine Benosman,Koji Yamanaka,Shintaro Shinjo.机器学习开启了功率放大器的新领域[J].变频器世界,2021(10):34-38.
8Amir Haider,Muhammad Adnan Khan,Abdur Rehman,Muhib Ur Rahman,Hyung Seok Kim.A Real-Time Sequential Deep Extreme Learning Machine Cybersecurity Intrusion Detection System[J].Computers, Materials & Continua,2021(2):1785-1798. 被引量：4
9任昕.一种改进的特征值-LSTM微弱信号盲检测方法[J].应用科技,2022,49(5):67-73.
10王诗,姜涵,朱笑莹,张敏,程思瑶.一种适应时变信道的多用户多信道自适应协议[J].电波科学学报,2023,38(5):877-886. 被引量：1

1彭云飞,邵尉,卢春兰.认知无线电网络中基于博弈论的频谱切换方法[J].通信技术,2021,54(8):1925-1929. 被引量：1
2彭艺,朱桢以,魏翔,谢钊萍.一种基于强化Q学习的跳频交会算法[J].通信技术,2021,54(8):1820-1826. 被引量：1
3沈畔阳(编译).5个投资[J].做人与处世,2021(17):59-59.
4康守强,刘哲,王玉静,王庆岩,兰朝凤.基于改进DQN网络的滚动轴承故障诊断方法[J].仪器仪表学报,2021,42(3):201-212. 被引量：23
5颜廷秋,申滨,王欣.基于无线指纹数据库的认知无线电频谱感知[J].电子技术应用,2021,47(7):69-73.
6马永娟,尹燕莉,马什鹏.基于DP的随机模型预测控制能量管理研究[J].汽车工程师,2021(8):40-44. 被引量：1
7奚玉芹,金永红,韩钰,魏萌.现金股利分配、投资效率与投资者回报[J].管理评论,2021,33(6):280-293. 被引量：12
8赵阳.投资要做时间的朋友[J].理财周刊,2021(3):36-37.

通信技术

2021年第8期

浏览历史

内容加载中请稍等...

基于近端策略优化算法的频谱切换问题研究

参考文献1

二级参考文献2

共引文献10

相关作者

相关机构

相关主题

浏览历史