一种基于NOMA的Q学习卫星通信随机接入方法

A NOMA-based Q-learning random access scheme for satellite communications

下载PDF

导出

摘要基于非正交多址接入(NOMA)的Q学习(Q-Learning)随机接入方法(NORA-QL)是实现物联网中海量设备泛在接入的一项有效技术。为了解决NORA-QL方法仍存在的传输能效和过载容量较低的问题,提出了一种适合卫星通信网络的改进方法(I-NORA-QL)。针对传输功耗高的问题,I-NORA-QL利用卫星广播的全局信息改进Q学习的学习策略,将用户发射功率用于奖励函数的构造,同时将学习速率设计为与算法迭代次数相关的衰减函数。I-NORA-QL进一步在接入类别限制ACB(Access Class Barring)的基础上,基于学习过程中的Q值特性和负载估计实现ACB限制因子的自适应调整以进行过载控制。仿真结果表明,提出的I-NORA-QL改进方法相比于现有其他方法,能够有效降低用户设备的平均功耗,且在系统过载状态下可以显著提高吞吐量。 The Non-Orthogonal Multiple Access(NOMA)-based Q-learning random access method(NORA-QL)is an effective technique to achieve ubiquitous access to a large number of devices in the Internet of Things.In order to solve the problems of low transmission energy efficiency and low overload capacity in the NORA-QL method,an improved method(I-NORA-QL)suitable for satellite communication networks is proposed.To address the problem of high transmission power consumption,I-NORA-QL improves the learning strategy of Q-learning using global information from satellite broadcasting,the transmitted power of user equipment is used in the construction of the reward function,and the learning rate is designed as a decay function related to the number of iterations of the algorithm.Furthermore,based on the Access Class Barring(ACB),I-NORA-QL realizes the adaptive adjustment of ACB barring factor based on the Q value characteristics and load estimation during the learning process to carry out overload control.Simulation results show that,compared with other existing methods,the proposed I-NORA-QL improved method can effectively reduce the average power consumption of user devices,and significantly improve the throughput under system overload state.

作者杨伟康许小东 YANG Weikang;XU Xiaodong(CAS Key Laboratory of Wireless-Optical Communications,University of Science and Technology of China,Hefei 230026,China)

机构地区中国科学技术大学中科院无线光电通信重点实验室

出处《遥测遥控》 2022年第2期25-35,共11页 Journal of Telemetry,Tracking and Command

关键词卫星通信随机接入能量效率过载控制非正交多址接入 Q学习 Satellite communications Random access Energy efficiency Overload control Non-Orthogonal Multiple Access Q-learning

分类号 TN927.2 [电子电信—通信与信息系统]

引文网络
相关文献

1周宁浩,侯嘉.基于黄金分割的NOMA-SWIPT协作中继网络能效优化算法研究[J].科学技术与工程,2022,22(8):3169-3175. 被引量：1
2陈慧,张铭宇,李兴旺,孙江峰,李美玲.I/Q失衡影响下无人机多向全双工中继NOMA传输系统性能分析[J].电子与信息学报,2022,44(3):987-995. 被引量：2
3王娇,邱恭安,张士兵.交通应急通信中信道自适应的业务接入机制[J].电信科学,2022,38(1):95-101.
4曾柏森,钟勇,牛宪华.基于因子分解机用于安全探索的Q表初始化方法[J].计算机应用,2022,42(1):209-214.
5钟剑峰,王红军.适用于无人机集群应急通信系统分簇路由协议[J].火力与指挥控制,2022,47(2):56-66. 被引量：6
6杨生华,舒巧云.例谈导数中与构造函数有关的两类问题[J].高中数理化,2022(3):51-52.
7胡明.基于强化学习算法的用户异构接入策略研究[J].西安文理学院学报（自然科学版）,2022,25(1):40-44.
8薛珍,艾渤,马国玉,马毅琰,李庚乾.面向5G-R大规模物联网的新型多址方案[J].铁道学报,2022,44(2):56-63. 被引量：2
9石皓南,孙顺远.上行链路协作NOMA中基于TOA定位的研究[J].计算机与数字工程,2022,50(3):619-624.
10贺伊琳,宋若旸,马建.基于强化学习DDPG的智能车辆轨迹跟踪控制[J].中国公路学报,2021,34(11):335-348. 被引量：13

遥测遥控

2022年第2期

浏览历史

内容加载中请稍等...

一种基于NOMA的Q学习卫星通信随机接入方法

相关作者

相关机构

相关主题

浏览历史