期刊文献+

无蜂窝大规模MIMO中基于深度强化学习的无人机辅助通信与资源调度 被引量:5

UAV Assisted Communication and Resource Scheduling in Cell-free Massive MIMO Based on Deep Reinforcement Learning Approach
下载PDF
导出
摘要 无蜂窝大规模多入多出(MIMO)网络中分布式接入点(AP)同时服务多个用户,可以实现较大区域内虚拟MIMO的大容量传输;而无人机辅助通信能够为该目标区域热点或边缘用户提供覆盖增强。为了降低反馈链路负载,并有效提升无人机辅助通信的频谱利用率,该文研究了基于AP功率分配、无人机服务区选择和接入用户选择的联合调度;首先将AP功率分配和无人机服务区选择问题联合建模为双动作马尔可夫决策过程(DAMDP),提出了基于Q-learning和卷积神经网络(CNN)的深度强化学习(DRL)算法;然后将用户调度构造为一个0-1优化问题,并分解成子问题来求解。仿真结果表明,该文提出的基于DRL的资源调度方案与现有方案相比,可以有效提升无蜂窝大规模MIMO网络中频谱利用率。 Distributed Access Points(AP)in the cell-free massive Multiple Input Multiple Output(MIMO)networks serve multiple users at the same time,which can achieve large-capacity transmission of virtual MIMO in a larger area.Unmanned Aerial Vehicle(UAV)assisted communication can provide coverage enhancement for hotspots or edge users in this area.In order to improve the spectrum efficiency and reduce the feedback overhead,a joint resource scheduling scheme that includes AP power allocation,UAV service zone selection and user scheduling is proposed in this paper.Firstly,the AP power allocation and the UAV service zone selection problems are jointly modeled as a Double-Action Markov Decision Process(DAMDP).Then,a Deep Reinforcement Learning(DRL)algorithm based on Q-learning and Convolutional Neural Networks(CNN)is proposed.Furthermore,the user scheduling problem is formulated as a 0-1 optimization problem and solved by dividing into sub-problems.Simulation results demonstrate that the proposed DRL-based resource scheduling scheme exhibits a higher spectrum efficiency than existing schemes.
作者 王朝炜 邓丹昊 王卫东 江帆 WANG Chaowei;DENG Danhao;WANG Weidong;JIANG Fan(School of Electronic Engineering,Beijing University of Posts and Telecommunications,Beijing 100876,China;Key Laboratory of Universal Wireless Communications,Ministry of Education,Beijing 100876,China;School of Communication and Information Engineering,Xi’an University of Posts and Telecommunications,Xi’an 710061,China)
出处 《电子与信息学报》 EI CSCD 北大核心 2022年第3期835-843,共9页 Journal of Electronics & Information Technology
基金 国家重点研发计划(2020YFB1807204)。
关键词 无蜂窝大规模MIMO 无人机辅助通信 资源调度 深度增强学习 Cell-free massive MIMO UAV assisted communication Resource scheduling Deep Reinforcement Learning(DRL)
  • 相关文献

参考文献2

二级参考文献22

  • 1Tank D W, Hopfield J J. Simple "Neural" Optimization Networks: An A/D Converter, Signal Decision Circuit,and a Linear Programming Circuit. IEEE Trans. on Circ.Sys., 1986,33(5):533~541.
  • 2Tagliarini G A, Page E W. Solving Constraints Satisfaction Problem with Neural network. Proc. of IEEE 1st IJCNN,1987, Ⅲ :741~747.
  • 3Wilson G V, Pawley G S. On the Stability of the Traveling Salesman Problem Algorithm of Hopfield and Tank. Biolog. Cybernet, 1988, 58:63~70.
  • 4Nozawa H. Solution of the Optimization Problem Using the Neural Network Model as a Globally Coupled Map. Physical D, 1994, 75(1 - 3): 179~ 189.
  • 5Hopfield J J, Tank D W. "Neural" Computation of Decisions in Optimization Problems. Biolog. Cybern., 1985,52(1): 141~152.
  • 6Hopfietd J J. Neurons with Graded Response Have Collective Computational Properties tike Those of Two-state Neurons. Proc. of Nat. Academy Sci., USA, 1984, 81:3088~ 3092.
  • 7Bamnister J A, Trivedi K S. Task Allocation in Fault-tolerant Distributed System. in Hard Real-Time Systems (Tutorial). IEEE Computer Society Press, 1988: 256 ~272.
  • 8Aihara K, Takabe T, Toyoda M. Chaotic Neural Networks. Phys. Lett. A, 1990, 144(6,7): 333-340.
  • 9Smith K, Palaniswami M. Static and Dynamic Channel Assignment Using Neural Networks. IEEE J. Selected Areas Commun., 1997, 15(2) :238~249.
  • 10Kirkpatrick K, Gelatt C D, Vecchi P V. Optimizatiom by Simulated Annealing. Science, 1983, 220: 671 680.

共引文献55

同被引文献48

引证文献5

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部