期刊文献+

Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning 被引量:1

原文传递
导出
摘要 In this paper,multi-unmanned aerial vehicle(multi-UAV)and multi-user system are studied,where UAVs are served as aerial base stations(BS)for ground users in the same frequency band without knowing the locations and channel parameters for the users.We aim to maximize the total throughput for all the users and meet the fairness requirement by optimizing the UAVs’trajectories and transmission power in a centralized way.This problem is non-convex and very difficult to solve,as the locations of the user are unknown to the UAVs.We propose a deep reinforcement learning(DRL)-based solution,i.e.,soft actor-critic(SAC)to address it via modeling the problem as a Markov decision process(MDP).We carefully design the reward function that combines sparse with non-sparse reward to achieve the balance between exploitation and exploration.The simulation results show that the proposed SAC has a very good performance in terms of both training and testing.
出处 《Journal of Communications and Information Networks》 EI CSCD 2022年第2期192-201,共10页 通信与信息网络学报(英文)
基金 National Nat-ural Science Foundation of China(62101161) Shenzhen Basic Research Program(20200811192821001) Shenzhen Basic Research Program(JCYJ20190808122409660) Guangdong Basic Research Program(2019A1515110358) Guangdong Basic Research Program(2021A1515012097) Guangdong Basic Research Program(2020ZDZX1037) Guangdong Basic Research Program(2020ZDZX1021) open research fund of National Mobile Communications Research Laboratory,Southeast University(2021D16) open research fund of National Mobile Communications Research Laboratory,Southeast University(2022D02)。
  • 相关文献

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部