Beamforming Optimization for Integrated Sensing and Communication Systems: A Deep Reinforcement Learning Approach


Abstract: In the near future, integrated sensing and communication (ISAC) systems are expected to provide communication and sensing services simultaneously. Such systems need advanced beamforming algorithms to guarantee quality of service while meeting diverse service targets and resource constraints. Beamforming design is typically formulated as an optimization problem. However, algorithms built on traditional optimization theory can only handle resource allocation problems with instantaneous constraints; they cannot handle problems with long-term constraints, which degrades system performance. One feasible solution is to design the algorithm on the basis of reinforcement learning (RL). Existing work, however, focuses mainly on unconstrained RL and pays little attention to constrained RL, which limits the application of RL to beamforming optimization. To overcome this challenge, an RL method based on constrained stochastic successive convex approximation (CSSCA) is proposed. The method replaces the original objective and constraint functions with convex surrogate functions and solves a sequence of convex approximation problems, which guarantees convergence to a Karush-Kuhn-Tucker (KKT) point of the original problem. Finally, simulation results demonstrate the superiority of the proposed method.
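The surrogate-and-solve loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a deterministic toy variant with exact gradients, a fixed step size `gamma`, and a feasible starting point, whereas CSSCA works with stochastic gradient estimates. The toy problem (minimize a power-like term `x0^2 + x1^2` subject to the nonconvex requirement `x0 * x1 >= 1`), the function names, and the parameters `tau`, `gamma`, and `iters` are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical toy problem (not from the paper):
#   minimize f(x) = x0^2 + x1^2   subject to   g(x) = 1 - x0*x1 <= 0
def f(x):
    return x @ x

def grad_f(x):
    return 2.0 * x

def g(x):
    return 1.0 - x[0] * x[1]

def grad_g(x):
    return np.array([-x[1], -x[0]])

def solve_surrogate(gf, g0, gg, tau):
    """Solve the convex subproblem in the step d = x - x_t:
         min_d  gf@d + tau*|d|^2   s.t.  g0 + gg@d + tau*|d|^2 <= 0
    by bisection on the Lagrange multiplier lam >= 0."""
    def step(lam):
        # Stationary point of the Lagrangian for a given multiplier.
        return -(gf + lam * gg) / (2.0 * tau * (1.0 + lam))

    def g_bar(d):
        # Convex surrogate of the constraint around the current iterate.
        return g0 + gg @ d + tau * (d @ d)

    d = step(0.0)
    if g_bar(d) <= 0.0:
        return d  # surrogate constraint inactive
    lo, hi = 0.0, 1.0
    while g_bar(step(hi)) > 0.0:
        hi *= 2.0
        if hi > 1e12:
            raise RuntimeError("surrogate subproblem appears infeasible")
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if g_bar(step(mid)) > 0.0:
            lo = mid
        else:
            hi = mid
    return step(hi)  # hi always satisfies the surrogate constraint

def sca(x0, tau=1.0, gamma=0.5, iters=200):
    """Successive convex approximation with a fixed step size (a simplification)."""
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        d = solve_surrogate(grad_f(x), g(x), grad_g(x), tau)
        x = x + gamma * d  # move toward the subproblem solution
    return x

x_star = sca([2.0, 1.5])  # feasible start: g = 1 - 3 = -2
print(x_star, f(x_star), g(x_star))
```

For this toy problem the KKT points are (1, 1) and (-1, -1), so a start in the positive quadrant is expected to approach (1, 1). Choosing `tau = 1.0` makes the constraint surrogate an upper bound on `g`, which keeps every iterate feasible when the start is feasible.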

Authors: HUANG Zhe; LIU An (Zhejiang University, Hangzhou 310007, China)

Affiliation: Zhejiang University

Source: Mobile Communications (《移动通信》), 2024, No. 10, pp. 41-48 (8 pages)

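Funding: National Natural Science Foundation of China, "Joint Compressive Channel Estimation and Localization-Tracking Method Based on Deep Stochastic Optimization" (62071416).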

Keywords: integrated sensing and communication; beamforming optimization; deep reinforcement learning; constrained stochastic successive convex approximation

Classification: TN929.5 [Electronics and Telecommunications: Communication and Information Systems]
