Collaborative altitude-adaptive reinforcement learning for active search with unmanned aerial vehicle swarms

Abstract: Active search with unmanned aerial vehicle (UAV) swarms in cluttered and unpredictable environments is a critical challenge in search and rescue missions, where rapid localization of survivors is of paramount importance because the majority of urban disaster victims are surface casualties. However, the sensor performance of a UAV depends on its altitude, which introduces a trade-off between coverage and accuracy and strongly influences the coordination and decision-making of the swarm. An effective strategy has to balance exploring larger areas at higher altitudes against exploiting regions of high target probability at lower altitudes. To address these challenges, collaborative altitude-adaptive reinforcement learning (CARL) is proposed. It incorporates an altitude-aware sensor model, a confidence-informed assessment module, and an altitude-adaptive planner based on proximal policy optimization (PPO). CARL enables UAVs to dynamically adjust their sensing locations and make informed decisions. Furthermore, a tailored reward shaping strategy is introduced to maximize search efficiency in extensive environments. Comprehensive simulations under diverse conditions show that CARL surpasses baseline methods, achieving a 12% improvement in full recovery rate, which indicates its potential for enhancing the effectiveness of UAV swarms in active search missions.
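The coverage/accuracy trade-off described in the abstract can be illustrated with a small sketch. This is not the CARL implementation: the abstract gives no formulas, so the linear accuracy decay, the square footprint, the symmetric error rates, and every name and default value below (`footprint_half_width`, `detection_accuracy`, `bayes_update`, `cells_in_view`, `fov_deg`, `p_max`, `p_min`, `h_max`) are assumptions chosen only for illustration.

```python
import math


def footprint_half_width(altitude_m, fov_deg=60.0):
    """Half-width (m) of an assumed square ground footprint at a given altitude."""
    return altitude_m * math.tan(math.radians(fov_deg) / 2.0)


def detection_accuracy(altitude_m, p_max=0.95, p_min=0.55, h_max=30.0):
    """Assumed probability that a per-cell reading is correct.

    Decays linearly from p_max at ground level to p_min at h_max (hypothetical values).
    """
    frac = min(max(altitude_m / h_max, 0.0), 1.0)
    return p_max - (p_max - p_min) * frac


def bayes_update(prior, observed_target, accuracy):
    """Posterior probability that a cell holds a target after one binary reading.

    Assumes symmetric false-positive and false-negative rates of (1 - accuracy).
    """
    if observed_target:
        like_target, like_empty = accuracy, 1.0 - accuracy
    else:
        like_target, like_empty = 1.0 - accuracy, accuracy
    numerator = like_target * prior
    return numerator / (numerator + like_empty * (1.0 - prior))


def cells_in_view(cx, cy, altitude_m, grid_size, cell_m=1.0):
    """Grid cells covered by the footprint of a UAV hovering above cell (cx, cy)."""
    half = int(footprint_half_width(altitude_m) // cell_m)
    xs = range(max(0, cx - half), min(grid_size, cx + half + 1))
    ys = range(max(0, cy - half), min(grid_size, cy + half + 1))
    return [(x, y) for x in xs for y in ys]


if __name__ == "__main__":
    for h in (2.0, 10.0, 20.0):
        n_cells = len(cells_in_view(10, 10, h, grid_size=20))
        acc = detection_accuracy(h)
        post = bayes_update(0.5, True, acc)
        print(f"altitude={h:5.1f} m  cells={n_cells:4d}  accuracy={acc:.3f}  posterior={post:.3f}")
```

Under these assumptions, a higher altitude covers more cells per observation but moves the belief less per reading, which is the tension an altitude-adaptive planner would have to resolve.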

Authors: XIAO Zijian; Chen-Chun Hsia; XU Yanggang; REN Jiyuan; CHEN Xinlei (Shenzhen International Graduate School, Tsinghua University, Shenzhen 518000, China; Pengcheng Laboratory, Shenzhen 518000, China; RISC-V International Open Source Laboratory, Shenzhen 518000, China)

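Affiliations: Shenzhen International Graduate School, Tsinghua University; Pengcheng Laboratory; RISC-V International Open Source Laboratory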

Source: Chinese Journal on Internet of Things (《物联网学报》), 2024, No. 3, pp. 36-45 (10 pages)

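Funding: National Key Research and Development Program of China (No. 2022YFB3300703); National Natural Science Foundation of China (No. 62371269); Shenzhen Stable Support Program (No. WDZC20220811103500001); Cross-disciplinary Research and Innovation Fund of Tsinghua Shenzhen International Graduate School (No. JC20220011).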

Keywords: reinforcement learning; Bayesian learning; collaborative UAV swarms; active search framework

Classification: TP393 [Automation and Computer Technology: Computer Application Technology]
