Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning

下载PDF

导出

摘要 To solve the problem of multi-target hunting by an unmanned surface vehicle(USV)fleet,a hunting algorithm based on multi-agent reinforcement learning is proposed.Firstly,the hunting environment and kinematic model without boundary constraints are built,and the criteria for successful target capture are given.Then,the cooperative hunting problem of a USV fleet is modeled as a decentralized partially observable Markov decision process(Dec-POMDP),and a distributed partially observable multitarget hunting Proximal Policy Optimization(DPOMH-PPO)algorithm applicable to USVs is proposed.In addition,an observation model,a reward function and the action space applicable to multi-target hunting tasks are designed.To deal with the dynamic change of observational feature dimension input by partially observable systems,a feature embedding block is proposed.By combining the two feature compression methods of column-wise max pooling(CMP)and column-wise average-pooling(CAP),observational feature encoding is established.Finally,the centralized training and decentralized execution framework is adopted to complete the training of hunting strategy.Each USV in the fleet shares the same policy and perform actions independently.Simulation experiments have verified the effectiveness of the DPOMH-PPO algorithm in the test scenarios with different numbers of USVs.Moreover,the advantages of the proposed model are comprehensively analyzed from the aspects of algorithm performance,migration effect in task scenarios and self-organization capability after being damaged,the potential deployment and application of DPOMH-PPO in the real environment is verified.

作者 Jiawei Xia Yasong Luo Zhikun Liu Yalun Zhang Haoran Shi Zhong Liu

机构地区 College of Weaponry Engineering Institute of Vibration and Noise

出处《Defence Technology（防务技术）》 SCIE EI CAS CSCD 2023年第11期80-94,共15页 Defence Technology

基金 financial support from National Natural Science Foundation of China(Grant No.61601491) Natural Science Foundation of Hubei Province,China(Grant No.2018CFC865) Military Research Project of China(-Grant No.YJ2020B117)。

关键词 Unmanned surface vehicles Multi-agent deep reinforcement learning Cooperative hunting Feature embedding Proximal policy optimization

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

参考文献4

1Jie-ru Fan,Dong-guang Li,Ru-peng Li,Yue Wang.Analysis on MAV/UAV cooperative combat based on complex network[J].Defence Technology（防务技术）,2020,16(1):150-157. 被引量：20
2Shou-yi Li,Mou Chen,Yu-hui Wang,Qing-xian Wu.Air combat decision-making of multiple UCAVs based on constraint strategy games[J].Defence Technology（防务技术）,2022,18(3):368-383. 被引量：12
3王石,张建强,杨舒卉,张博伦.国内外无人艇发展现状及典型作战应用研究[J].火力与指挥控制,2019,44(2):11-15. 被引量：49
4伊戈,刘忠,张建强,董蛟.基于改进终端滑模控制的USV航向跟踪控制方法[J].电光与控制,2020,27(10):12-16. 被引量：12

二级参考文献36

1朱涛,常国岑,施笑安.基于复杂网络的作战系统结构研究[J].火力与指挥控制,2008,33(S1):136-137. 被引量：17
2王小艺,刘载文,侯朝桢,原菊梅.防空武器多目标优化分配建模与决策[J].兵工学报,2007,28(2):228-231. 被引量：26
3沈寿林,张国宁,杜丹.基于复杂网络的作战系统结构研究[J].电子测量技术,2007,30(4):155-158. 被引量：24
4朱涛,常国岑,张水平,郭戎潇.基于复杂网络的指挥控制信息协同模型研究[J].系统仿真学报,2008,20(22):6058-6060. 被引量：29
5姜长生,丁全心,王建刚,王俊.多机协同空战中的威胁评估与目标分配[J].火力与指挥控制,2008,33(11):8-12. 被引量：23
6严汝建,庞硕,孙寒冰,庞永杰.Development and Missions of Unmanned Surface Vehicle[J].Journal of Marine Science and Application,2010,9(4):451-457. 被引量：73
7姚敏,朱艳萍,赵敏.敌对环境多无人机协同攻击策略研究[J].仪器仪表学报,2011,32(8):1891-1897. 被引量：7
8李家良.水面无人艇发展与应用[J].火力与指挥控制,2012,37(6):203-207. 被引量：123
9Yu Zhang,Jing Chen,Lincheng Shen.Hybrid hierarchical trajectory planning for a fixed-wing UCAV performing air-to-surface multi-target attack[J].Journal of Systems Engineering and Electronics,2012,23(4):536-552. 被引量：5
10杨哲,李曙林,周莉,谢紫龙.基于复杂网络空战体系作战网络拓扑模型分析[J].计算机仿真,2013,30(10):72-75. 被引量：9

共引文献87

1夏天冰,查伊倩,赵丽莉,李明原,王鸿东.无人船在港口安全保障中的应用研究[J].船舶工程,2023,45(7).
2李方旭,金久才,张杰,李立刚,戴永寿.一种用于无人船海面障碍物测距的双目视觉系统[J].舰船科学技术,2019,41(23):118-122.
3田野,唐国元,袁子建,李琪凡.基于LabVIEW的水面无人艇远程监控软件系统开发及应用[J].机械与电子,2020,38(3):53-57. 被引量：6
4谢慧,杨忠,吴有龙,顾娟.基于物联网的水面无人艇技术体系和系统功能架构的研究[J].物联网技术,2020,10(3):52-54. 被引量：2
5张卫东,刘笑成,韩鹏.水上无人系统研究进展及其面临的挑战[J].自动化学报,2020,46(5):847-857. 被引量：54
6张丽珍,高浩,吴迪,李卫,陆天辰.基于MPC的半潜式无人艇导航轨迹跟踪控制研究[J].全球定位系统,2020,45(3):63-70. 被引量：5
7侯瑞超,唐智诚,王博,颜秉卿,任桐炜,武港山.水面无人艇智能化技术的发展现状和趋势[J].中国造船,2020,61(S01):211-220. 被引量：21
8程烨.小型无人艇研究现状及关键技术[J].中国造船,2020,61(S01):241-249. 被引量：5
9孙庆鹏,黄宏友,田彬.AUV远距离快速布放方法研究[J].数字海洋与水下攻防,2020,3(4):333-338. 被引量：3
10阴启玉,韩昱.无人艇载雷达视频无线传输技术研究[J].南京信息工程大学学报（自然科学版）,2020,12(5):563-568. 被引量：4

1XIA Jiawei,ZHU Xufang,LIU Zhong,XIA Qingtao.LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle[J].Journal of Systems Engineering and Electronics,2023,34(5):1343-1358. 被引量：1
2龚芮.基于建筑工程造价预结算审查视角的成本管理分析[J].大众标准化,2023(22):81-82. 被引量：1
3宋吉广,张京晶,冯亮,林扬,李德隆,谷海涛.USV地形测量路径自主规划研究与应用[J].舰船科学技术,2023,45(22):86-92.
4张婷婷,杨学军.基于强化学习的城市场景下巡飞弹自主协同饱和攻击方法[J].指挥与控制学报,2023,9(4):457-468. 被引量：1
5廖登宇,张震,赵德京,崔浩岩.基于多智能体深度强化学习的机器人协作搬运方法[J].电子设计工程,2023,31(23):7-11.
6Xuanyi Xiao,Jianbing Yin,Lin Chen,Mingchang Wang,Yi Zhao,Zhiyi Li.Evolutionary Game-theoretic Modeling of Massive Distributed Renewable Energy Deployment Towards Low-carbon Distribution Networks[J].Journal of Modern Power Systems and Clean Energy,2023,11(5):1519-1528. 被引量：2
7Sichen Li,Di Cao,Weihao Hu,Qi Huang,Zhe Chen,Frede Blaabjerg.Multi-energy Management of Interconnected Multi-microgrid System Using Multi-agent Deep Reinforcement Learning[J].Journal of Modern Power Systems and Clean Energy,2023,11(5):1606-1617. 被引量：1
8高甲博,肖玮,何智杰.P3C-MADDPG算法的多无人机协同追捕对抗策略研究[J].指挥控制与仿真,2023,45(6):7-18.
9Bicheng CAI,Chengfei YUE,Fan WU,Xueqin CHEN,Yunhai GENG.A grasp planning algorithm under uneven contact point distribution scenario for space non-cooperative target capture[J].Chinese Journal of Aeronautics,2023,36(11):452-464. 被引量：1
10Maryam Bukhari,Sadaf Yasmin,Sheneela Naz,Mehr Yahya Durrani,Mubashir Javaid,Jihoon Moon,Seungmin Rho.A Smart Heart Disease Diagnostic System Using Deep Vanilla LSTM[J].Computers, Materials & Continua,2023,77(10):1251-1279. 被引量：2

Defence Technology（防务技术）

2023年第11期

浏览历史

内容加载中请稍等...

Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning

参考文献4

二级参考文献36

共引文献87

相关作者

相关机构

相关主题

浏览历史