基于鱼群涌现行为启发的集群机器人硬注意力强化模型

Hard attention reinforcement model for swarm robotics inspired by fish school emergence behavior

下载PDF

导出

摘要生物集群运动模型能使集群机器人涌现秩序,但是所形成的机器人自然集群秩序难以有效地被人工控制,为此提出鱼群硬注意力模型来解析实验鱼群数据中的交互行为。该模型通过编码器网络、图注意力网络、信息聚合网络、预解码网络以及最终解码网络等结构来获取焦点单体的重要邻居;再利用深度确定性策略梯度技术设计轨道强化网络与安全强化网络,以实现集群的人工控制。多智能体仿真与集群机器人实验结果表明:所提方法能够实现集群的人工轨道、安全控制,重要邻居信息为解决集群运动的强化学习难题提供了新思路,所提控制模型在无人机群空中协作、智慧农机集群作业、物流仓储多体搬运等领域具有较大的应用潜力。 The biological swarm motion model enables the emergence of order in robot collectives,but controlling the natural swarm order formed by robots is challenging.To address this issue,this paper proposed the fish school hard attention model to analyze interaction behaviors in experimental fish school data.This model utilized structures such as an encoder network,graph attention network,information aggregation network,pre-decoding network and a final decoding network to capture crucial information about the focal individual s important neighbors.Subsequently,it employed deep deterministic policy gradient techniques to design trajectory reinforcement networks and safety reinforcement networks to achieve artificial control of the swarm.Results from multi-agent simulations and experiments with swarm robotics demonstrate that the proposed method can realize artificial trajectory and safety control of collectives.The utilization of high-attention neighborhood information for resolving reinforcement learning challenges in collective motion provides a novel approach.The proposed control model exhibits substantial potential applications in areas such as collaborative aerial operations of drone swarms,intelligent agricultural machinery operations,and multi-robot material handling in logistics and warehousing.

作者刘磊葛振业林杰陶宇孙俊杰 Liu Lei;Ge Zhenye;Lin Jie;Tao Yu;Sun Junjie(School of Management,University of Shanghai for Science&Technology,Shanghai 200093,China;School of Optoelectronics,University of Shanghai for Science&Technology,Shanghai 200093,China)

机构地区上海理工大学管理学院上海理工大学光电信息与计算机工程学院

出处《计算机应用研究》 CSCD 北大核心 2024年第9期2737-2744,共8页 Application Research of Computers

基金上海市自然科学基金资助项目(22ZR1443300)。

关键词自然秩序人工控制集群硬注意力机制多智能体运动强化学习集群机器人任务控制 natural order artificial control collective hard attention mechanism multi-agents motion reinforcement learning swarm robotics task control

分类号 TP242.6 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

1范寿林.合肥市“四大一优”专项行动保平安[J].长安,2024(2):38-38.
2刘锡琳,潘文松,张爱军.基于改进YOLOv5s的轮毂气门孔检测算法[J].电子设计工程,2024,32(19):140-144.
3合成生物学是否挑战传统生命观[J].中国社会科学文摘,2024(8):153-153.
4乔文超,聂伟民,杜选民,周胜增,温斌荣.基于MPC-MRAC的用于子母式AUV投放的子AUV控制算法[J].船舶工程,2024,46(4):112-122.
5张啸成,王涛,田昕,张永刚.基于移位窗口自注意力机制的新生儿脑区域图像分割[J].吉林大学学报（理学版）,2024,62(5):1129-1137.
6刘金存,任崟杰,徐战,安冬,位耀光.从自然灵感角度出发的群体智能集群机器人系统研究综述[J].信息与控制,2024,53(2):154-181.
7龚凯,向俊,刘林芽.基于货物列车抗脱轨安全度的重载铁路轨道结构强化措施评价[J].中南大学学报（自然科学版）,2020,51(3):832-841. 被引量：5
8刘敬东.马克思考察价值形式的历史意识与阶级意识——基于《资本论》第一卷“商品”章的考察[J].马克思主义理论学科研究,2024,10(6):22-34.
9梅晓虎,吕小强,雷萌.基于Stair−YOLOv7−tiny的煤矿井下输送带异物检测[J].工矿自动化,2024,50(8):99-104.
10李潇洋,陈健,常剑波.基于语义分割的视频鱼类特征提取方法研究[J].水生态学杂志,2024,45(5):204-212.

计算机应用研究

2024年第9期

浏览历史

内容加载中请稍等...

基于鱼群涌现行为启发的集群机器人硬注意力强化模型

相关作者

相关机构

相关主题

浏览历史