6 articles found
Mutual information oriented deep skill chaining for multi‐agent reinforcement learning
1
Authors: Zaipeng Xie, Cheng Ji, Chentai Qiao, WenZhan Song, Zewen Li, Yufeng Zhang, Yujing Zhang. CAAI Transactions on Intelligence Technology (SCIE, EI), 2024, No. 4, pp. 1014-1030 (17 pages)
Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents. However, in high‐dimensional continuous spaces, the non‐stationary environment can provide outdated experiences that hinder convergence, resulting in ineffective training performance for multi‐agent systems. To tackle this issue, a novel reinforcement learning scheme, Mutual Information Oriented Deep Skill Chaining (MioDSC), is proposed that generates an optimised cooperative policy by incorporating intrinsic rewards based on mutual information to improve exploration efficiency. These rewards encourage agents to diversify their learning process by engaging in actions that increase the mutual information between their actions and the environment state. In addition, MioDSC can generate cooperative policies using the options framework, allowing agents to learn and reuse complex action sequences and accelerating the convergence speed of multi‐agent learning. MioDSC was evaluated in the multi‐agent particle environment and the StarCraft multi‐agent challenge at varying difficulty levels. The experimental results demonstrate that MioDSC outperforms state‐of‐the‐art methods and is robust across various multi‐agent system tasks with high stability.
Keywords: artificial intelligence techniques; decision making; intelligent multiagent systems
Event-triggered and consensus-based attitude tracking alignment for discrete-time multiple spacecraft system exploiting interference
2
Authors: Peiran LI, Xin WEN, Haiying LIU, Yuping LU. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2023, No. 2, pp. 241-255 (15 pages)
This paper presents a discrete-time attitude control strategy with equi-global practical stabilizability for aligning the attitude of multiple spacecraft to a predesigned configuration according to a time-variant reference. By utilizing the interference of the wireless channel, the communication scheme designed in this paper can save communication resources, computation, and energy in proportion to the number of spacecraft. The exact discrete-time model and approximate discrete-time model of the consensus-based spacecraft tracking system are given. Then a framework is developed for designing an event-triggered control scheme for the exact discrete-time system via its approximate models, which avoids periodic actuation, and Zeno behavior is proved to be excluded. Furthermore, the control scheme can handle the presence of an unknown fading channel. Finally, simulation results are presented to demonstrate the effectiveness of the control strategy.
Keywords: Attitude control; Cooperative communication; Discrete time control systems; Multi-agent systems; Spacecraft
Distributed state estimation for linear time-invariant dynamical systems: A review of theories and algorithms
3
Authors: Shuaiting HUANG, Yuzhe LI, Junfeng WU. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2022, No. 6, pp. 1-17 (17 pages)
Distributed state estimation is of paramount importance in many applications involving large-scale complex systems over spatially deployed networked sensors. This paper provides an overview of the analysis of distributed state estimation algorithms for linear time-invariant systems. A number of previous works are reviewed and a clear classification of the main approaches in this field is presented, i.e., Kalman-filter-type methods and Luenberger-observer-type methods. The design and the stability analysis of these methods are discussed. Moreover, a comprehensive comparison of the existing results is provided in terms of standard metrics including graph connectivity, system observability, optimality, and time scale. Finally, several important and challenging future research directions are discussed.
Keywords: Dynamical systems; Distributed estimation; Kalman filter; Luenberger observer; Linear systems; Multi-agent systems; State estimation
Dynamic event-triggered-based human-in-the-loop formation control for stochastic nonlinear MASs (Cited by: 1)
4
Authors: Yonghua Peng, Guohuai Lin, Guangdeng Chen, Hongyi Li. Security and Safety, 2023, No. 4, pp. 49-65 (17 pages)
The dynamic event-triggered (DET) formation control problem of a class of stochastic nonlinear multi-agent systems (MASs) with full state constraints is investigated in this article. Supposing that the human operator sends commands to the leader as control input signals, all followers keep formation through network topology communication. Under the command-filter-based backstepping technique, radial basis function neural networks (RBF NNs) and the barrier Lyapunov function (BLF) are utilized to resolve the problems of unknown nonlinear terms and full state constraints, respectively. Furthermore, a DET control mechanism is proposed to reduce the occupation of communication bandwidth. The presented distributed formation control strategy guarantees that all signals of the MASs are semi-globally uniformly ultimately bounded (SGUUB) in probability. Finally, the feasibility of the theoretical result is demonstrated by a simulation example.
Keywords: Dynamic event-triggered (DET) control; formation control; full state constraints; human-in-the-loop (HiTL); multi-agent systems (MASs)
Distributed coordinated control scheme for morphing wings based on sampled communication (in English) (Cited by: 5)
5
Authors: Wu Jun (吴俊), Lu Yuping (陆宇平). Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2010, No. 3, pp. 364-369 (6 pages)
To investigate the control of morphing wings by means of interacting effectors, this article proposes a distributed coordinated control scheme with sampled communication on the basis of a simple morphing wing model established with arrayed agents. The control scheme can change the shape of the airfoil into an expected one and keep it smooth during morphing. As the interconnection of the communication network and the agents would make the behavior of the morphing wing system complicated, a diagrammatic stability analysis method is put forward to ensure system stability. Two simulations are carried out on the morphing wing system using MATLAB. The results attest to the feasibility of the distributed coordinated control scheme and the effectiveness of the diagrammatic stability analysis method.
Keywords: morphing wing; multi-agent systems; distributed control; coordinated control; system stability
Cooperative coalition for formation flight scheduling based on incomplete information (Cited by: 6)
6
Authors: Meng Linghang, Xu Xiaohao, Zhao Yifei. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2015, No. 6, pp. 1747-1757 (11 pages)
This study analyzes the cooperative coalition problem for formation scheduling based on incomplete information. A multi-agent cooperative coalition framework is developed to optimize the formation scheduling problem in a decentralized manner. The social class differentiation mechanism and role-assuming mechanism are incorporated into the framework, which, in turn, ensures that the multi-agent system (MAS) evolves in the optimal direction. Moreover, a further differentiation pressure can be achieved to help the MAS escape from local optima. A Bayesian coalition negotiation algorithm is constructed, within which the Harsanyi transformation is introduced to transform the coalition problem based on incomplete information into the Bayesian-equivalent coalition problem based on imperfect information. The simulation results suggest that the distribution of agents' expectations of other agents' unknown information approximates the true distribution after a finite number of generations. The comparisons indicate that the MAS cooperative coalition algorithm produces a significantly better utility and is more effective at escaping from local optima than the proposal-engaged marriage algorithm and the Simulated Annealing algorithm.
Keywords: Coalition; Commercial aviation; Formation flight; Harsanyi transformation; Incomplete information; Multi-agent systems (MAS); Scheduling