期刊文献+
共找到1,081篇文章
< 1 2 55 >
每页显示 20 50 100
基于Multi-agents的智能变电站警报处理及故障诊断系统 被引量:12
1
作者 辛建波 廖志伟 《电力系统保护与控制》 EI CSCD 北大核心 2011年第16期83-88,共6页
针对传统变电站故障诊断的不足,在智能变电站架构的基础上,提出了基于multi-agents的智能变电站警报处理及故障诊断系统。根据智能变电站的体系结构、信息流和数据流特点,设计了警报处理、输变电设备诊断等主要功能模块,以此满足变电站... 针对传统变电站故障诊断的不足,在智能变电站架构的基础上,提出了基于multi-agents的智能变电站警报处理及故障诊断系统。根据智能变电站的体系结构、信息流和数据流特点,设计了警报处理、输变电设备诊断等主要功能模块,以此满足变电站事故分析各层次的功能需求。就警报处理和输变电设备故障诊断系统中各个agent及agent之间的协作机制等方面做了详细论述,实际变电站故障案例证明了该警报处理和输变电诊断模型的可行性和有效性。 展开更多
关键词 multi-agents 智能变电站 警报处理 故障诊断
下载PDF
基于Multi-Agents分布式医学诊断系统研究 被引量:4
2
作者 张全海 叶晨洲 施鹏飞 《信息与控制》 CSCD 北大核心 2003年第1期23-27,共5页
医学诊断系统是一个新兴的复杂的应用系统,人工智能技术,计算机协作支持技术及高速通信网络体系结构的发展促进了计算机支持的诊断系统的发展.当前医学诊断系统的难点在于如何利用网络这个资源分布平台来获取所需要的数据及在数据不完... 医学诊断系统是一个新兴的复杂的应用系统,人工智能技术,计算机协作支持技术及高速通信网络体系结构的发展促进了计算机支持的诊断系统的发展.当前医学诊断系统的难点在于如何利用网络这个资源分布平台来获取所需要的数据及在数据不完整状态进行推理求解,而这些问题的解决在于能够有一种机制使得能在一个标准的应用系统结构中准确的表示并获取信息及集成各种医学资源使之相互协作.本文描述了一种利用多智能体(Multi-agents system,MAS)体系结构和中间件(middleware)技术如公共请求代理结构(Common Object Request Broker Architecture,CORBA)进行设计的分布式医学诊断系统.该系统能集成多种医学资源和医学应用实体并且能实现参与诊断的医学实体之间的协作,以减少由于信息缺乏而带来的诊断偏差.另外本文还将一种实验室开发的模糊最小最大神经网络(Fuzzy Min—Max Neural Network,FMMNN)的模糊规则提取方法应用于该系统以证实该分布式诊断系统的优越性. 展开更多
关键词 multi-agents 分布式医学诊断系统 人工智能 计算机
下载PDF
基于multi-agents的网络防卫体系中预警定位系统的研究与实现 被引量:2
3
作者 汪芳 戴冠中 慕德俊 《西北工业大学学报》 EI CAS CSCD 北大核心 2010年第6期952-957,共6页
传统的网络安全措施,如加密认证、防火墙和入侵检测系统等,虽然在保护信息的保密性、完整性、可用性和控制访问方面有一定的效果,但在协同和预警方面依然存在不足。文章提出了1个基于multi-agents的网络安全防卫系统,该系统由协同预警... 传统的网络安全措施,如加密认证、防火墙和入侵检测系统等,虽然在保护信息的保密性、完整性、可用性和控制访问方面有一定的效果,但在协同和预警方面依然存在不足。文章提出了1个基于multi-agents的网络安全防卫系统,该系统由协同预警定位系统、协同审计系统、安全隔离系统、事故恢复系统等多个模块构成,模块之间由多个多级分层agents来负责通信任务。系统控制中心的agent server负责控制和协调整个安全体系,制定全网统一的安全控制策略。在该系统中,整个网络被划分成不同级别的分区,建立不同级别的协同预警定位系统,各分区既相互协作,又能够独立自治,通过协作的方式共同维护着整个网络的安全。在IPv6环境下测试的结果表明,该系统可以高效进行预警,IDS的捕获率约为95%、漏报率小于6%、误报率小于7%。 展开更多
关键词 multi-agents 协同防卫 预警定位 网络防护
下载PDF
基于Multi-Agents的多媒体信息检索引擎探讨 被引量:2
4
作者 张立厚 郑大庆 高京广 《图书馆论坛》 CSSCI 北大核心 2003年第6期118-120,共3页
在介绍了数字图书馆等概念的基础上 ,简要地介绍了基于Multi Agents (MAS)的多媒体信息检索引擎在数字图书馆中的应用 ,并结合当前的研究状况 ,描述了基于MAS的多媒体信息检索引擎应用的光明未来。
关键词 multi-agents 数字图书馆 多媒体信息检索 搜索引擎 智能代理技术
下载PDF
基于Multi-agents系统的黑启动决策方法
5
作者 叶凯 《西华大学学报(自然科学版)》 CAS 2005年第2期15-18,共4页
在黑启动过程中,建立相应的发电机、母线及线路开关等分层主体,进行相互通信与协调控制,实时监测电力系统的状态变化,并采用Petri net算法进行优化建模,从而提出相应的故障恢复方案或是大停电状态下的黑启动方案。
关键词 multi-agENT 黑启动 故障恢复 PETRI-NET
下载PDF
Maintaining Complex Formations and Avoiding Obstacles for Multi-Agents
6
作者 Yali Wang Youqian Feng +1 位作者 Zhonghai Yin Cheng Zhou 《Computers, Materials & Continua》 SCIE EI 2020年第2期877-891,共15页
This paper researches some problems in complex formation for multi-agents,in which two matrices are proposed to record the formation.The pattern matrix is used to describe the pattern of the formation;meanwhile,the lo... This paper researches some problems in complex formation for multi-agents,in which two matrices are proposed to record the formation.The pattern matrix is used to describe the pattern of the formation;meanwhile,the location matrix is used to record the location of each agent.Thus,all desired positions of each agent will be obtained by geometrical relationship on the basis of two matrices above.In addition a self-adaptation flocking algorithm is proposed to control all agents to form a desired formation and avoid obstacles.The main idea is as follows:agents will form a desired formation through the method of formation control when far away from obstacles;otherwise,agents will freely fly to pass through the area of obstacles.In the simulation,three scenarios are designed to verify the effectiveness of our method.The results show that our method also can be applied in three dimensions.All agents will form a stable formation and keep the same velocity at last. 展开更多
关键词 multi-agents formation control SELF-ADAPTATION DISTRIBUTED velocities consensus
下载PDF
Distributed Consensus of High-Order Multi-Agents with Nonlinear Dynamics
7
作者 Jianzhen Li 《Intelligent Control and Automation》 2011年第1期1-7,共7页
This paper deals with the distributed consensus problem of high-order multi-agent systems with nonlinear dynamics subject to external disturbances. The network topology is assumed to be a fixed undirected graph. Some ... This paper deals with the distributed consensus problem of high-order multi-agent systems with nonlinear dynamics subject to external disturbances. The network topology is assumed to be a fixed undirected graph. Some sufficient conditions are derived, under which the consensus can be achieved with a prescribed norm bound. It is shown that the parameter matrix in the consensus algorithm can be designed by solving two linear matrix inequalities (LMIs). In particular, if the nonzero eigenvalues of the laplacian matrix ac-cording to the network topology are identical, the parameter matrix in the consensus algorithm can be de-signed by solving one LMI. A numerical example is given to illustrate the proposed results. 展开更多
关键词 CONSENSUS multi-agENT Systems Nonlinear DYNAMICS EXTERNAL Disturbances
下载PDF
基于Multi-Agent的无人机集群体系自主作战系统设计
8
作者 张堃 华帅 +1 位作者 袁斌林 杜睿怡 《系统工程与电子技术》 EI CSCD 北大核心 2024年第4期1273-1286,共14页
针对无人集群自主作战体系设计中的关键问题,提出基于Multi-Agent的无人集群自主作战系统设计方法。建立无人集群各节点的Agent模型及其推演规则;对于仿真系统模块化和通用化的需求,设计系统互操作式接口和无人集群自主作战的交互关系;... 针对无人集群自主作战体系设计中的关键问题,提出基于Multi-Agent的无人集群自主作战系统设计方法。建立无人集群各节点的Agent模型及其推演规则;对于仿真系统模块化和通用化的需求,设计系统互操作式接口和无人集群自主作战的交互关系;开展无人集群系统仿真推演验证。仿真结果表明,所提设计方案不仅能够有效开展并完成自主作战网络生成-集群演化-效能评估的全过程动态演示验证,而且能够通过重复随机试验进一步评估无人集群的协同作战效能,最后总结了集群协同作战的策略和经验。 展开更多
关键词 multi-agENT 无人集群 体系设计 协同作战
下载PDF
基于Multi-Agent的水电站变压器故障诊断系统
9
作者 乔丹 马鹏 王琦 《自动化技术与应用》 2024年第7期58-61,65,共5页
为了精准、快速完成水电站变压器的故障诊断,设计基于Multi-Agent的水电站变压器故障诊断系统。变压器状态监控agent将检测到的变压器故障信息发送给系统管理agent,系统管理agent通过通信agent将变压器故障信息发送给变压器故障诊断age... 为了精准、快速完成水电站变压器的故障诊断,设计基于Multi-Agent的水电站变压器故障诊断系统。变压器状态监控agent将检测到的变压器故障信息发送给系统管理agent,系统管理agent通过通信agent将变压器故障信息发送给变压器故障诊断agent,变压器故障诊断agent利用小波变换方法提取变压器故障特征,并将其作为IFOA-SVM模型输入,完成变压器故障分类后,获取变压器故障诊断结果,该结果通过通信agent显示给用户。实验表明,该系统可有效诊断变压器故障诊断,诊断成功率受系统故障信息丢失率的影响较小,诊断耗时、耗能小,并具有较高故障诊断成功率。 展开更多
关键词 multi-agENT 水电站 变压器 故障诊断 小波变换
下载PDF
UAV-Assisted Dynamic Avatar Task Migration for Vehicular Metaverse Services: A Multi-Agent Deep Reinforcement Learning Approach 被引量:1
10
作者 Jiawen Kang Junlong Chen +6 位作者 Minrui Xu Zehui Xiong Yutao Jiao Luchao Han Dusit Niyato Yongju Tong Shengli Xie 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期430-445,共16页
Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metavers... Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metaverses. However, avatar tasks include a multitude of human-to-avatar and avatar-to-avatar interactive applications, e.g., augmented reality navigation,which consumes intensive computing resources. It is inefficient and impractical for vehicles to process avatar tasks locally. Fortunately, migrating avatar tasks to the nearest roadside units(RSU)or unmanned aerial vehicles(UAV) for execution is a promising solution to decrease computation overhead and reduce task processing latency, while the high mobility of vehicles brings challenges for vehicles to independently perform avatar migration decisions depending on current and future vehicle status. To address these challenges, in this paper, we propose a novel avatar task migration system based on multi-agent deep reinforcement learning(MADRL) to execute immersive vehicular avatar tasks dynamically. Specifically, we first formulate the problem of avatar task migration from vehicles to RSUs/UAVs as a partially observable Markov decision process that can be solved by MADRL algorithms. We then design the multi-agent proximal policy optimization(MAPPO) approach as the MADRL algorithm for the avatar task migration problem. To overcome slow convergence resulting from the curse of dimensionality and non-stationary issues caused by shared parameters in MAPPO, we further propose a transformer-based MAPPO approach via sequential decision-making models for the efficient representation of relationships among agents. Finally, to motivate terrestrial or non-terrestrial edge servers(e.g., RSUs or UAVs) to share computation resources and ensure traceability of the sharing records, we apply smart contracts and blockchain technologies to achieve secure sharing management. Numerical results demonstrate that the proposed approach outperforms the MAPPO approach by around 2% and effectively reduces approximately 20% of the latency of avatar task execution in UAV-assisted vehicular Metaverses. 展开更多
关键词 AVATAR blockchain metaverses multi-agent deep reinforcement learning transformer UAVS
下载PDF
Dynamic event-triggered bipartite consensus for uncertain high-order nonlinearmulti-agentsystems
11
作者 Yanan Qi Chunshui Du +1 位作者 Xianfu Zhang Rui Mu 《Control Theory and Technology》 EI CSCD 2023年第2期222-232,共11页
In this paper,the bipartite consensus problem is studied for a class of uncertain high-order nonlinear multi-agent systems.A signed digraph is presented to describe the collaborative and competitive interactions among... In this paper,the bipartite consensus problem is studied for a class of uncertain high-order nonlinear multi-agent systems.A signed digraph is presented to describe the collaborative and competitive interactions among agents.For each agent with lower triangular structure,a time-varying gain compensator is first designed by relative output information of neighboring agents.Subsequently,a distributed controller with dynamic event-triggered mechanism is proposed to drive the bipartite consensus error to zero.It is worth noting that an internal dynamic variable is introduced in triggering function,which plays an essential role in excluding the Zeno behavior and reducing energy consumption.Furthermore,the dynamic event-triggered control protocol is developed for upper triangular multi-agent systems to realize the bipartite consensus without Zeno behavior.Finally,simulation examples are provided to illustrate the effectiveness of the presented results. 展开更多
关键词 High-order nonlinear multi-agent systems Uncertain systems Dynamic event-triggered control Bipartite consensus
原文传递
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
12
作者 Kun Jiang Wenzhang Liu +2 位作者 Yuanda Wang Lu Dong Changyin Sun 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第7期1591-1604,共14页
Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that ... Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that agents cannot directly observe. However, most of the existing latent variable discovery methods lack a clear representation of latent variables and an effective evaluation of the influence of latent variables on the agent. In this paper, we propose a new MARL algorithm based on the soft actor-critic method for complex continuous control tasks with confounders. It is called the multi-agent soft actor-critic with latent variable(MASAC-LV) algorithm, which uses variational inference theory to infer the compact latent variables representation space from a large amount of offline experience.Besides, we derive the counterfactual policy whose input has no latent variables and quantify the difference between the actual policy and the counterfactual policy via a distance function. This quantified difference is considered an intrinsic motivation that gives additional rewards based on how much the latent variable affects each agent. The proposed algorithm is evaluated on two collaboration tasks with confounders, and the experimental results demonstrate the effectiveness of MASAC-LV compared to other baseline algorithms. 展开更多
关键词 Latent variable model maximum entropy multi-agent reinforcement learning(MARL) multi-agent system
下载PDF
Designing Proportional-Integral Consensus Protocols for Second-Order Multi-Agent Systems Using Delayed and Memorized State Information
13
作者 Honghai Wang Qing-Long Han 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第4期878-892,共15页
This paper is concerned with consensus of a secondorder linear time-invariant multi-agent system in the situation that there exists a communication delay among the agents in the network.A proportional-integral consens... This paper is concerned with consensus of a secondorder linear time-invariant multi-agent system in the situation that there exists a communication delay among the agents in the network.A proportional-integral consensus protocol is designed by using delayed and memorized state information.Under the proportional-integral consensus protocol,the consensus problem of the multi-agent system is transformed into the problem of asymptotic stability of the corresponding linear time-invariant time-delay system.Note that the location of the eigenvalues of the corresponding characteristic function of the linear time-invariant time-delay system not only determines the stability of the system,but also plays a critical role in the dynamic performance of the system.In this paper,based on recent results on the distribution of roots of quasi-polynomials,several necessary conditions for Hurwitz stability for a class of quasi-polynomials are first derived.Then allowable regions of consensus protocol parameters are estimated.Some necessary and sufficient conditions for determining effective protocol parameters are provided.The designed protocol can achieve consensus and improve the dynamic performance of the second-order multi-agent system.Moreover,the effects of delays on consensus of systems of harmonic oscillators/double integrators under proportional-integral consensus protocols are investigated.Furthermore,some results on proportional-integral consensus are derived for a class of high-order linear time-invariant multi-agent systems. 展开更多
关键词 Consensus protocol Hurwitz stability multi-agent systems quasi-polynomials time delay
下载PDF
Bipartite consensus problems of Lurie multi-agent systems over signed graphs: A contraction approach
14
作者 张晓娇 吴祥 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第7期137-145,共9页
This paper examines the bipartite consensus problems for the nonlinear multi-agent systems in Lurie dynamics form with cooperative and competitive communication between different agents. Based on the contraction theor... This paper examines the bipartite consensus problems for the nonlinear multi-agent systems in Lurie dynamics form with cooperative and competitive communication between different agents. Based on the contraction theory, some new conditions for the nonlinear Lurie multi-agent systems reaching bipartite leaderless consensus and bipartite tracking consensus are presented. Compared with the traditional methods, this approach degrades the dimensions of the conditions, eliminates some restrictions of the system matrix, and extends the range of the nonlinear function. Finally, two numerical examples are provided to illustrate the efficiency of our results. 展开更多
关键词 contraction theory virtual system bipartite consensus Lurie multi-agent systems
下载PDF
Finite-time Prescribed Performance Time-Varying Formation Control for Second-Order Multi-Agent Systems With Non-Strict Feedback Based on a Neural Network Observer
15
作者 Chi Ma Dianbiao Dong 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第4期1039-1050,共12页
This paper studies the problem of time-varying formation control with finite-time prescribed performance for nonstrict feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities.To eli... This paper studies the problem of time-varying formation control with finite-time prescribed performance for nonstrict feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities.To eliminate nonlinearities,neural networks are applied to approximate the inherent dynamics of the system.In addition,due to the limitations of the actual working conditions,each follower agent can only obtain the locally measurable partial state information of the leader agent.To address this problem,a neural network state observer based on the leader state information is designed.Then,a finite-time prescribed performance adaptive output feedback control strategy is proposed by restricting the sliding mode surface to a prescribed region,which ensures that the closed-loop system has practical finite-time stability and that formation errors of the multi-agent systems converge to the prescribed performance bound in finite time.Finally,a numerical simulation is provided to demonstrate the practicality and effectiveness of the developed algorithm. 展开更多
关键词 Finite-time control multi-agent systems neural network prescribed performance control time-varying formation control
下载PDF
Hyperbolic Tangent Function-Based Protocols for Global/Semi-Global Finite-Time Consensus of Multi-Agent Systems
16
作者 Zongyu Zuo Jingchuan Tang +1 位作者 Ruiqi Ke Qing-Long Han 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第6期1381-1397,共17页
This paper investigates the problem of global/semi-global finite-time consensus for integrator-type multi-agent sys-tems.New hyperbolic tangent function-based protocols are pro-posed to achieve global and semi-global ... This paper investigates the problem of global/semi-global finite-time consensus for integrator-type multi-agent sys-tems.New hyperbolic tangent function-based protocols are pro-posed to achieve global and semi-global finite-time consensus for both single-integrator and double-integrator multi-agent systems with leaderless undirected and leader-following directed commu-nication topologies.These new protocols not only provide an explicit upper-bound estimate for the settling time,but also have a user-prescribed bounded control level.In addition,compared to some existing results based on the saturation function,the pro-posed approach considerably simplifies the protocol design and the stability analysis.Illustrative examples and an application demonstrate the effectiveness of the proposed protocols. 展开更多
关键词 Consensus protocol finite-time consensus hyper-bolic tangent function multi-agent systems.
下载PDF
A coordinated operation method of wind-PV-hydrogenstorage multi-agent energy system
17
作者 Tengfei Ma Wei Pei +3 位作者 Yanhong Yang Hao Xiao Chenghong Tang Weiqi Hua 《Global Energy Interconnection》 EI CSCD 2024年第4期446-461,共16页
Wind-photovoltaic(PV)-hydrogen-storage multi-agent energy systems are expected to play an important role in promoting renewable power utilization and decarbonization.In this study,a coordinated operation method was pr... Wind-photovoltaic(PV)-hydrogen-storage multi-agent energy systems are expected to play an important role in promoting renewable power utilization and decarbonization.In this study,a coordinated operation method was proposed for a wind-PVhydrogen-storage multi-agent energy system.First,a coordinated operation model was formulated for each agent considering peer-to-peer power trading.Second,a coordinated operation interactive framework for a multi-agent energy system was proposed based on the theory of the alternating direction method of multipliers.Third,a distributed interactive algorithm was proposed to protect the privacy of each agent and solve coordinated operation strategies.Finally,the effectiveness of the proposed coordinated operation method was tested on multi-agent energy systems with different structures,and the operational revenues of the wind power,PV,hydrogen,and energy storage agents of the proposed coordinated operation model were improved by approximately 59.19%,233.28%,16.75%,and 145.56%,respectively,compared with the independent operation model. 展开更多
关键词 Wind-PV-hydrogen-storage multi-agent energy system Renewable power to hydrogen Coordinated operation Peer-to-peer power trading
下载PDF
Unleashing the Power of Multi-Agent Reinforcement Learning for Algorithmic Trading in the Digital Financial Frontier and Enterprise Information Systems
18
作者 Saket Sarin Sunil K.Singh +4 位作者 Sudhakar Kumar Shivam Goyal Brij Bhooshan Gupta Wadee Alhalabi Varsha Arya 《Computers, Materials & Continua》 SCIE EI 2024年第8期3123-3138,共16页
In the rapidly evolving landscape of today’s digital economy,Financial Technology(Fintech)emerges as a trans-formative force,propelled by the dynamic synergy between Artificial Intelligence(AI)and Algorithmic Trading... In the rapidly evolving landscape of today’s digital economy,Financial Technology(Fintech)emerges as a trans-formative force,propelled by the dynamic synergy between Artificial Intelligence(AI)and Algorithmic Trading.Our in-depth investigation delves into the intricacies of merging Multi-Agent Reinforcement Learning(MARL)and Explainable AI(XAI)within Fintech,aiming to refine Algorithmic Trading strategies.Through meticulous examination,we uncover the nuanced interactions of AI-driven agents as they collaborate and compete within the financial realm,employing sophisticated deep learning techniques to enhance the clarity and adaptability of trading decisions.These AI-infused Fintech platforms harness collective intelligence to unearth trends,mitigate risks,and provide tailored financial guidance,fostering benefits for individuals and enterprises navigating the digital landscape.Our research holds the potential to revolutionize finance,opening doors to fresh avenues for investment and asset management in the digital age.Additionally,our statistical evaluation yields encouraging results,with metrics such as Accuracy=0.85,Precision=0.88,and F1 Score=0.86,reaffirming the efficacy of our approach within Fintech and emphasizing its reliability and innovative prowess. 展开更多
关键词 Neurodynamic Fintech multi-agent reinforcement learning algorithmic trading digital financial frontier
下载PDF
Targeted multi-agent communication algorithm based on state control
19
作者 Li-yang Zhao Tian-qing Chang +3 位作者 Lei Zhang Jie Zhang Kai-xuan Chu De-peng Kong 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第1期544-556,共13页
As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication ... As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication schemes can bring much timing redundancy and irrelevant messages,which seriously affects their practical application.To solve this problem,this paper proposes a targeted multiagent communication algorithm based on state control(SCTC).The SCTC uses a gating mechanism based on state control to reduce the timing redundancy of communication between agents and determines the interaction relationship between agents and the importance weight of a communication message through a series connection of hard-and self-attention mechanisms,realizing targeted communication message processing.In addition,by minimizing the difference between the fusion message generated from a real communication message of each agent and a fusion message generated from the buffered message,the correctness of the final action choice of the agent is ensured.Our evaluation using a challenging set of Star Craft II benchmarks indicates that the SCTC can significantly improve the learning performance and reduce the communication overhead between agents,thus ensuring better cooperation between agents. 展开更多
关键词 multi-agent deep reinforcement learning State control Targeted interaction Communication mechanism
下载PDF
Regional Multi-Agent Cooperative Reinforcement Learning for City-Level Traffic Grid Signal
20
作者 Yisha Li Ya Zhang +1 位作者 Xinde Li Changyin Sun 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第9期1987-1998,共12页
This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system.A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight... This article studies the effective traffic signal control problem of multiple intersections in a city-level traffic system.A novel regional multi-agent cooperative reinforcement learning algorithm called RegionSTLight is proposed to improve the traffic efficiency.Firstly a regional multi-agent Q-learning framework is proposed,which can equivalently decompose the global Q value of the traffic system into the local values of several regions Based on the framework and the idea of human-machine cooperation,a dynamic zoning method is designed to divide the traffic network into several strong-coupled regions according to realtime traffic flow densities.In order to achieve better cooperation inside each region,a lightweight spatio-temporal fusion feature extraction network is designed.The experiments in synthetic real-world and city-level scenarios show that the proposed RegionS TLight converges more quickly,is more stable,and obtains better asymptotic performance compared to state-of-theart models. 展开更多
关键词 Human-machine cooperation mixed domain attention mechanism multi-agent reinforcement learning spatio-temporal feature traffic signal control
下载PDF
上一页 1 2 55 下一页 到第
使用帮助 返回顶部