Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents.However,in high‐dimensional continuous spaces,the non‐stationary environment can provide outdated experi...Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents.However,in high‐dimensional continuous spaces,the non‐stationary environment can provide outdated experiences that hinder convergence,resulting in ineffective training performance for multi‐agent systems.To tackle this issue,a novel reinforcement learning scheme,Mutual Information Oriented Deep Skill Chaining(MioDSC),is proposed that generates an optimised cooperative policy by incorporating intrinsic rewards based on mutual information to improve exploration efficiency.These rewards encourage agents to diversify their learning process by engaging in actions that increase the mutual information between their actions and the environment state.In addition,MioDSC can generate cooperative policies using the options framework,allowing agents to learn and reuse complex action sequences and accelerating the convergence speed of multi‐agent learning.MioDSC was evaluated in the multi‐agent particle environment and the StarCraft multi‐agent challenge at varying difficulty levels.The experimental results demonstrate that MioDSC outperforms state‐of‐the‐art methods and is robust across various multi‐agent system tasks with high stability.展开更多
The increasing trend toward dematerialization and digitalization has prompted a surge in the adoption of IT service providers, offering cost-effective alternatives to traditional local services. Consequently, cloud se...The increasing trend toward dematerialization and digitalization has prompted a surge in the adoption of IT service providers, offering cost-effective alternatives to traditional local services. Consequently, cloud services have become prevalent across various industries. While these services offer undeniable benefits, they face significant threats, particularly concerning the sensitivity of the data they handle. Many existing mathematical models struggle to accurately depict the complex scenarios of cloud systems. In response to this challenge, this paper proposes a behavioral model for ransomware propagation within such environments. In this model, each component of the environment is defined as an agent responsible for monitoring the propagation of malware. Given the distinct characteristics and criticality of these agents, the impact of malware can vary significantly. Scenario attacks are constructed based on real-world vulnerabilities documented in the Common Vulnerabilities and Exposures (CVEs) through the National Vulnerability Database. Defender actions are guided by an Intrusion Detection System (IDS) guideline. This research aims to provide a comprehensive framework for understanding and addressing ransomware threats in cloud systems. By leveraging an agent- based approach and real-world vulnerability data, our model offers valuable insights into detection and mitigation strategies for safeguarding sensitive cloud-based assets.展开更多
A Multi-Agent System ( MAS ) is a promising approach to build complex system. This paper introduces the research of the Inner-Enterprise Credit Rating MAS ( IECRMAS). To raise the rating accuracy, we not only cons...A Multi-Agent System ( MAS ) is a promising approach to build complex system. This paper introduces the research of the Inner-Enterprise Credit Rating MAS ( IECRMAS). To raise the rating accuracy, we not only consider the rating-target's information, but also focus on the evaluators' feature information and propose the rational rating-group formation algorithm based on an anti-bias measurement of the group. We also propose the rational rating individual, which consists of the evaluator and the assistant rating agent. A rational group formation protocol is designed to coordinate autonomous agents to perform the rating job.展开更多
Development of complicated products is a project of system engineering It involves extensive and complicated knowledge,design methods and auxiliary technology Various factors affect each other So,modern product dev...Development of complicated products is a project of system engineering It involves extensive and complicated knowledge,design methods and auxiliary technology Various factors affect each other So,modern product development is a typical group problem with distributed and dynamic features It is apparent superiority to solve this problem with a multi agent system representing various knowledge domains Distributed artificial intelligence knowledge being used,the multi agent collaborative design system concept and model based on Internet environment are put forward The realizing method of product developing agents,interactive process among multi agents,and organization and implementing of the design project of the multi agent collaborative design system are discussed in detail Application examples are also presented.展开更多
This paper presents a multi agent model for the realization of the tasks dispatched in a distributed flexible manufacturing system.Agent behavior is described in terms of its capabilities and related environment.Acco...This paper presents a multi agent model for the realization of the tasks dispatched in a distributed flexible manufacturing system.Agent behavior is described in terms of its capabilities and related environment.According to task execution forms,two kinds of task allocation methods are used and the proper communication mechanisms and negotiation mechanisms are involved to guarantee a high performance and high reliability for a DFMS.展开更多
Aiming at the deficiency of conventional traffic control method, this paper proposes a new method based on multi-agent technology for traffic control. Different from many existing methods, this paper distinguishes tra...Aiming at the deficiency of conventional traffic control method, this paper proposes a new method based on multi-agent technology for traffic control. Different from many existing methods, this paper distinguishes traffic control on the basis of the agent technology from conventional traffic control method. The composition and structure of a multi-agent system (MAS) is first discussed. Then, the step-coordination strategies of intersection-agent, segment-agent, and area-agent are put forward. The advantages of the algorithm are demonstrated by a simulation study.展开更多
基金National Natural Science Foundation of China,Grant/Award Number:61872171The Belt and Road Special Foundation of the State Key Laboratory of Hydrology‐Water Resources and Hydraulic Engineering,Grant/Award Number:2021490811。
文摘Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents.However,in high‐dimensional continuous spaces,the non‐stationary environment can provide outdated experiences that hinder convergence,resulting in ineffective training performance for multi‐agent systems.To tackle this issue,a novel reinforcement learning scheme,Mutual Information Oriented Deep Skill Chaining(MioDSC),is proposed that generates an optimised cooperative policy by incorporating intrinsic rewards based on mutual information to improve exploration efficiency.These rewards encourage agents to diversify their learning process by engaging in actions that increase the mutual information between their actions and the environment state.In addition,MioDSC can generate cooperative policies using the options framework,allowing agents to learn and reuse complex action sequences and accelerating the convergence speed of multi‐agent learning.MioDSC was evaluated in the multi‐agent particle environment and the StarCraft multi‐agent challenge at varying difficulty levels.The experimental results demonstrate that MioDSC outperforms state‐of‐the‐art methods and is robust across various multi‐agent system tasks with high stability.
文摘The increasing trend toward dematerialization and digitalization has prompted a surge in the adoption of IT service providers, offering cost-effective alternatives to traditional local services. Consequently, cloud services have become prevalent across various industries. While these services offer undeniable benefits, they face significant threats, particularly concerning the sensitivity of the data they handle. Many existing mathematical models struggle to accurately depict the complex scenarios of cloud systems. In response to this challenge, this paper proposes a behavioral model for ransomware propagation within such environments. In this model, each component of the environment is defined as an agent responsible for monitoring the propagation of malware. Given the distinct characteristics and criticality of these agents, the impact of malware can vary significantly. Scenario attacks are constructed based on real-world vulnerabilities documented in the Common Vulnerabilities and Exposures (CVEs) through the National Vulnerability Database. Defender actions are guided by an Intrusion Detection System (IDS) guideline. This research aims to provide a comprehensive framework for understanding and addressing ransomware threats in cloud systems. By leveraging an agent- based approach and real-world vulnerability data, our model offers valuable insights into detection and mitigation strategies for safeguarding sensitive cloud-based assets.
基金This paper is supported by National Science Foundation of China under Grant No60542004
文摘A Multi-Agent System ( MAS ) is a promising approach to build complex system. This paper introduces the research of the Inner-Enterprise Credit Rating MAS ( IECRMAS). To raise the rating accuracy, we not only consider the rating-target's information, but also focus on the evaluators' feature information and propose the rational rating-group formation algorithm based on an anti-bias measurement of the group. We also propose the rational rating individual, which consists of the evaluator and the assistant rating agent. A rational group formation protocol is designed to coordinate autonomous agents to perform the rating job.
基金This project is supported by National Natural Science Foundation of China (No.59875087) and by Foundation for University Key T
文摘Development of complicated products is a project of system engineering It involves extensive and complicated knowledge,design methods and auxiliary technology Various factors affect each other So,modern product development is a typical group problem with distributed and dynamic features It is apparent superiority to solve this problem with a multi agent system representing various knowledge domains Distributed artificial intelligence knowledge being used,the multi agent collaborative design system concept and model based on Internet environment are put forward The realizing method of product developing agents,interactive process among multi agents,and organization and implementing of the design project of the multi agent collaborative design system are discussed in detail Application examples are also presented.
文摘This paper presents a multi agent model for the realization of the tasks dispatched in a distributed flexible manufacturing system.Agent behavior is described in terms of its capabilities and related environment.According to task execution forms,two kinds of task allocation methods are used and the proper communication mechanisms and negotiation mechanisms are involved to guarantee a high performance and high reliability for a DFMS.
文摘Aiming at the deficiency of conventional traffic control method, this paper proposes a new method based on multi-agent technology for traffic control. Different from many existing methods, this paper distinguishes traffic control on the basis of the agent technology from conventional traffic control method. The composition and structure of a multi-agent system (MAS) is first discussed. Then, the step-coordination strategies of intersection-agent, segment-agent, and area-agent are put forward. The advantages of the algorithm are demonstrated by a simulation study.