BACKGROUND Pain in the back or pelvis or fear of back pain may affect the timing or cocontraction of the core muscles.In both static and dynamic movements,the Sahrmann core stability test provides an assessment of cor...BACKGROUND Pain in the back or pelvis or fear of back pain may affect the timing or cocontraction of the core muscles.In both static and dynamic movements,the Sahrmann core stability test provides an assessment of core muscle activation and a person's ability to stabilize the lumbopelvic complex.Preparatory cues and images can be used to increase the activation of these muscles.To attain optimal movement patterns,it will be necessary to determine what cueing will give the most effective results for core stability.AIM To investigate the effects of external and internal cues on core muscle activation during the Sahrmann five-level core stability test.METHODS Total 68 participants(21.83±3.47 years)were randomly allocated to an external(n=35)or internal cue group(n=33).Participants performed the Sahrmann fivelevel core stability test without a cue as baseline and the five-level stability exercises with an internal or external cue.External cue group received a pressure biofeedback unit(PBU),and the internal cue group received an audio cue.A Delsys Trigno^(TM)surface electromyography unit was used for muscle activation from the rectus abdominis,external oblique,and transverse abdominis/internal oblique muscles.RESULTS Linear mixed effects model analysis showed that cueing had a significant effect on core muscle activation(P=0.001);however,there was no significant difference between cue types(internal or external)(P=0.130).CONCLUSION Both external and internal cueing have significant effects on core muscle activation during the Sahrmann five-level core stability test and the PBU does not create higher muscle activation than internal cueing.展开更多
Aiming for the coordinated motion and cooperative control of multi-agents in a non-rectangular bounded space, a velocity consensus algorithm for the agents with double- integrator dynamics is presented. The traditiona...Aiming for the coordinated motion and cooperative control of multi-agents in a non-rectangular bounded space, a velocity consensus algorithm for the agents with double- integrator dynamics is presented. The traditional consensus algorithm for bounded space is only applicable to rectangular bouncing boundaries, not suitable for non-rectangular space. In order to extend the previous consensus algorithm to the non- rectangular space, the concept of mirrored velocity is introduced, which can convert the discontinuous real velocity to continuous mirrored velocity, and expand a bounded space into an infinite space. Using the consensus algorithm, it is found that the mirrored velocities of multi-agents asymptotically converge to the same values. Because each mirrored velocity points to a unique velocity in real space, it can be concluded that the real velocities of multi-agents also asymptotically converge. Finally, the effectiveness of the proposed consensus algorithm is examined by theoretical proof and numerical simulations. Moreover, an experiment is performed with the algorithm in a real multi-robot system successfully.展开更多
This paper discusses the progress of computer integrated processing (CIPS) of coal-preparation and then preserits an intelligence controlled production-process, device-maintenance and production-management system of...This paper discusses the progress of computer integrated processing (CIPS) of coal-preparation and then preserits an intelligence controlled production-process, device-maintenance and production-management system of coal- preparation based on multi-agents (IICMMS-CP). The construction of the IICMMS-CP, the distributed network control system based on live intelligence control stations and the strategy of implementing distributed intelligence control system are studied in order to overcome the disadvantages brought about by the wide use of the PLC system by coaipreparation plants. The software frame, based on a Multi-Agent Intelligence Control and Maintenance Management integrated system, is studied and the implemention methods of IICMMS-CP are discussed. The characteristics of distributed architecture, cooperation and parallel computing meet the needs of integrated control of coal-preparation plants with large-scale spatial production distribution, densely-related processes and complex systems. Its application further improves the reliability and precision of process control, accuracy of fault identification and intelligence of production adjustment, establishes a technical basis for system integration and flexible production. The main function of the system has been tested in a coal-preparation plant to good effect in stabilizing product quality, improving efficiency and reducing consumption.展开更多
Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metavers...Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metaverses. However, avatar tasks include a multitude of human-to-avatar and avatar-to-avatar interactive applications, e.g., augmented reality navigation,which consumes intensive computing resources. It is inefficient and impractical for vehicles to process avatar tasks locally. Fortunately, migrating avatar tasks to the nearest roadside units(RSU)or unmanned aerial vehicles(UAV) for execution is a promising solution to decrease computation overhead and reduce task processing latency, while the high mobility of vehicles brings challenges for vehicles to independently perform avatar migration decisions depending on current and future vehicle status. To address these challenges, in this paper, we propose a novel avatar task migration system based on multi-agent deep reinforcement learning(MADRL) to execute immersive vehicular avatar tasks dynamically. Specifically, we first formulate the problem of avatar task migration from vehicles to RSUs/UAVs as a partially observable Markov decision process that can be solved by MADRL algorithms. We then design the multi-agent proximal policy optimization(MAPPO) approach as the MADRL algorithm for the avatar task migration problem. To overcome slow convergence resulting from the curse of dimensionality and non-stationary issues caused by shared parameters in MAPPO, we further propose a transformer-based MAPPO approach via sequential decision-making models for the efficient representation of relationships among agents. Finally, to motivate terrestrial or non-terrestrial edge servers(e.g., RSUs or UAVs) to share computation resources and ensure traceability of the sharing records, we apply smart contracts and blockchain technologies to achieve secure sharing management. Numerical results demonstrate that the proposed approach outperforms the MAPPO approach by around 2% and effectively reduces approximately 20% of the latency of avatar task execution in UAV-assisted vehicular Metaverses.展开更多
This paper researches some problems in complex formation for multi-agents,in which two matrices are proposed to record the formation.The pattern matrix is used to describe the pattern of the formation;meanwhile,the lo...This paper researches some problems in complex formation for multi-agents,in which two matrices are proposed to record the formation.The pattern matrix is used to describe the pattern of the formation;meanwhile,the location matrix is used to record the location of each agent.Thus,all desired positions of each agent will be obtained by geometrical relationship on the basis of two matrices above.In addition a self-adaptation flocking algorithm is proposed to control all agents to form a desired formation and avoid obstacles.The main idea is as follows:agents will form a desired formation through the method of formation control when far away from obstacles;otherwise,agents will freely fly to pass through the area of obstacles.In the simulation,three scenarios are designed to verify the effectiveness of our method.The results show that our method also can be applied in three dimensions.All agents will form a stable formation and keep the same velocity at last.展开更多
This paper studies the problem of time-varying formation control with finite-time prescribed performance for nonstrict feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities.To eli...This paper studies the problem of time-varying formation control with finite-time prescribed performance for nonstrict feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities.To eliminate nonlinearities,neural networks are applied to approximate the inherent dynamics of the system.In addition,due to the limitations of the actual working conditions,each follower agent can only obtain the locally measurable partial state information of the leader agent.To address this problem,a neural network state observer based on the leader state information is designed.Then,a finite-time prescribed performance adaptive output feedback control strategy is proposed by restricting the sliding mode surface to a prescribed region,which ensures that the closed-loop system has practical finite-time stability and that formation errors of the multi-agent systems converge to the prescribed performance bound in finite time.Finally,a numerical simulation is provided to demonstrate the practicality and effectiveness of the developed algorithm.展开更多
This article investigates the problem of robust adaptive leaderless consensus for heterogeneous uncertain nonminimumphase linear multi-agent systems over directed communication graphs. Each agent is assumed tobe of un...This article investigates the problem of robust adaptive leaderless consensus for heterogeneous uncertain nonminimumphase linear multi-agent systems over directed communication graphs. Each agent is assumed tobe of unknown nominal dynamics and also subject to external disturbances and/or unmodeled dynamics. Anovel distributed robust adaptive control strategy is proposed. It is shown that the robust adaptive leaderlessconsensus problem is solved with the proposed control strategy under some sufficient conditions. Two examplesare provided to demonstrate the efficacy of the proposed control strategy.展开更多
Material composition and structural design are important factors influencing the electromagnetic wave(EMW)absorption performance of materials.To alleviate the impedance mismatch attributed to the high dielectric const...Material composition and structural design are important factors influencing the electromagnetic wave(EMW)absorption performance of materials.To alleviate the impedance mismatch attributed to the high dielectric constant of Ti_(3)C_(2)T_(x)MXene,we have successfully synthesized core‐shell structured SiO_(2)@MXene@MoS_(2)nanospheres.This architecture,comprising SiO_(2) as the core,MXene as the intermediate layer,and MoS_(2) as the outer shell,is achieved through an electrostatic self‐assembly method combined with a hydrothermal process.This complex core‐shell structure not only provides a variety of loss mechanisms that effectively dissipate electromagnetic energy but also prevents self‐aggregation of MXene and MoS_(2) nanosheets.Notably,the synergistic combination of SiO_(2) and MoS_(2) with highly conductive MXene enables the suitable dielectric constant of the composites,ensuring optimal impedance matching.Therefore,the core‐shell structured SiO_(2)@MXene@MoS_(2) nanospheres exhibit excellent EMW absorption performance,featuring a remarkable minimum reflection loss(RL_(min))of−52.11 dB(2.4 mm).It is noteworthy that these nanospheres achieve an ultra‐wide effective absorption bandwidth(EAB)of 6.72 GHz.This work provides a novel approach for designing and synthesizing high‐performance EMW absorbers characterized by“wide bandwidth and strong reflection loss.”展开更多
This paper deals with the distributed consensus problem of high-order multi-agent systems with nonlinear dynamics subject to external disturbances. The network topology is assumed to be a fixed undirected graph. Some ...This paper deals with the distributed consensus problem of high-order multi-agent systems with nonlinear dynamics subject to external disturbances. The network topology is assumed to be a fixed undirected graph. Some sufficient conditions are derived, under which the consensus can be achieved with a prescribed norm bound. It is shown that the parameter matrix in the consensus algorithm can be designed by solving two linear matrix inequalities (LMIs). In particular, if the nonzero eigenvalues of the laplacian matrix ac-cording to the network topology are identical, the parameter matrix in the consensus algorithm can be de-signed by solving one LMI. A numerical example is given to illustrate the proposed results.展开更多
In these latter days software agents are used for the development and implementation of intellectual decision support systems. In order to implement intelligence in a system some or several dozen of software agents ar...In these latter days software agents are used for the development and implementation of intellectual decision support systems. In order to implement intelligence in a system some or several dozen of software agents are used and the made system becomes multi-agent. For the development of these systems a set of methodologies, i.e., the sequence of consequent steps of analysis, designing and implementation, is offered. The carried out analysis of the methodologies showed that as a rule they are limited by the spectrum of their pending problem (within the pales of the requirements of specific applied task, within the pales of the possibilities of technical implementation) or within the pales of amount of detail. The variety of methodologies is influenced by the fact that for the development of these systems the requirements and attitudes are offered by the specialists of related spheres such as software, numeral intellect engineers. In the course of the development of hardware and software appeared possibilities to implement mobile multi-agents systems, however, there is no one united mobile multi-agent systems design methodology, whereas existing systems are underdeveloped and their number is small. In this article we introduce the course of the designing of an intellectual real time multi-agent investment management decision support information system adapting and combining some methodologies where the choice to use either communicating or mobile agents is the question of rather technical implementation than methodological. In the article we introduce two ways of system implementation by JADE platform: the first one-using communicating agents, and the second one-using mobile agents.展开更多
Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that ...Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that agents cannot directly observe. However, most of the existing latent variable discovery methods lack a clear representation of latent variables and an effective evaluation of the influence of latent variables on the agent. In this paper, we propose a new MARL algorithm based on the soft actor-critic method for complex continuous control tasks with confounders. It is called the multi-agent soft actor-critic with latent variable(MASAC-LV) algorithm, which uses variational inference theory to infer the compact latent variables representation space from a large amount of offline experience.Besides, we derive the counterfactual policy whose input has no latent variables and quantify the difference between the actual policy and the counterfactual policy via a distance function. This quantified difference is considered an intrinsic motivation that gives additional rewards based on how much the latent variable affects each agent. The proposed algorithm is evaluated on two collaboration tasks with confounders, and the experimental results demonstrate the effectiveness of MASAC-LV compared to other baseline algorithms.展开更多
文摘BACKGROUND Pain in the back or pelvis or fear of back pain may affect the timing or cocontraction of the core muscles.In both static and dynamic movements,the Sahrmann core stability test provides an assessment of core muscle activation and a person's ability to stabilize the lumbopelvic complex.Preparatory cues and images can be used to increase the activation of these muscles.To attain optimal movement patterns,it will be necessary to determine what cueing will give the most effective results for core stability.AIM To investigate the effects of external and internal cues on core muscle activation during the Sahrmann five-level core stability test.METHODS Total 68 participants(21.83±3.47 years)were randomly allocated to an external(n=35)or internal cue group(n=33).Participants performed the Sahrmann fivelevel core stability test without a cue as baseline and the five-level stability exercises with an internal or external cue.External cue group received a pressure biofeedback unit(PBU),and the internal cue group received an audio cue.A Delsys Trigno^(TM)surface electromyography unit was used for muscle activation from the rectus abdominis,external oblique,and transverse abdominis/internal oblique muscles.RESULTS Linear mixed effects model analysis showed that cueing had a significant effect on core muscle activation(P=0.001);however,there was no significant difference between cue types(internal or external)(P=0.130).CONCLUSION Both external and internal cueing have significant effects on core muscle activation during the Sahrmann five-level core stability test and the PBU does not create higher muscle activation than internal cueing.
基金The National Natural Science Foundation of China(No.61273110)the Specialized Fund for the Doctoral Program of Higher Education(No.20130092130002)
文摘Aiming for the coordinated motion and cooperative control of multi-agents in a non-rectangular bounded space, a velocity consensus algorithm for the agents with double- integrator dynamics is presented. The traditional consensus algorithm for bounded space is only applicable to rectangular bouncing boundaries, not suitable for non-rectangular space. In order to extend the previous consensus algorithm to the non- rectangular space, the concept of mirrored velocity is introduced, which can convert the discontinuous real velocity to continuous mirrored velocity, and expand a bounded space into an infinite space. Using the consensus algorithm, it is found that the mirrored velocities of multi-agents asymptotically converge to the same values. Because each mirrored velocity points to a unique velocity in real space, it can be concluded that the real velocities of multi-agents also asymptotically converge. Finally, the effectiveness of the proposed consensus algorithm is examined by theoretical proof and numerical simulations. Moreover, an experiment is performed with the algorithm in a real multi-robot system successfully.
文摘This paper discusses the progress of computer integrated processing (CIPS) of coal-preparation and then preserits an intelligence controlled production-process, device-maintenance and production-management system of coal- preparation based on multi-agents (IICMMS-CP). The construction of the IICMMS-CP, the distributed network control system based on live intelligence control stations and the strategy of implementing distributed intelligence control system are studied in order to overcome the disadvantages brought about by the wide use of the PLC system by coaipreparation plants. The software frame, based on a Multi-Agent Intelligence Control and Maintenance Management integrated system, is studied and the implemention methods of IICMMS-CP are discussed. The characteristics of distributed architecture, cooperation and parallel computing meet the needs of integrated control of coal-preparation plants with large-scale spatial production distribution, densely-related processes and complex systems. Its application further improves the reliability and precision of process control, accuracy of fault identification and intelligence of production adjustment, establishes a technical basis for system integration and flexible production. The main function of the system has been tested in a coal-preparation plant to good effect in stabilizing product quality, improving efficiency and reducing consumption.
基金supported in part by NSFC (62102099, U22A2054, 62101594)in part by the Pearl River Talent Recruitment Program (2021QN02S643)+9 种基金Guangzhou Basic Research Program (2023A04J1699)in part by the National Research Foundation, SingaporeInfocomm Media Development Authority under its Future Communications Research Development ProgrammeDSO National Laboratories under the AI Singapore Programme under AISG Award No AISG2-RP-2020-019Energy Research Test-Bed and Industry Partnership Funding Initiative, Energy Grid (EG) 2.0 programmeDesCartes and the Campus for Research Excellence and Technological Enterprise (CREATE) programmeMOE Tier 1 under Grant RG87/22in part by the Singapore University of Technology and Design (SUTD) (SRG-ISTD-2021- 165)in part by the SUTD-ZJU IDEA Grant SUTD-ZJU (VP) 202102in part by the Ministry of Education, Singapore, through its SUTD Kickstarter Initiative (SKI 20210204)。
文摘Avatars, as promising digital representations and service assistants of users in Metaverses, can enable drivers and passengers to immerse themselves in 3D virtual services and spaces of UAV-assisted vehicular Metaverses. However, avatar tasks include a multitude of human-to-avatar and avatar-to-avatar interactive applications, e.g., augmented reality navigation,which consumes intensive computing resources. It is inefficient and impractical for vehicles to process avatar tasks locally. Fortunately, migrating avatar tasks to the nearest roadside units(RSU)or unmanned aerial vehicles(UAV) for execution is a promising solution to decrease computation overhead and reduce task processing latency, while the high mobility of vehicles brings challenges for vehicles to independently perform avatar migration decisions depending on current and future vehicle status. To address these challenges, in this paper, we propose a novel avatar task migration system based on multi-agent deep reinforcement learning(MADRL) to execute immersive vehicular avatar tasks dynamically. Specifically, we first formulate the problem of avatar task migration from vehicles to RSUs/UAVs as a partially observable Markov decision process that can be solved by MADRL algorithms. We then design the multi-agent proximal policy optimization(MAPPO) approach as the MADRL algorithm for the avatar task migration problem. To overcome slow convergence resulting from the curse of dimensionality and non-stationary issues caused by shared parameters in MAPPO, we further propose a transformer-based MAPPO approach via sequential decision-making models for the efficient representation of relationships among agents. Finally, to motivate terrestrial or non-terrestrial edge servers(e.g., RSUs or UAVs) to share computation resources and ensure traceability of the sharing records, we apply smart contracts and blockchain technologies to achieve secure sharing management. Numerical results demonstrate that the proposed approach outperforms the MAPPO approach by around 2% and effectively reduces approximately 20% of the latency of avatar task execution in UAV-assisted vehicular Metaverses.
文摘This paper researches some problems in complex formation for multi-agents,in which two matrices are proposed to record the formation.The pattern matrix is used to describe the pattern of the formation;meanwhile,the location matrix is used to record the location of each agent.Thus,all desired positions of each agent will be obtained by geometrical relationship on the basis of two matrices above.In addition a self-adaptation flocking algorithm is proposed to control all agents to form a desired formation and avoid obstacles.The main idea is as follows:agents will form a desired formation through the method of formation control when far away from obstacles;otherwise,agents will freely fly to pass through the area of obstacles.In the simulation,three scenarios are designed to verify the effectiveness of our method.The results show that our method also can be applied in three dimensions.All agents will form a stable formation and keep the same velocity at last.
基金the National Natural Science Foundation of China(62203356)Fundamental Research Funds for the Central Universities of China(31020210502002)。
文摘This paper studies the problem of time-varying formation control with finite-time prescribed performance for nonstrict feedback second-order multi-agent systems with unmeasured states and unknown nonlinearities.To eliminate nonlinearities,neural networks are applied to approximate the inherent dynamics of the system.In addition,due to the limitations of the actual working conditions,each follower agent can only obtain the locally measurable partial state information of the leader agent.To address this problem,a neural network state observer based on the leader state information is designed.Then,a finite-time prescribed performance adaptive output feedback control strategy is proposed by restricting the sliding mode surface to a prescribed region,which ensures that the closed-loop system has practical finite-time stability and that formation errors of the multi-agent systems converge to the prescribed performance bound in finite time.Finally,a numerical simulation is provided to demonstrate the practicality and effectiveness of the developed algorithm.
基金Research Grants Council of Hong Kong under Grant CityU-11205221.
文摘This article investigates the problem of robust adaptive leaderless consensus for heterogeneous uncertain nonminimumphase linear multi-agent systems over directed communication graphs. Each agent is assumed tobe of unknown nominal dynamics and also subject to external disturbances and/or unmodeled dynamics. Anovel distributed robust adaptive control strategy is proposed. It is shown that the robust adaptive leaderlessconsensus problem is solved with the proposed control strategy under some sufficient conditions. Two examplesare provided to demonstrate the efficacy of the proposed control strategy.
基金Joint Fund of Research and Development Program of Henan Province,Grant/Award Number:222301420002National Natural Science Foundation of China,Grant/Award Number:U21A2064Scientific and Technological Innovation Talents in Colleges and Universities in Henan Province,Grant/Award Number:22HASTIT001。
文摘Material composition and structural design are important factors influencing the electromagnetic wave(EMW)absorption performance of materials.To alleviate the impedance mismatch attributed to the high dielectric constant of Ti_(3)C_(2)T_(x)MXene,we have successfully synthesized core‐shell structured SiO_(2)@MXene@MoS_(2)nanospheres.This architecture,comprising SiO_(2) as the core,MXene as the intermediate layer,and MoS_(2) as the outer shell,is achieved through an electrostatic self‐assembly method combined with a hydrothermal process.This complex core‐shell structure not only provides a variety of loss mechanisms that effectively dissipate electromagnetic energy but also prevents self‐aggregation of MXene and MoS_(2) nanosheets.Notably,the synergistic combination of SiO_(2) and MoS_(2) with highly conductive MXene enables the suitable dielectric constant of the composites,ensuring optimal impedance matching.Therefore,the core‐shell structured SiO_(2)@MXene@MoS_(2) nanospheres exhibit excellent EMW absorption performance,featuring a remarkable minimum reflection loss(RL_(min))of−52.11 dB(2.4 mm).It is noteworthy that these nanospheres achieve an ultra‐wide effective absorption bandwidth(EAB)of 6.72 GHz.This work provides a novel approach for designing and synthesizing high‐performance EMW absorbers characterized by“wide bandwidth and strong reflection loss.”
文摘This paper deals with the distributed consensus problem of high-order multi-agent systems with nonlinear dynamics subject to external disturbances. The network topology is assumed to be a fixed undirected graph. Some sufficient conditions are derived, under which the consensus can be achieved with a prescribed norm bound. It is shown that the parameter matrix in the consensus algorithm can be designed by solving two linear matrix inequalities (LMIs). In particular, if the nonzero eigenvalues of the laplacian matrix ac-cording to the network topology are identical, the parameter matrix in the consensus algorithm can be de-signed by solving one LMI. A numerical example is given to illustrate the proposed results.
文摘In these latter days software agents are used for the development and implementation of intellectual decision support systems. In order to implement intelligence in a system some or several dozen of software agents are used and the made system becomes multi-agent. For the development of these systems a set of methodologies, i.e., the sequence of consequent steps of analysis, designing and implementation, is offered. The carried out analysis of the methodologies showed that as a rule they are limited by the spectrum of their pending problem (within the pales of the requirements of specific applied task, within the pales of the possibilities of technical implementation) or within the pales of amount of detail. The variety of methodologies is influenced by the fact that for the development of these systems the requirements and attitudes are offered by the specialists of related spheres such as software, numeral intellect engineers. In the course of the development of hardware and software appeared possibilities to implement mobile multi-agents systems, however, there is no one united mobile multi-agent systems design methodology, whereas existing systems are underdeveloped and their number is small. In this article we introduce the course of the designing of an intellectual real time multi-agent investment management decision support information system adapting and combining some methodologies where the choice to use either communicating or mobile agents is the question of rather technical implementation than methodological. In the article we introduce two ways of system implementation by JADE platform: the first one-using communicating agents, and the second one-using mobile agents.
基金supported in part by the National Natural Science Foundation of China (62136008,62236002,61921004,62173251,62103104)the “Zhishan” Scholars Programs of Southeast Universitythe Fundamental Research Funds for the Central Universities (2242023K30034)。
文摘Efficient exploration in complex coordination tasks has been considered a challenging problem in multi-agent reinforcement learning(MARL). It is significantly more difficult for those tasks with latent variables that agents cannot directly observe. However, most of the existing latent variable discovery methods lack a clear representation of latent variables and an effective evaluation of the influence of latent variables on the agent. In this paper, we propose a new MARL algorithm based on the soft actor-critic method for complex continuous control tasks with confounders. It is called the multi-agent soft actor-critic with latent variable(MASAC-LV) algorithm, which uses variational inference theory to infer the compact latent variables representation space from a large amount of offline experience.Besides, we derive the counterfactual policy whose input has no latent variables and quantify the difference between the actual policy and the counterfactual policy via a distance function. This quantified difference is considered an intrinsic motivation that gives additional rewards based on how much the latent variable affects each agent. The proposed algorithm is evaluated on two collaboration tasks with confounders, and the experimental results demonstrate the effectiveness of MASAC-LV compared to other baseline algorithms.