In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is prop...In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is proposed,which consists of a distributed fixed-time observer,a fixed-time disturbance observer,a nonsmooth antidisturbance backstepping controller,and the fixed-time stability analysis is conducted by using the Lyapunov theory correspondingly.This paper includes three main improvements.First,a distributed fixed-time observer is developed for each follower to obtain an estimate of the leader’s output by utilizing the topology of the communication network.Second,a fixed-time disturbance observer is given to estimate the lumped disturbances for feedforward compensation.Finally,a nonsmooth antidisturbance backstepping tracking controller with feedforward compensation for lumped disturbances is designed.In order to mitigate the“explosion of complexity”in the tradi-tional backstepping approach,we have implemented a modified nonsmooth command filter to enhance the performance of the closed-loop system.The simulation results show that the pro-posed method is effective.展开更多
Dear Editor,This letter develops a novel method to implement event-triggered optimal control(ETOC) for discrete-time nonlinear systems using parallel control and deep reinforcement learning(DRL), referred to as Deep-E...Dear Editor,This letter develops a novel method to implement event-triggered optimal control(ETOC) for discrete-time nonlinear systems using parallel control and deep reinforcement learning(DRL), referred to as Deep-ETOC. The developed Deep-ETOC method introduces the communication cost into the performance index through parallel control, so that the developed method enables control systems to learn ETOC policies directly without triggering conditions.展开更多
In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied sy...In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods.To achieve optimal control,RL algorithm based on critic–actor architecture is considered for the nonlinear system.Due to the significant security risks of network transmission,the system is vulnerable to deception attacks,which can make all the system state unavailable.By using the attacked states to design coordinate transformation,the harm brought by unknown deception attacks has been overcome.The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded.Finally,the simulation experiment is shown to prove the effectiveness of the strategy.展开更多
This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems.Unlike existing optimal state feedback control,the control input of the optimal parallel control is int...This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems.Unlike existing optimal state feedback control,the control input of the optimal parallel control is introduced into the feedback system.However,due to the introduction of control input into the feedback system,the optimal state feedback control methods can not be applied directly.To address this problem,an augmented system and an augmented performance index function are proposed firstly.Thus,the general nonlinear system is transformed into an affine nonlinear system.The difference between the optimal parallel control and the optimal state feedback control is analyzed theoretically.It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index function.Moreover,an adaptive dynamic programming(ADP)technique is utilized to implement the optimal parallel tracking control using a critic neural network(NN)to approximate the value function online.The stability analysis of the closed-loop system is performed using the Lyapunov theory,and the tracking error and NN weights errors are uniformly ultimately bounded(UUB).Also,the optimal parallel controller guarantees the continuity of the control input under the circumstance that there are finite jump discontinuities in the reference signals.Finally,the effectiveness of the developed optimal parallel control method is verified in two cases.展开更多
In this paper,a new parallel controller is developed for continuous-time linear systems.The main contribution of the method is to establish a new parallel control law,where both state and control are considered as the...In this paper,a new parallel controller is developed for continuous-time linear systems.The main contribution of the method is to establish a new parallel control law,where both state and control are considered as the input.The structure of the parallel control is provided,and the relationship between the parallel control and traditional feedback controls is presented.Considering the situations that the systems are controllable and incompletely controllable,the properties of the parallel control law are analyzed.The parallel controller design algorithms are given under the conditions that the systems are controllable and incompletely controllable.Finally,numerical simulations are carried out to demonstrate the effectiveness and applicability of the present method.Index Terms-Continuous-time linear systems,digital twin,parallel controller,parallel intelligence,parallel systems.展开更多
The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable ener...The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost.展开更多
This paper investigates the consensus problem for linear multi-agent systems with the heterogeneous disturbances generated by the Brown motion.Its main contribution is that a control scheme is designed to achieve the ...This paper investigates the consensus problem for linear multi-agent systems with the heterogeneous disturbances generated by the Brown motion.Its main contribution is that a control scheme is designed to achieve the dynamic consensus for the multi-agent systems in directed topology interfered by stochastic noise.In traditional ways,the coupling weights depending on the communication structure are static.A new distributed controller is designed based on Riccati inequalities,while updating the coupling weights associated with the gain matrix by state errors between adjacent agents.By introducing time-varying coupling weights into this novel control law,the state errors between leader and followers asymptotically converge to the minimum value utilizing the local interaction.Through the Lyapunov directed method and It?formula,the stability of the closed-loop system with the proposed control law is analyzed.Two simulation results conducted by the new and traditional schemes are presented to demonstrate the effectiveness and advantage of the developed control method.展开更多
As an interdisciplinary research approach,traditional cognitive science adopts mainly the experiment,induction,modeling,and validation paradigm.Such models are sometimes not applicable in cyber-physical-socialsystems ...As an interdisciplinary research approach,traditional cognitive science adopts mainly the experiment,induction,modeling,and validation paradigm.Such models are sometimes not applicable in cyber-physical-socialsystems (CPSSs),where the large number of human users involves severe heterogeneity and dynamics.To reduce the decision-making conflicts between people and machines in human-centered systems,we propose a new research paradigm called parallel cognition that uses the system of intelligent techniques to investigate cognitive activities and functionals in three stages:descriptive cognition based on artificial cognitive systems (ACSs),predictive cognition with computational deliberation experiments,and prescriptive cognition via parallel behavioral prescription.To make iteration of these stages constantly on-line,a hybrid learning method based on both a psychological model and user behavioral data is further proposed to adaptively learn an individual’s cognitive knowledge.Preliminary experiments on two representative scenarios,urban travel behavioral prescription and cognitive visual reasoning,indicate that our parallel cognition learning is effective and feasible for human behavioral prescription,and can thus facilitate human-machine cooperation in both complex engineering and social systems.展开更多
This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into ...This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method.展开更多
Multi-agent system gaming(MASG)is widely applied in military intelligence,information networks,unmanned systems,intelligent transportation,and smart grids,exhibiting systematic and organizational characteristics.It re...Multi-agent system gaming(MASG)is widely applied in military intelligence,information networks,unmanned systems,intelligent transportation,and smart grids,exhibiting systematic and organizational characteristics.It requires the multi-agent system perceive and act in a complex dynamic environment and at the same time achieve a balance between individual interests and the maximization of group interests within the system.展开更多
基金supported by the National Defense Basic Scientific Research Project(JCKY2020130C025)the National Science and Technology Major Project(J2019-III-0020-0064,J2019-V-0014-0109)。
文摘In this paper,fixed-time consensus tracking for mul-tiagent systems(MASs)with dynamics in the form of strict feed-back affine nonlinearity is addressed.A fixed-time antidistur-bance consensus tracking protocol is proposed,which consists of a distributed fixed-time observer,a fixed-time disturbance observer,a nonsmooth antidisturbance backstepping controller,and the fixed-time stability analysis is conducted by using the Lyapunov theory correspondingly.This paper includes three main improvements.First,a distributed fixed-time observer is developed for each follower to obtain an estimate of the leader’s output by utilizing the topology of the communication network.Second,a fixed-time disturbance observer is given to estimate the lumped disturbances for feedforward compensation.Finally,a nonsmooth antidisturbance backstepping tracking controller with feedforward compensation for lumped disturbances is designed.In order to mitigate the“explosion of complexity”in the tradi-tional backstepping approach,we have implemented a modified nonsmooth command filter to enhance the performance of the closed-loop system.The simulation results show that the pro-posed method is effective.
基金supported by the Motion G,Inc.Collaborative Research Project for Fundamental Modeling and Parallel Drive-Control of Servo Drive Systems。
文摘Dear Editor,This letter develops a novel method to implement event-triggered optimal control(ETOC) for discrete-time nonlinear systems using parallel control and deep reinforcement learning(DRL), referred to as Deep-ETOC. The developed Deep-ETOC method introduces the communication cost into the performance index through parallel control, so that the developed method enables control systems to learn ETOC policies directly without triggering conditions.
基金supported in part by the National Key R&D Program of China under Grants 2021YFE0206100in part by the National Natural Science Foundation of China under Grant 62073321+2 种基金in part by National Defense Basic Scientific Research Program JCKY2019203C029in part by the Science and Technology Development Fund,Macao SAR under Grants FDCT-22-009-MISE,0060/2021/A2 and 0015/2020/AMJin part by the financial support from the National Defense Basic Scientific Research Project(JCKY2020130C025).
文摘In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods.To achieve optimal control,RL algorithm based on critic–actor architecture is considered for the nonlinear system.Due to the significant security risks of network transmission,the system is vulnerable to deception attacks,which can make all the system state unavailable.By using the attacked states to design coordinate transformation,the harm brought by unknown deception attacks has been overcome.The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded.Finally,the simulation experiment is shown to prove the effectiveness of the strategy.
基金supported in part by the National Key Reseanch and Development Program of China(2018AAA0101502,2018YFB1702300)in part by the National Natural Science Foundation of China(61722312,61533019,U1811463,61533017)in part by the Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles。
文摘This paper studies the problem of optimal parallel tracking control for continuous-time general nonlinear systems.Unlike existing optimal state feedback control,the control input of the optimal parallel control is introduced into the feedback system.However,due to the introduction of control input into the feedback system,the optimal state feedback control methods can not be applied directly.To address this problem,an augmented system and an augmented performance index function are proposed firstly.Thus,the general nonlinear system is transformed into an affine nonlinear system.The difference between the optimal parallel control and the optimal state feedback control is analyzed theoretically.It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index function.Moreover,an adaptive dynamic programming(ADP)technique is utilized to implement the optimal parallel tracking control using a critic neural network(NN)to approximate the value function online.The stability analysis of the closed-loop system is performed using the Lyapunov theory,and the tracking error and NN weights errors are uniformly ultimately bounded(UUB).Also,the optimal parallel controller guarantees the continuity of the control input under the circumstance that there are finite jump discontinuities in the reference signals.Finally,the effectiveness of the developed optimal parallel control method is verified in two cases.
基金supported in part by National Natural Science Foundation of China(61533017,61273140,61304079,61374105,61379099,61233001)Fundamental Research Funds for the Central Universities(FRF-TP-15-056A3)the Open Research Project from SKLMCCS(20150104)
基金supported in part by the National Key Research and Development Program of China(2018AAA0101502,2018YFB1702300)the National Natural Science Foundation of China(61722312,61533019,U1811463,61533017)。
文摘In this paper,a new parallel controller is developed for continuous-time linear systems.The main contribution of the method is to establish a new parallel control law,where both state and control are considered as the input.The structure of the parallel control is provided,and the relationship between the parallel control and traditional feedback controls is presented.Considering the situations that the systems are controllable and incompletely controllable,the properties of the parallel control law are analyzed.The parallel controller design algorithms are given under the conditions that the systems are controllable and incompletely controllable.Finally,numerical simulations are carried out to demonstrate the effectiveness and applicability of the present method.Index Terms-Continuous-time linear systems,digital twin,parallel controller,parallel intelligence,parallel systems.
基金supported in part by the National Natural Science Foundation of China(61533017,U1501251,61374105,61722312)
文摘The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost.
基金supported in part by the National Natural Science Foundation of China(61722312,61533017,62073321)the National Key Research and Development Program of China(2018YFB1702300)。
文摘This paper investigates the consensus problem for linear multi-agent systems with the heterogeneous disturbances generated by the Brown motion.Its main contribution is that a control scheme is designed to achieve the dynamic consensus for the multi-agent systems in directed topology interfered by stochastic noise.In traditional ways,the coupling weights depending on the communication structure are static.A new distributed controller is designed based on Riccati inequalities,while updating the coupling weights associated with the gain matrix by state errors between adjacent agents.By introducing time-varying coupling weights into this novel control law,the state errors between leader and followers asymptotically converge to the minimum value utilizing the local interaction.Through the Lyapunov directed method and It?formula,the stability of the closed-loop system with the proposed control law is analyzed.Two simulation results conducted by the new and traditional schemes are presented to demonstrate the effectiveness and advantage of the developed control method.
基金Project supported by the National Natural Science Foundation of China (Nos.62076237,62073321,and U1811463)the Youth Innovation Promotion Association,Chinese Academy of Sciences (No.2021130)。
文摘As an interdisciplinary research approach,traditional cognitive science adopts mainly the experiment,induction,modeling,and validation paradigm.Such models are sometimes not applicable in cyber-physical-socialsystems (CPSSs),where the large number of human users involves severe heterogeneity and dynamics.To reduce the decision-making conflicts between people and machines in human-centered systems,we propose a new research paradigm called parallel cognition that uses the system of intelligent techniques to investigate cognitive activities and functionals in three stages:descriptive cognition based on artificial cognitive systems (ACSs),predictive cognition with computational deliberation experiments,and prescriptive cognition via parallel behavioral prescription.To make iteration of these stages constantly on-line,a hybrid learning method based on both a psychological model and user behavioral data is further proposed to adaptively learn an individual’s cognitive knowledge.Preliminary experiments on two representative scenarios,urban travel behavioral prescription and cognitive visual reasoning,indicate that our parallel cognition learning is effective and feasible for human behavioral prescription,and can thus facilitate human-machine cooperation in both complex engineering and social systems.
基金Project supported by the National Key R&D Program of China(No.2018YFB1702300)the National Natural Science Foundation of China(Nos.61722312 and 61533017)。
文摘This paper presents a novel optimal synchronization control method for multi-agent systems with input saturation.The multi-agent game theory is introduced to transform the optimal synchronization control problem into a multi-agent nonzero-sum game.Then,the Nash equilibrium can be achieved by solving the coupled Hamilton–Jacobi–Bellman(HJB)equations with nonquadratic input energy terms.A novel off-policy reinforcement learning method is presented to obtain the Nash equilibrium solution without the system models,and the critic neural networks(NNs)and actor NNs are introduced to implement the presented method.Theoretical analysis is provided,which shows that the iterative control laws converge to the Nash equilibrium.Simulation results show the good performance of the presented method.
文摘Multi-agent system gaming(MASG)is widely applied in military intelligence,information networks,unmanned systems,intelligent transportation,and smart grids,exhibiting systematic and organizational characteristics.It requires the multi-agent system perceive and act in a complex dynamic environment and at the same time achieve a balance between individual interests and the maximization of group interests within the system.