A generalized policy-iteration-based solution to a class of discrete-time multi-player nonzero-sum games concerning the control constraints was proposed.Based on initial admissible control policies,the iterative value...A generalized policy-iteration-based solution to a class of discrete-time multi-player nonzero-sum games concerning the control constraints was proposed.Based on initial admissible control policies,the iterative value function of each player converges to the optimum approximately,which is structured by the iterative control policies satisfying the Nash equilibrium.Afterwards,the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player.Meanwhile,neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints.Finally,two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.展开更多
In this paper,a data-based feedback relearning algorithm is proposed for the robust control problem of uncertain nonlinear systems.Motivated by the classical on-policy and off-policy algorithms of reinforcement learni...In this paper,a data-based feedback relearning algorithm is proposed for the robust control problem of uncertain nonlinear systems.Motivated by the classical on-policy and off-policy algorithms of reinforcement learning,the online feedback relearning(FR)algorithm is developed where the collected data includes the influence of disturbance signals.The FR algorithm has better adaptability to environmental changes(such as the control channel disturbances)compared with the off-policy algorithm,and has higher computational efficiency and better convergence performance compared with the on-policy algorithm.Data processing based on experience replay technology is used for great data efficiency and convergence stability.Simulation experiments are presented to illustrate convergence stability,optimality and algorithmic performance of FR algorithm by comparison.展开更多
With the incorporation of renewable energy,load frequency control(LFC)becomes more challenging due to uncertain power generation and changeable load demands.The electric vehicle(EV)has been a popular transportation an...With the incorporation of renewable energy,load frequency control(LFC)becomes more challenging due to uncertain power generation and changeable load demands.The electric vehicle(EV)has been a popular transportation and can also provide flexible options to play a role in frequency regulation.In this paper,a novel adaptive composite controller is designed to solve the LFC problem for the interconnected power system with electric vehicles and wind turbine.EVs are used as regulation resources to effectively compensate the power mismatch.First,the sliding mode controller is developed to reduce the random influences caused by the wind turbine generation system.Second,an auxiliary controller with reinforcement learning is proposed to produce adaptive control signals,which will be attached to the primary proportion-integration-differentiation control signal in a realtime manner.Finally,by considering random wind power,load disturbances and output constraints,the proposed scheme is verified on a two-area power system under four different cases.Simulation results demonstrate that the proposed adaptive composite frequency control scheme has a competitive performance with regard to dynamic performance.展开更多
Owing to extensive applications in many fields,the synchronization problem has been widely investigated in multi-agent systems.The synchronization for multi-agent systems is a pivotal issue,which means that under the ...Owing to extensive applications in many fields,the synchronization problem has been widely investigated in multi-agent systems.The synchronization for multi-agent systems is a pivotal issue,which means that under the designed control policy,the output of systems or the state of each agent can be consistent with the leader.The purpose of this paper is to investigate a heuristic dynamic programming(HDP)-based learning tracking control for discrete-time multi-agent systems to achieve synchronization while considering disturbances in systems.Besides,due to the difficulty of solving the coupled Hamilton–Jacobi–Bellman equation analytically,an improved HDP learning control algorithm is proposed to realize the synchronization between the leader and all following agents,which is executed by an action-critic neural network.The action and critic neural network are utilized to learn the optimal control policy and cost function,respectively,by means of introducing an auxiliary action network.Finally,two numerical examples and a practical application of mobile robots are presented to demonstrate the control performance of the HDP-based learning control algorithm.展开更多
Aimed at improving the real-time performance of guidance instruction generation,an analytical hypersonic reentry guidance framework is presented.The key steps of the novel guidance framework are the parameterization o...Aimed at improving the real-time performance of guidance instruction generation,an analytical hypersonic reentry guidance framework is presented.The key steps of the novel guidance framework are the parameterization of reentry guidance problems and the optimization of parameters.First,a quintic polynomial function of energy was designed to describe the altitude profile.Then,according to the altitude-energy profile,the altitude,velocity,flight path angle,and bank angle were obtained analytically,which naturally met the terminal constraints.In addition,the angle of the attack profile was determined using the velocity parameter.The swarm intelligent optimization algorithms were used to optimize the parameters.The path constraints were enforced by the penalty function method.Finally,extensive simulations were carried out in both nominal and dispersed cases,and the simulation results showed that the proposed guidance framework was effective,high-precision,and robust in different scenarios.展开更多
In this paper,the neural network-based adaptive decentralized learning control is investigated for nonlinear interconnected systems with input constraints.Because the decentralized control of interconnected systems is...In this paper,the neural network-based adaptive decentralized learning control is investigated for nonlinear interconnected systems with input constraints.Because the decentralized control of interconnected systems is related to the optimal control of each isolated subsystem,the decentralized control strategy can be established by a series of optimal control policies.A novel policy iteration algorithm is presented to solve the Hamilton–Jacobi–Bellman equation related to the optimal control problem.This algorithm is implemented under the actor-critic structure where both neural networks are simultaneously updated to approximate the optimal control policy and the optimal cost function,respectively.The additional stabilizing term is introduced and an improved weight updating law is derived,which relaxes the requirement of initial admissible control policy.Besides,the input constraints of interconnected systems are taken into account and the Hamilton–Jacobi–Bellman equation is solved in the presence of input constraints.The interconnected system states and the weight approximation errors of two neural networks are proven to be uniformly ultimately bounded by utilizing Lyapunov theory.Finally,the effectiveness of the proposed decentralized learning control method is verified by simulation results.展开更多
With the ever increasing complexity of industrial systems,model-based control has encountered difficulties and is facing problems,while the interest in data-based control has been booming.This paper gives an overview ...With the ever increasing complexity of industrial systems,model-based control has encountered difficulties and is facing problems,while the interest in data-based control has been booming.This paper gives an overview of data-based control,which divides it into two subfields,intelligent modeling and direct controller design.In the two subfields,some important methods concerning data-based control are intensively investigated.Within the framework of data-based modeling,main modeling technologies and control strategies are discussed,and then fundamental concepts and various algorithms are presented for the design of a data-based controller.Finally,some remaining challenges are suggested.展开更多
基金supported by National Natural Science Foundation of China(61125306,61273092,61301035,61304018,and 61411130160)National HighTechnology Research and Development Program of China(2014AA051901)+4 种基金Tianjin Science and Technology Supporting Program(14JCQNJC05400)Research Innovation Program of Tianjin University(2013XQ0101)Hubei Science and Technology Supporting Program(XYJ2014000314)Science Foundation of China Supported by Science and Technology on Aircraft Control Laboratory(20125848004)China Post-doctoral Science Foundation(2014M561559)
基金supported by National Natural Science Foundation of China(61125306,91016004)Foundation of Ministry of Education of China(20110092110020,20120092110026)the Post-Doctoral Research Funds(1108000137,3208004602)
基金National Natural Science Foundation of China,Grant/Award Number:62022061,61773284。
文摘A generalized policy-iteration-based solution to a class of discrete-time multi-player nonzero-sum games concerning the control constraints was proposed.Based on initial admissible control policies,the iterative value function of each player converges to the optimum approximately,which is structured by the iterative control policies satisfying the Nash equilibrium.Afterwards,the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player.Meanwhile,neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints.Finally,two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.
基金supported in part by the National Key Research and Development Program of China(2021YFB1714700)the National Natural Science Foundation of China(62022061,6192100028)。
文摘In this paper,a data-based feedback relearning algorithm is proposed for the robust control problem of uncertain nonlinear systems.Motivated by the classical on-policy and off-policy algorithms of reinforcement learning,the online feedback relearning(FR)algorithm is developed where the collected data includes the influence of disturbance signals.The FR algorithm has better adaptability to environmental changes(such as the control channel disturbances)compared with the off-policy algorithm,and has higher computational efficiency and better convergence performance compared with the on-policy algorithm.Data processing based on experience replay technology is used for great data efficiency and convergence stability.Simulation experiments are presented to illustrate convergence stability,optimality and algorithmic performance of FR algorithm by comparison.
基金Science and Technology Project of SGCC(State Grid Corporation of China),Grant/Award Number:5700‐202212197A‐1‐1‐ZN。
文摘With the incorporation of renewable energy,load frequency control(LFC)becomes more challenging due to uncertain power generation and changeable load demands.The electric vehicle(EV)has been a popular transportation and can also provide flexible options to play a role in frequency regulation.In this paper,a novel adaptive composite controller is designed to solve the LFC problem for the interconnected power system with electric vehicles and wind turbine.EVs are used as regulation resources to effectively compensate the power mismatch.First,the sliding mode controller is developed to reduce the random influences caused by the wind turbine generation system.Second,an auxiliary controller with reinforcement learning is proposed to produce adaptive control signals,which will be attached to the primary proportion-integration-differentiation control signal in a realtime manner.Finally,by considering random wind power,load disturbances and output constraints,the proposed scheme is verified on a two-area power system under four different cases.Simulation results demonstrate that the proposed adaptive composite frequency control scheme has a competitive performance with regard to dynamic performance.
基金This work was supported by Tianjin Natural Science Foundation under Grant 20JCYBJC00880Beijing key Laboratory Open Fund of Long-Life Technology of Precise Rotation and Transmission MechanismsGuangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control.
文摘Owing to extensive applications in many fields,the synchronization problem has been widely investigated in multi-agent systems.The synchronization for multi-agent systems is a pivotal issue,which means that under the designed control policy,the output of systems or the state of each agent can be consistent with the leader.The purpose of this paper is to investigate a heuristic dynamic programming(HDP)-based learning tracking control for discrete-time multi-agent systems to achieve synchronization while considering disturbances in systems.Besides,due to the difficulty of solving the coupled Hamilton–Jacobi–Bellman equation analytically,an improved HDP learning control algorithm is proposed to realize the synchronization between the leader and all following agents,which is executed by an action-critic neural network.The action and critic neural network are utilized to learn the optimal control policy and cost function,respectively,by means of introducing an auxiliary action network.Finally,two numerical examples and a practical application of mobile robots are presented to demonstrate the control performance of the HDP-based learning control algorithm.
基金co-supported by the National Natural Science Foundation of China(No.61773387)Tianjin Natural Science Foundation,China(No.20JCYBJC00880)。
文摘Aimed at improving the real-time performance of guidance instruction generation,an analytical hypersonic reentry guidance framework is presented.The key steps of the novel guidance framework are the parameterization of reentry guidance problems and the optimization of parameters.First,a quintic polynomial function of energy was designed to describe the altitude profile.Then,according to the altitude-energy profile,the altitude,velocity,flight path angle,and bank angle were obtained analytically,which naturally met the terminal constraints.In addition,the angle of the attack profile was determined using the velocity parameter.The swarm intelligent optimization algorithms were used to optimize the parameters.The path constraints were enforced by the penalty function method.Finally,extensive simulations were carried out in both nominal and dispersed cases,and the simulation results showed that the proposed guidance framework was effective,high-precision,and robust in different scenarios.
基金This work was supported by the National Key R&D Program of China(No.2018AAA0101400)the National Natural Science Foundation of China(Nos.62022061,61921004).
文摘In this paper,the neural network-based adaptive decentralized learning control is investigated for nonlinear interconnected systems with input constraints.Because the decentralized control of interconnected systems is related to the optimal control of each isolated subsystem,the decentralized control strategy can be established by a series of optimal control policies.A novel policy iteration algorithm is presented to solve the Hamilton–Jacobi–Bellman equation related to the optimal control problem.This algorithm is implemented under the actor-critic structure where both neural networks are simultaneously updated to approximate the optimal control policy and the optimal cost function,respectively.The additional stabilizing term is introduced and an improved weight updating law is derived,which relaxes the requirement of initial admissible control policy.Besides,the input constraints of interconnected systems are taken into account and the Hamilton–Jacobi–Bellman equation is solved in the presence of input constraints.The interconnected system states and the weight approximation errors of two neural networks are proven to be uniformly ultimately bounded by utilizing Lyapunov theory.Finally,the effectiveness of the proposed decentralized learning control method is verified by simulation results.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.60874013,60953001 and 61034002).
文摘With the ever increasing complexity of industrial systems,model-based control has encountered difficulties and is facing problems,while the interest in data-based control has been booming.This paper gives an overview of data-based control,which divides it into two subfields,intelligent modeling and direct controller design.In the two subfields,some important methods concerning data-based control are intensively investigated.Within the framework of data-based modeling,main modeling technologies and control strategies are discussed,and then fundamental concepts and various algorithms are presented for the design of a data-based controller.Finally,some remaining challenges are suggested.