A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then ...A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then it was simplified to a 2-DOF model with reasonable assumptions to design observer and optimal controllers.Then a simplified model was developed for steering system.The numerical simulations were carried out using vehicle parameters for standard maneuvers in dry and wet road conditions.Moreover,the hardware in the loop method was implemented to prove the controller ability in realistic conditions.Simulation results obviously show the effectiveness of NAOC on vehicle handling and reveal that the proposed controller can significantly improve vehicle handling during severe maneuvers.展开更多
This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforce...This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforcement learning(IRL)control algorithm with an actor–critic structure is first proposed.The ATIBLF items are appropriately arranged in every step of the optimized backstepping control design to ensure that the dynamic full-state constraints are never violated.Thus,optimal virtual/actual control in every backstepping subsystem is decomposed with ATIBLF items and also with an adaptive optimized item.Meanwhile,neural networks are used to approximate the gradient value functions.According to the Lyapunov stability theorem,the boundedness of all signals of the closed-loop system is proved,and the proposed control scheme ensures that the system states are within predefined compact sets.Finally,the effectiveness of the proposed control approach is validated by simulations.展开更多
This article studies the adaptive optimal output regulation problem for a class of interconnected singularly perturbed systems(SPSs) with unknown dynamics based on reinforcement learning(RL).Taking into account the sl...This article studies the adaptive optimal output regulation problem for a class of interconnected singularly perturbed systems(SPSs) with unknown dynamics based on reinforcement learning(RL).Taking into account the slow and fast characteristics among system states,the interconnected SPS is decomposed into the slow time-scale dynamics and the fast timescale dynamics through singular perturbation theory.For the fast time-scale dynamics with interconnections,we devise a decentralized optimal control strategy by selecting appropriate weight matrices in the cost function.For the slow time-scale dynamics with unknown system parameters,an off-policy RL algorithm with convergence guarantee is given to learn the optimal control strategy in terms of measurement data.By combining the slow and fast controllers,we establish the composite decentralized adaptive optimal output regulator,and rigorously analyze the stability and optimality of the closed-loop system.The proposed decomposition design not only bypasses the numerical stiffness but also alleviates the high-dimensionality.The efficacy of the proposed methodology is validated by a load-frequency control application of a two-area power system.展开更多
The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures.While synthetic algorithms have been proposed,adaptive control that is compatible ...The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures.While synthetic algorithms have been proposed,adaptive control that is compatible with changing conditions still needs to be used,and time-varying systems are required to be simultaneously estimated with the application of adaptive control.In this research,the identification of structural time-varying dynamic characteristics and optimized simple adaptive control are integrated.First,reduced variations of physical parameters are estimated online using the multiple forgetting factor recursive least squares(MFRLS)method.Then,the energy from the structural vibration is simultaneously specified to optimize the control force with the identified parameters to be operational.Optimization is also performed based on the probability density function of the energy under the seismic excitation at any time.Finally,the optimal control force is obtained by the simple adaptive control(SAC)algorithm and energy coefficient.A numerical example and benchmark structure are employed to investigate the efficiency of the proposed approach.The simulation results revealed the effectiveness of the integrated online identification and optimal adaptive control in systems.展开更多
This paper attempts to study a optimal adaptive con tr ol problem using game theory, and proposes an important practical result that an adaptive processes is a set of sufficient conditions under which pure strategy is...This paper attempts to study a optimal adaptive con tr ol problem using game theory, and proposes an important practical result that an adaptive processes is a set of sufficient conditions under which pure strategy is essentially complete, and thus the fact that yield a very useful desirable pu re optimal control rule.展开更多
We present a novel indirect adaptive fuzzy-regulated optimal control scheme for continuous-time nonlinear systems with unknown dynamics,mismatches,and disturbances.Initially,the Hamilton-Jacobi-Bellman(HJB)equation as...We present a novel indirect adaptive fuzzy-regulated optimal control scheme for continuous-time nonlinear systems with unknown dynamics,mismatches,and disturbances.Initially,the Hamilton-Jacobi-Bellman(HJB)equation associated with its performance function is derived for the original nonlinear systems.Unlike existing adaptive dynamic programming(ADP)approaches,this scheme uses a special non-quadratic variable performance function as the reinforcement medium in the actor-critic architecture.An adaptive fuzzy-regulated critic structure is correspondingly constructed to configure the weighting matrix of the performance function for the purpose of approximating and balancing the HJB equation.A concurrent self-organizing learning technique is designed to adaptively update the critic weights.Based on this particular critic,an adaptive optimal feedback controller is developed as the actor with a new form of augmented Riccati equation to optimize the fuzzy-regulated variable performance function in real time.The result is an online indirect adaptive optimal control mechanism implemented as an actor-critic structure,which involves continuous-time adaptation of both the optimal cost and the optimal control policy.The convergence and closed-loop stability of the proposed system are proved and guaranteed.Simulation examples and comparisons show the effectiveness and advantages of the proposed method.展开更多
The least-squares (LS) algorithm has been used for system modeling for a long time. Without any excitation conditions, only the convergence rate of the common LS algorithm can be obtained. This paper analyzed the we...The least-squares (LS) algorithm has been used for system modeling for a long time. Without any excitation conditions, only the convergence rate of the common LS algorithm can be obtained. This paper analyzed the weighted least-squares (WLS) algorithm and described the good properties of the WLS algorithm. The WLS algorithm was then used for adaptive control of linear stochastic systems to show that the linear closed-loop system was globally stable and that the system identification was consistent. Compared to the past optimal adaptive controller, this controller does not impose restricted conditions on the coefficients of the system, such as knowing the first coefficient before the controller. Without any persistent excitation conditions, the analysis shows that, with the regulation of the adaptive control, the closed-loop system was globally stable and the adaptive controller converged to the one-step-ahead optimal controller in some sense.展开更多
An adaptive inverse optimal attitude controller for flexible spacecraft with fault-free actuator is designed based on adaptive control Lyapunov function and inverse optimal methodology subjected to unknown parameter u...An adaptive inverse optimal attitude controller for flexible spacecraft with fault-free actuator is designed based on adaptive control Lyapunov function and inverse optimal methodology subjected to unknown parameter uncertainties,external disturbances and input saturation.The partial loss of actuator effectiveness and the additive faults are considered simultaneously to deal with actuator faults,and the prior knowledge of bounds on the effectiveness factors of the actuators is assumed to be unknown.A fault-tolerant control version is designed to handle the system with actuator fault by introducing a parameter update law to estimate the lower bound of the partial loss of actuator effectiveness faults.The proposed fault-tolerant attitude controller ensures robustness and stabilization,and it achieves H_∞ optimality with respect to a family of cost functionals.The usefulness of the proposed algorithms is assessed and compared with the conventional approaches through numerical simulations.展开更多
文摘A control algorithm for improving vehicle handling was proposed by applying right angle to the steering wheel,based on the nonlinear adaptive optimal control(NAOC).A nonlinear 4-DOF model was initially developed,then it was simplified to a 2-DOF model with reasonable assumptions to design observer and optimal controllers.Then a simplified model was developed for steering system.The numerical simulations were carried out using vehicle parameters for standard maneuvers in dry and wet road conditions.Moreover,the hardware in the loop method was implemented to prove the controller ability in realistic conditions.Simulation results obviously show the effectiveness of NAOC on vehicle handling and reveal that the proposed controller can significantly improve vehicle handling during severe maneuvers.
基金Project supported by the National Natural Science Foundation of China(Nos.62203392 and 62373329)the Natural Science Foundation of Zhejiang Province,China(No.LY23F030009)the Baima Lake Laboratory Joint Funds of the Zhejiang Provincial Natural Science Foundation of China(No.LBMHD24F030002)。
文摘This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforcement learning(IRL)control algorithm with an actor–critic structure is first proposed.The ATIBLF items are appropriately arranged in every step of the optimized backstepping control design to ensure that the dynamic full-state constraints are never violated.Thus,optimal virtual/actual control in every backstepping subsystem is decomposed with ATIBLF items and also with an adaptive optimized item.Meanwhile,neural networks are used to approximate the gradient value functions.According to the Lyapunov stability theorem,the boundedness of all signals of the closed-loop system is proved,and the proposed control scheme ensures that the system states are within predefined compact sets.Finally,the effectiveness of the proposed control approach is validated by simulations.
基金supported by the National Natural Science Foundation of China (62073327,62273350)the Natural Science Foundation of Jiangsu Province (BK20221112)。
文摘This article studies the adaptive optimal output regulation problem for a class of interconnected singularly perturbed systems(SPSs) with unknown dynamics based on reinforcement learning(RL).Taking into account the slow and fast characteristics among system states,the interconnected SPS is decomposed into the slow time-scale dynamics and the fast timescale dynamics through singular perturbation theory.For the fast time-scale dynamics with interconnections,we devise a decentralized optimal control strategy by selecting appropriate weight matrices in the cost function.For the slow time-scale dynamics with unknown system parameters,an off-policy RL algorithm with convergence guarantee is given to learn the optimal control strategy in terms of measurement data.By combining the slow and fast controllers,we establish the composite decentralized adaptive optimal output regulator,and rigorously analyze the stability and optimality of the closed-loop system.The proposed decomposition design not only bypasses the numerical stiffness but also alleviates the high-dimensionality.The efficacy of the proposed methodology is validated by a load-frequency control application of a two-area power system.
文摘The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures.While synthetic algorithms have been proposed,adaptive control that is compatible with changing conditions still needs to be used,and time-varying systems are required to be simultaneously estimated with the application of adaptive control.In this research,the identification of structural time-varying dynamic characteristics and optimized simple adaptive control are integrated.First,reduced variations of physical parameters are estimated online using the multiple forgetting factor recursive least squares(MFRLS)method.Then,the energy from the structural vibration is simultaneously specified to optimize the control force with the identified parameters to be operational.Optimization is also performed based on the probability density function of the energy under the seismic excitation at any time.Finally,the optimal control force is obtained by the simple adaptive control(SAC)algorithm and energy coefficient.A numerical example and benchmark structure are employed to investigate the efficiency of the proposed approach.The simulation results revealed the effectiveness of the integrated online identification and optimal adaptive control in systems.
文摘This paper attempts to study a optimal adaptive con tr ol problem using game theory, and proposes an important practical result that an adaptive processes is a set of sufficient conditions under which pure strategy is essentially complete, and thus the fact that yield a very useful desirable pu re optimal control rule.
基金Project supported by the National Natural Science Foundation of China(Nos.51805531 and 51675470)the Natural Science Foundation of Jiangsu Province,China(No.BK20150200)+1 种基金the Key R&D Program of Zhejiang Province,China(No.2020C01026)the China Postdoctoral Science Foundation(No.2020M671706)。
文摘We present a novel indirect adaptive fuzzy-regulated optimal control scheme for continuous-time nonlinear systems with unknown dynamics,mismatches,and disturbances.Initially,the Hamilton-Jacobi-Bellman(HJB)equation associated with its performance function is derived for the original nonlinear systems.Unlike existing adaptive dynamic programming(ADP)approaches,this scheme uses a special non-quadratic variable performance function as the reinforcement medium in the actor-critic architecture.An adaptive fuzzy-regulated critic structure is correspondingly constructed to configure the weighting matrix of the performance function for the purpose of approximating and balancing the HJB equation.A concurrent self-organizing learning technique is designed to adaptively update the critic weights.Based on this particular critic,an adaptive optimal feedback controller is developed as the actor with a new form of augmented Riccati equation to optimize the fuzzy-regulated variable performance function in real time.The result is an online indirect adaptive optimal control mechanism implemented as an actor-critic structure,which involves continuous-time adaptation of both the optimal cost and the optimal control policy.The convergence and closed-loop stability of the proposed system are proved and guaranteed.Simulation examples and comparisons show the effectiveness and advantages of the proposed method.
基金the National Natural Science Foundation of China(No.60474026)the Asia Research Center at Tsinghua University
文摘The least-squares (LS) algorithm has been used for system modeling for a long time. Without any excitation conditions, only the convergence rate of the common LS algorithm can be obtained. This paper analyzed the weighted least-squares (WLS) algorithm and described the good properties of the WLS algorithm. The WLS algorithm was then used for adaptive control of linear stochastic systems to show that the linear closed-loop system was globally stable and that the system identification was consistent. Compared to the past optimal adaptive controller, this controller does not impose restricted conditions on the coefficients of the system, such as knowing the first coefficient before the controller. Without any persistent excitation conditions, the analysis shows that, with the regulation of the adaptive control, the closed-loop system was globally stable and the adaptive controller converged to the one-step-ahead optimal controller in some sense.
基金the National High Technology Research and Development Program(863)of China(No.2012AA121602)the Preliminary Research Program of the General Armament Department of China(No.51322050202)
文摘An adaptive inverse optimal attitude controller for flexible spacecraft with fault-free actuator is designed based on adaptive control Lyapunov function and inverse optimal methodology subjected to unknown parameter uncertainties,external disturbances and input saturation.The partial loss of actuator effectiveness and the additive faults are considered simultaneously to deal with actuator faults,and the prior knowledge of bounds on the effectiveness factors of the actuators is assumed to be unknown.A fault-tolerant control version is designed to handle the system with actuator fault by introducing a parameter update law to estimate the lower bound of the partial loss of actuator effectiveness faults.The proposed fault-tolerant attitude controller ensures robustness and stabilization,and it achieves H_∞ optimality with respect to a family of cost functionals.The usefulness of the proposed algorithms is assessed and compared with the conventional approaches through numerical simulations.