This paper considers the optimal trajectory tracking control problem for near-surface autonomous underwater vehicles(AUVs) in the presence of wave disturbances. An approximate optimal tracking control(AOTC) approach i...This paper considers the optimal trajectory tracking control problem for near-surface autonomous underwater vehicles(AUVs) in the presence of wave disturbances. An approximate optimal tracking control(AOTC) approach is proposed. Firstly, a six-degrees-of-freedom(six-DOF) AUV model with its body-fixed coordinate system is decoupled and simplified and then a nonlinear control model of AUVs in the vertical plane is given. Also, an exosystem model of wave disturbances is constructed based on Hirom approximation formula. Secondly, the time-parameterized desired trajectory which is tracked by the AUV's system is represented by the exosystem. Then, the coupled two-point boundary value(TPBV) problem of optimal tracking control for AUVs is derived from the theory of quadratic optimal control. By using a recently developed successive approximation approach to construct sequences, the coupled TPBV problem is transformed into a problem of solving two decoupled linear differential sequences of state vectors and adjoint vectors. By iteratively solving the two equation sequences, the AOTC law is obtained, which consists of a nonlinear optimal feedback item, an expected output tracking item, a feedforward disturbances rejection item, and a nonlinear compensatory term. Furthermore, a wave disturbances observer model is designed in order to solve the physically realizable problem. Simulation is carried out by using the Remote Environmental Unit(REMUS) AUV model to demonstrate the effectiveness of the proposed algorithm.展开更多
In order to improve the shift quality, a linear quadratic optimal tracking control algorithm for automatic transmission shift process is proposed. The dynamic equations of the shift process are derived using a Lagrang...In order to improve the shift quality, a linear quadratic optimal tracking control algorithm for automatic transmission shift process is proposed. The dynamic equations of the shift process are derived using a Lagrange method. And a powertrain model is built in the Matlab/Simulink and veri- fied by the measurements. Considering the shift jerk and friction loss during the shift process, the tracking trajectories of the turbine speed and output shaft speed are defined. Furthermore, the linear quadratic optimal tracking control performance index is proposed. Based on the Pontryagin' s mini- mum principle, the optimal control law of the shift process is presented. Finally, the simulation study of the 1 - 2 upshift process under different load conditions is carried out with the powertrain model. The simulation results demonstrate that the shift jerk and friction loss can be significantly re- duced by applying the proposed optimal tracking control method.展开更多
An optimal tracking control (OTC) problem for linear time-delay large-scale systems affected by external persistent disturbances is investigated. Based on the internal model principle, a disturbance compensator is c...An optimal tracking control (OTC) problem for linear time-delay large-scale systems affected by external persistent disturbances is investigated. Based on the internal model principle, a disturbance compensator is constructed. The system with persistent disturbances is transformed into an augmented system without persistent disturbances. The original OTC problem of linear time-delay system is transformed into a sequence of linear two- point boundary value (TPBV) problems by introducing a sensitivity parameter and expanding Maclaurin series around it. By solving an OTC law of the augmented system, the OTC law of the original system is obtained. A numerical simulation is provided to illustrate the effectiveness of the proposed method.展开更多
This paper studies the optimal control with zero steady-state error problem for nonlinear large-scale systems affected by external persistent disturbances.The nonlinear large-scale system is transformed into N nonline...This paper studies the optimal control with zero steady-state error problem for nonlinear large-scale systems affected by external persistent disturbances.The nonlinear large-scale system is transformed into N nonlinear subsystems with interconnect terms.Based on the internal model principle,a disturbance compensator is constructed such that the ith subsystem with external persistent disturbances is transformed into an augmented subsystem without disturbances.According to the sensitivity approach,the optimal tracking control law for the ith nonlinear subsystem can be obtained.The optimal tracking control law for the nonlinear large-scale systems can be obtained.A numerical simulation shows that the method is effective.展开更多
This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the...This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.展开更多
In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformat...In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformation, the optimal tracking problem is transformed into an optimal regulation problem, and then the novel optimal tracking control method is proposed. It is shown that for the iterative ADP algorithm with finite approximation error, the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some convergence conditions. Two examples are given to demonstrate the validity of the proposed optimal tracking control scheme for chaotic systems.展开更多
We propose an adaptive learning-based optimal control scheme for height-velocity control models considering model uncertainties and external disturbances of hypersonic winged-cone vehicles.The longitudinal nonlinear m...We propose an adaptive learning-based optimal control scheme for height-velocity control models considering model uncertainties and external disturbances of hypersonic winged-cone vehicles.The longitudinal nonlinear model is first established and transformed into the control-oriented error equations,and the control scheme is organized by a steady-compensation combination.To overcome and eliminate the impact of model uncertainties and external disturbances,an adaptive radial basis function neural network(RBFNN)is designed by a q-gradient approach.Taking the height-velocity error system with estimated uncertainties into account,the adaptive learning-based optimal tracking control(ALOTC)scheme is proposed by combining the critic-only adaptive dynamic programming(ADP)framework and parameter optimization of system settling time.Furthermore,a novel weight update law is proposed to satisfy the online iteration requirements,and the algorithm convergence and closedloop stability are discussed by the Lyapunov theory.Finally,four simulation cases are provided to prove the effectiveness,accuracy,and robustness of the proposed scheme for the hypersonic longitudinal control system.展开更多
In this paper,a stochastic linear quadratic optimal tracking scheme is proposed for unknown linear discrete-time(DT)systems based on adaptive dynamic programming(ADP)algorithm.First,an augmented system composed of the...In this paper,a stochastic linear quadratic optimal tracking scheme is proposed for unknown linear discrete-time(DT)systems based on adaptive dynamic programming(ADP)algorithm.First,an augmented system composed of the original system and the command generator is constructed and then an augmented stochastic algebraic equation is derived based on the augmented system.Next,to obtain the optimal control strategy,the stochastic case is converted into the deterministic one by system transformation,and then an ADP algorithm is proposed with convergence analysis.For the purpose of realizing the ADP algorithm,three back propagation neural networks including model network,critic network and action network are devised to guarantee unknown system model,optimal value function and optimal control strategy,respectively.Finally,the obtained optimal control strategy is applied to the original stochastic system,and two simulations are provided to demonstrate the effectiveness of the proposed algorithm.展开更多
An optimal tracking control problem for a class of nonlinear systems with guaranteed performance and asymmetric input constraints is discussed in this paper.The control policy is implemented by adaptive dynamic progra...An optimal tracking control problem for a class of nonlinear systems with guaranteed performance and asymmetric input constraints is discussed in this paper.The control policy is implemented by adaptive dynamic programming(ADP)algorithm under two event-based triggering mechanisms.It is often challenging to design an optimal control law due to the system deviation caused by asymmetric input constraints.First,a prescribed performance control technique is employed to guarantee the tracking errors within predetermined boundaries.Subsequently,considering the asymmetric input constraints,a discounted non-quadratic cost function is introduced.Moreover,in order to reduce controller updates,an event-triggered control law is developed for ADP algorithm.After that,to further simplify the complexity of controller design,this work is extended to a self-triggered case for relaxing the need for continuous signal monitoring by hardware devices.By employing the Lyapunov method,the uniform ultimate boundedness of all signals is proved to be guaranteed.Finally,a simulation example on a mass–spring–damper system subject to asymmetric input constraints is provided to validate the effectiveness of the proposed control scheme.展开更多
Based on generalized the variation method, by introducing Hamilton function and Lagrange multiplier, this paper proposed a linear quadratic optimal control strategy for an incomplete controllable system with fixed ter...Based on generalized the variation method, by introducing Hamilton function and Lagrange multiplier, this paper proposed a linear quadratic optimal control strategy for an incomplete controllable system with fixed terminal state and time. Applying the proposed optimal control to the simple two-input dual-stage actuator magnetic head positioning system with three degrees-of-freedom, the simulation results show that the system has no residual vibration at the terminal position and time, which can reduce the total access time during head positioning process. To verify the validation of the optimal control strategy of three degrees-of-freedom spring-mass models in actual magnetic head positioning of hard disk drives, a finite element model of an actual magnetic head positioning system is presented. Substituting the optimal control force from simple three degrees-of-freedom spring-mass models into the finite element model, the simulation results show that the magnetic head also has no residual vibration at the end of track-to-track travel. That is to say, the linear quadratic optimal control technique based on simple two-input dual- stage actuator system with three degrees-of-freedom proposed in this paper is of high reliability for the industrial application of an actual magnetic head positioning system.展开更多
基金supported in part by the National Natural Science Foundation of China (41276085)the Natural Science Foundation of Shandong Province (ZR2015FM004)
文摘This paper considers the optimal trajectory tracking control problem for near-surface autonomous underwater vehicles(AUVs) in the presence of wave disturbances. An approximate optimal tracking control(AOTC) approach is proposed. Firstly, a six-degrees-of-freedom(six-DOF) AUV model with its body-fixed coordinate system is decoupled and simplified and then a nonlinear control model of AUVs in the vertical plane is given. Also, an exosystem model of wave disturbances is constructed based on Hirom approximation formula. Secondly, the time-parameterized desired trajectory which is tracked by the AUV's system is represented by the exosystem. Then, the coupled two-point boundary value(TPBV) problem of optimal tracking control for AUVs is derived from the theory of quadratic optimal control. By using a recently developed successive approximation approach to construct sequences, the coupled TPBV problem is transformed into a problem of solving two decoupled linear differential sequences of state vectors and adjoint vectors. By iteratively solving the two equation sequences, the AOTC law is obtained, which consists of a nonlinear optimal feedback item, an expected output tracking item, a feedforward disturbances rejection item, and a nonlinear compensatory term. Furthermore, a wave disturbances observer model is designed in order to solve the physically realizable problem. Simulation is carried out by using the Remote Environmental Unit(REMUS) AUV model to demonstrate the effectiveness of the proposed algorithm.
基金Supported by the National Natural Science Foundation of China(51475043)
文摘In order to improve the shift quality, a linear quadratic optimal tracking control algorithm for automatic transmission shift process is proposed. The dynamic equations of the shift process are derived using a Lagrange method. And a powertrain model is built in the Matlab/Simulink and veri- fied by the measurements. Considering the shift jerk and friction loss during the shift process, the tracking trajectories of the turbine speed and output shaft speed are defined. Furthermore, the linear quadratic optimal tracking control performance index is proposed. Based on the Pontryagin' s mini- mum principle, the optimal control law of the shift process is presented. Finally, the simulation study of the 1 - 2 upshift process under different load conditions is carried out with the powertrain model. The simulation results demonstrate that the shift jerk and friction loss can be significantly re- duced by applying the proposed optimal tracking control method.
基金supported by the National Natural Science Foundation of China(60574023)the Natural Science Foundation of Shandong Province(Z2005G01).
文摘An optimal tracking control (OTC) problem for linear time-delay large-scale systems affected by external persistent disturbances is investigated. Based on the internal model principle, a disturbance compensator is constructed. The system with persistent disturbances is transformed into an augmented system without persistent disturbances. The original OTC problem of linear time-delay system is transformed into a sequence of linear two- point boundary value (TPBV) problems by introducing a sensitivity parameter and expanding Maclaurin series around it. By solving an OTC law of the augmented system, the OTC law of the original system is obtained. A numerical simulation is provided to illustrate the effectiveness of the proposed method.
基金supported by the National Natural Science Foundation of China(No.60574023)the Natural Science Foundation of Shandong Province(No.Z2005G01)
文摘This paper studies the optimal control with zero steady-state error problem for nonlinear large-scale systems affected by external persistent disturbances.The nonlinear large-scale system is transformed into N nonlinear subsystems with interconnect terms.Based on the internal model principle,a disturbance compensator is constructed such that the ith subsystem with external persistent disturbances is transformed into an augmented subsystem without disturbances.According to the sensitivity approach,the optimal tracking control law for the ith nonlinear subsystem can be obtained.The optimal tracking control law for the nonlinear large-scale systems can be obtained.A numerical simulation shows that the method is effective.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61304079 and 61374105)the Beijing Natural Science Foundation,China(Grant Nos.4132078 and 4143065)+2 种基金the China Postdoctoral Science Foundation(Grant No.2013M530527)the Fundamental Research Funds for the Central Universities,China(Grant No.FRF-TP-14-119A2)the Open Research Project from State Key Laboratory of Management and Control for Complex Systems,China(Grant No.20150104)
文摘This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.
基金supported by the Open Research Project from SKLMCCS (Grant No. 20120106)the Fundamental Research Funds for the Central Universities of China (Grant No. FRF-TP-13-018A)+1 种基金the Postdoctoral Science Foundation of China (Grant No. 2013M530527)the National Natural Science Foundation of China (Grant Nos. 61304079, 61125306, and 61034002)
文摘In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformation, the optimal tracking problem is transformed into an optimal regulation problem, and then the novel optimal tracking control method is proposed. It is shown that for the iterative ADP algorithm with finite approximation error, the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some convergence conditions. Two examples are given to demonstrate the validity of the proposed optimal tracking control scheme for chaotic systems.
基金supported by the Natural Science Foundation of Hunan Province(Grant No.2021JJ10045)the National Natural Science Foundation of China(Grant No.11972368)the National Key R&D Program of China(Grant No.2019YFA0405300)。
文摘We propose an adaptive learning-based optimal control scheme for height-velocity control models considering model uncertainties and external disturbances of hypersonic winged-cone vehicles.The longitudinal nonlinear model is first established and transformed into the control-oriented error equations,and the control scheme is organized by a steady-compensation combination.To overcome and eliminate the impact of model uncertainties and external disturbances,an adaptive radial basis function neural network(RBFNN)is designed by a q-gradient approach.Taking the height-velocity error system with estimated uncertainties into account,the adaptive learning-based optimal tracking control(ALOTC)scheme is proposed by combining the critic-only adaptive dynamic programming(ADP)framework and parameter optimization of system settling time.Furthermore,a novel weight update law is proposed to satisfy the online iteration requirements,and the algorithm convergence and closedloop stability are discussed by the Lyapunov theory.Finally,four simulation cases are provided to prove the effectiveness,accuracy,and robustness of the proposed scheme for the hypersonic longitudinal control system.
基金This work was supported by the National Natural Science Foundation of China(No.61873248)the Hubei Provincial Natural Science Foundation of China(Nos.2017CFA030,2015CFA010)the 111 project(No.B17040).
文摘In this paper,a stochastic linear quadratic optimal tracking scheme is proposed for unknown linear discrete-time(DT)systems based on adaptive dynamic programming(ADP)algorithm.First,an augmented system composed of the original system and the command generator is constructed and then an augmented stochastic algebraic equation is derived based on the augmented system.Next,to obtain the optimal control strategy,the stochastic case is converted into the deterministic one by system transformation,and then an ADP algorithm is proposed with convergence analysis.For the purpose of realizing the ADP algorithm,three back propagation neural networks including model network,critic network and action network are devised to guarantee unknown system model,optimal value function and optimal control strategy,respectively.Finally,the obtained optimal control strategy is applied to the original stochastic system,and two simulations are provided to demonstrate the effectiveness of the proposed algorithm.
基金supported in part by the National Natural Science Foundation of China(62033003,62003093,62373113,U23A20341,U21A20522)the Natural Science Foundation of Guangdong Province,China(2023A1515011527,2022A1515011506).
文摘An optimal tracking control problem for a class of nonlinear systems with guaranteed performance and asymmetric input constraints is discussed in this paper.The control policy is implemented by adaptive dynamic programming(ADP)algorithm under two event-based triggering mechanisms.It is often challenging to design an optimal control law due to the system deviation caused by asymmetric input constraints.First,a prescribed performance control technique is employed to guarantee the tracking errors within predetermined boundaries.Subsequently,considering the asymmetric input constraints,a discounted non-quadratic cost function is introduced.Moreover,in order to reduce controller updates,an event-triggered control law is developed for ADP algorithm.After that,to further simplify the complexity of controller design,this work is extended to a self-triggered case for relaxing the need for continuous signal monitoring by hardware devices.By employing the Lyapunov method,the uniform ultimate boundedness of all signals is proved to be guaranteed.Finally,a simulation example on a mass–spring–damper system subject to asymmetric input constraints is provided to validate the effectiveness of the proposed control scheme.
基金Project supported by the National Natural Science Foundation of China (No. 10472038);the Science Foundation of the Ministry of Education of China for Ph.D. Programme (No. 20050730016);the National Science Foundation of China for 0utstanding Young Researchers (No. 10025208).
文摘Based on generalized the variation method, by introducing Hamilton function and Lagrange multiplier, this paper proposed a linear quadratic optimal control strategy for an incomplete controllable system with fixed terminal state and time. Applying the proposed optimal control to the simple two-input dual-stage actuator magnetic head positioning system with three degrees-of-freedom, the simulation results show that the system has no residual vibration at the terminal position and time, which can reduce the total access time during head positioning process. To verify the validation of the optimal control strategy of three degrees-of-freedom spring-mass models in actual magnetic head positioning of hard disk drives, a finite element model of an actual magnetic head positioning system is presented. Substituting the optimal control force from simple three degrees-of-freedom spring-mass models into the finite element model, the simulation results show that the magnetic head also has no residual vibration at the end of track-to-track travel. That is to say, the linear quadratic optimal control technique based on simple two-input dual- stage actuator system with three degrees-of-freedom proposed in this paper is of high reliability for the industrial application of an actual magnetic head positioning system.