This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforce...This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforcement learning(IRL)control algorithm with an actor–critic structure is first proposed.The ATIBLF items are appropriately arranged in every step of the optimized backstepping control design to ensure that the dynamic full-state constraints are never violated.Thus,optimal virtual/actual control in every backstepping subsystem is decomposed with ATIBLF items and also with an adaptive optimized item.Meanwhile,neural networks are used to approximate the gradient value functions.According to the Lyapunov stability theorem,the boundedness of all signals of the closed-loop system is proved,and the proposed control scheme ensures that the system states are within predefined compact sets.Finally,the effectiveness of the proposed control approach is validated by simulations.展开更多
基金Project supported by the National Natural Science Foundation of China(Nos.62203392 and 62373329)the Natural Science Foundation of Zhejiang Province,China(No.LY23F030009)the Baima Lake Laboratory Joint Funds of the Zhejiang Provincial Natural Science Foundation of China(No.LBMHD24F030002)。
文摘This paper investigates the issue of adaptive optimal tracking control for nonlinear systems with dynamic state constraints.An asymmetric time-varying integral barrier Lyapunov function(ATIBLF)based integral reinforcement learning(IRL)control algorithm with an actor–critic structure is first proposed.The ATIBLF items are appropriately arranged in every step of the optimized backstepping control design to ensure that the dynamic full-state constraints are never violated.Thus,optimal virtual/actual control in every backstepping subsystem is decomposed with ATIBLF items and also with an adaptive optimized item.Meanwhile,neural networks are used to approximate the gradient value functions.According to the Lyapunov stability theorem,the boundedness of all signals of the closed-loop system is proved,and the proposed control scheme ensures that the system states are within predefined compact sets.Finally,the effectiveness of the proposed control approach is validated by simulations.