A minimax optimal control strategy for quasi-Hamiltonian systems with bounded parametric and/or external disturbances is proposed based on the stochastic averaging method and stochastic differential game. To conduct t...A minimax optimal control strategy for quasi-Hamiltonian systems with bounded parametric and/or external disturbances is proposed based on the stochastic averaging method and stochastic differential game. To conduct the system energy control, the partially averaged Ito stochastic differential equations for the energy processes are first derived by using the stochastic averaging method for quasi-Hamiltonian systems. Combining the above equations with an appropriate performance index, the proposed strategy is searching for an optimal worst-case controller by solving a stochastic differential game problem. The worst-case disturbances and the optimal controls are obtained by solving a Hamilton-Jacobi-Isaacs (HJI) equation. Numerical results for a controlled and stochastically excited DulTlng oscillator with uncertain disturbances exhibit the efficacy of the proposed control strategy.展开更多
In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic no...In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic nonlinear systems. The idea is to use an iterative adaptive dynamic programming(ADP) technique to construct the iterative control law which optimizes the iterative Q function. When the optimal Q function is obtained, the optimal control law can be achieved by directly minimizing the optimal Q function, where the mathematical model of the system is not necessary. Convergence property is analyzed to show that the iterative Q function is monotonically non-increasing and converges to the solution of the optimality equation. It is also proven that any of the iterative control laws is a stable control law. Neural networks are employed to implement the policy iteration based deterministic Q-learning algorithm, by approximating the iterative Q function and the iterative control law, respectively. Finally, two simulation examples are presented to illustrate the performance of the developed algorithm.展开更多
The optimal control of nonlinear systems has been studied for years by many researchers. However, the application of optimal control problem to nonlinear non-affine systems needs more attention. In this paper we propo...The optimal control of nonlinear systems has been studied for years by many researchers. However, the application of optimal control problem to nonlinear non-affine systems needs more attention. In this paper we propose an optimal control design technique for a class of nonlinear and control non-affine equations. The dynamic equations of a flexible shaft supported by a pair of active magnetic bearings (AMBs) are used as the nonlinear control non-affine equations. Mathematical model for the flexible beam is chosen to be the well known Timoshenko beam model, which takes rotary inertia and shear deformations into account, and it is assumed that the shaft is supported by two frictionless bearings at the ends. The effective control of such systems is extremely important for very high angular velocity shafts which are a feature of many modern machines. The control must be able to cope with unbalanced masses and hence be very robust. We shall approach the problem by discretising the Timoshenko beam model and using standard difference formulae to develop a finite-dimensional model of the system. Then we use a recently developed technique for controlling nonlinear systems by reducing the problem to a sequence of linear time-varying (LTV) systems. An optimal control designed for each approximating linear, time-varying system and recent results show that this method will converge uniformly on compact time intervals to the optimal solution.展开更多
This paper discusses a problem of optimal tracking for a linear control system driven by fractional Brownian motion.An equation is obtained for the linear Markov feedback control.The existence and uniqueness of the so...This paper discusses a problem of optimal tracking for a linear control system driven by fractional Brownian motion.An equation is obtained for the linear Markov feedback control.The existence and uniqueness of the solution to the equation are also studied.展开更多
基金the National Natural Science Foundation of China (No. 10772159)the Specialized Research Fund for DoctorProgram of Higher Education of China (No. 20060335125) theNatural Science Foundation of Zhejiang Province (No. Y607087),China
文摘A minimax optimal control strategy for quasi-Hamiltonian systems with bounded parametric and/or external disturbances is proposed based on the stochastic averaging method and stochastic differential game. To conduct the system energy control, the partially averaged Ito stochastic differential equations for the energy processes are first derived by using the stochastic averaging method for quasi-Hamiltonian systems. Combining the above equations with an appropriate performance index, the proposed strategy is searching for an optimal worst-case controller by solving a stochastic differential game problem. The worst-case disturbances and the optimal controls are obtained by solving a Hamilton-Jacobi-Isaacs (HJI) equation. Numerical results for a controlled and stochastically excited DulTlng oscillator with uncertain disturbances exhibit the efficacy of the proposed control strategy.
基金supported in part by National Natural Science Foundation of China(Grant Nos.6137410561233001+1 种基金61273140)in part by Beijing Natural Science Foundation(Grant No.4132078)
文摘In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic nonlinear systems. The idea is to use an iterative adaptive dynamic programming(ADP) technique to construct the iterative control law which optimizes the iterative Q function. When the optimal Q function is obtained, the optimal control law can be achieved by directly minimizing the optimal Q function, where the mathematical model of the system is not necessary. Convergence property is analyzed to show that the iterative Q function is monotonically non-increasing and converges to the solution of the optimality equation. It is also proven that any of the iterative control laws is a stable control law. Neural networks are employed to implement the policy iteration based deterministic Q-learning algorithm, by approximating the iterative Q function and the iterative control law, respectively. Finally, two simulation examples are presented to illustrate the performance of the developed algorithm.
文摘The optimal control of nonlinear systems has been studied for years by many researchers. However, the application of optimal control problem to nonlinear non-affine systems needs more attention. In this paper we propose an optimal control design technique for a class of nonlinear and control non-affine equations. The dynamic equations of a flexible shaft supported by a pair of active magnetic bearings (AMBs) are used as the nonlinear control non-affine equations. Mathematical model for the flexible beam is chosen to be the well known Timoshenko beam model, which takes rotary inertia and shear deformations into account, and it is assumed that the shaft is supported by two frictionless bearings at the ends. The effective control of such systems is extremely important for very high angular velocity shafts which are a feature of many modern machines. The control must be able to cope with unbalanced masses and hence be very robust. We shall approach the problem by discretising the Timoshenko beam model and using standard difference formulae to develop a finite-dimensional model of the system. Then we use a recently developed technique for controlling nonlinear systems by reducing the problem to a sequence of linear time-varying (LTV) systems. An optimal control designed for each approximating linear, time-varying system and recent results show that this method will converge uniformly on compact time intervals to the optimal solution.
基金partially supported by a grant from the Simons Foundation #209206
文摘This paper discusses a problem of optimal tracking for a linear control system driven by fractional Brownian motion.An equation is obtained for the linear Markov feedback control.The existence and uniqueness of the solution to the equation are also studied.