In this paper two different control strategies designed to alleviate the response of quasi partially integrable Hamiltonian systems subjected to stochastic excitation are proposed. First, by using the stochastic avera...In this paper two different control strategies designed to alleviate the response of quasi partially integrable Hamiltonian systems subjected to stochastic excitation are proposed. First, by using the stochastic averaging method for quasi partially integrable Hamiltonian systems, an n-DOF controlled quasi partially integrable Hamiltonian system with stochastic excitation is converted into a set of partially averaged It^↑o stochastic differential equations. Then, the dynamical programming equation associated with the partially averaged It^↑o equations is formulated by applying the stochastic dynamical programming principle. In the first control strategy, the optimal control law is derived from the dynamical programming equation and the control constraints without solving the dynamical programming equation. In the second control strategy, the optimal control law is obtained by solving the dynamical programming equation. Finally, both the responses of controlled and uncontrolled systems are predicted through solving the Fokker-Plank-Kolmogorov equation associated with fully averaged It^↑o equations. An example is worked out to illustrate the application and effectiveness of the two proposed control strategies.展开更多
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ...Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.展开更多
Drug treatment, snail control, cercariae control, improved sanitation and health education are the effective strategies which are used to control the schistosomiasis. In this paper, we consider a deterministic model f...Drug treatment, snail control, cercariae control, improved sanitation and health education are the effective strategies which are used to control the schistosomiasis. In this paper, we consider a deterministic model for schistosomiasis transmission dynamics in order to explore the role of the several control strategies. The global stability of a schistosomiasis infection model that involves mating structure including male schistosomes, female schistosomes, paired schistosomes and snails is studied by constructing appropriate Lyapunov functions. We derive the basic reproduction number R0 for the deterministic model, and establish that the global dynamics are completely determined by the values of R0. We show that the disease can be eradicated when R0?≤1;otherwise, the system is persistent. In the case where ?R0?>1, we prove the existence, uniqueness and global asymptotic stability of an endemic steady state. Sensitivity analysis and simulations are carried out in order to determine the relative importance of different control strategies for disease transmission and prevalence. Next, optimal control theory is applied to investigate the control strategies for eliminating schistosomiasis using time dependent controls. The characterization of the optimal control is carried out via the Pontryagins Maximum Principle. The simulation results demonstrate that the insecticide is important in the control of schistosomiasis.展开更多
In this paper, a new approach using linear combination property of intervals and discretization is proposed to solve a class of nonlinear optimal control problems, containing a nonlinear system and linear functional, ...In this paper, a new approach using linear combination property of intervals and discretization is proposed to solve a class of nonlinear optimal control problems, containing a nonlinear system and linear functional, in three phases. In the first phase, using linear combination property of intervals, changes nonlinear system to an equivalent linear system, in the second phase, using discretization method, the attained problem is converted to a linear programming problem, and in the third phase, the latter problem will be solved by linear programming methods. In addition, efficiency of our approach is confirmed by some numerical examples.展开更多
Start-up working condition is the key to the research of optimal engagementof automatic clutch for AMT. In order to guarantee an ideal dynamic performance of the clutchengagement, an optimal controller is designed by ...Start-up working condition is the key to the research of optimal engagementof automatic clutch for AMT. In order to guarantee an ideal dynamic performance of the clutchengagement, an optimal controller is designed by considering throttle angle, engine speed, gearratio, vehicle acceleration and road condition. The minimum value principle is also introduced toachieve an optimal dynamic performance of the nonlinear system compromised in friction plate wearand vehicle drive quality. The optimal trajectory of the clutch engagement can be described in theform of explicit and analytical expressions and characterized by the deterministic and accuratecontrol strategy in stead of indeterministic and soft control techniques which need thousands ofexperiments. For validation of the controller, test work is carried out for the automated clutchengagements in a commercial car with an traditional mechanical transmission, a hydraulic actuator, agroup of sensors and a portable computer system. It is shown through experiments that dynamicbehaviors of the clutch engagement operated by the optimal control are more effective and efficientthan those by fuzzy control.展开更多
The paper introduces a new method for finding optimal control of algebraic dynamic systems. The structure of algebraic dynamical systems is nonlinear with quadratic and bilinear terms. A new hybrid extended Fourier se...The paper introduces a new method for finding optimal control of algebraic dynamic systems. The structure of algebraic dynamical systems is nonlinear with quadratic and bilinear terms. A new hybrid extended Fourier series is introduced, and state and control variables of the system are expanded by this series. Moreover, properties of new series are presented, and integration and product operational matrices are obtained. Using operational matrices, optimal control of the systems is converted to a set of simultaneous nonlinear algebraic relations. An illustrative example is included to compare our results with those in the literature.展开更多
Suppression of the dynamic oscillations of tie-line power exchanges and frequency in the affected interconnected power systems due to loading-condition changes has been assigned as a prominent duty of automatic genera...Suppression of the dynamic oscillations of tie-line power exchanges and frequency in the affected interconnected power systems due to loading-condition changes has been assigned as a prominent duty of automatic generation control(AGC). To alleviate the system oscillation resulting from such load changes, implementation of flexible AC transmission systems(FACTSs) can be considered as one of the practical and effective solutions. In this paper, a thyristor-controlled series compensator(TCSC), which is one series type of the FACTS family, is used to augment the overall dynamic performance of a multi-area multi-source interconnected power system. To this end, we have used a hierarchical adaptive neuro-fuzzy inference system controller-TCSC(HANFISC-TCSC) to abate the two important issues in multi-area interconnected power systems, i.e., low-frequency oscillations and tie-line power exchange deviations. For this purpose, a multi-objective optimization technique is inevitable. Multi-objective particle swarm optimization(MOPSO) has been chosen for this optimization problem, owing to its high performance in untangling non-linear objectives. The efficiency of the suggested HANFISC-TCSC has been precisely evaluated and compared with that of the conventional MOPSO-TCSC in two different multi-area interconnected power systems, i.e., two-area hydro-thermal-diesel and three-area hydro-thermal power systems. The simulation results obtained from both power systems have transparently certified the high performance of HANFISC-TCSC compared to the conventional MOPSO-TCSC.展开更多
In this study,an adaptive neuro-observer-based optimal control(ANOPC)policy is introduced for unknown nonaffine nonlinear systems with control input constraints.Hamilton–Jacobi–Bellman(HJB)framework is employed to m...In this study,an adaptive neuro-observer-based optimal control(ANOPC)policy is introduced for unknown nonaffine nonlinear systems with control input constraints.Hamilton–Jacobi–Bellman(HJB)framework is employed to minimize a non-quadratic cost function corresponding to the constrained control input.ANOPC consists of both analytical and algebraic parts.In the analytical part,first,an observer-based neural network(NN)approximates uncertain system dynamics,and then another NN structure solves the HJB equation.In the algebraic part,the optimal control input that does not exceed the saturation bounds is generated.The weights of two NNs associated with observer and controller are simultaneously updated in an online manner.The ultimately uniformly boundedness(UUB)of all signals of the whole closed-loop system is ensured through Lyapunov’s direct method.Finally,two numerical examples are provided to confirm the effectiveness of the proposed control strategy.展开更多
An identification problem is considered as inaccurate measurements of dynamics on a time interval are given. The model has the form of ordinary differential equations which are linear with respect to unknown parameter...An identification problem is considered as inaccurate measurements of dynamics on a time interval are given. The model has the form of ordinary differential equations which are linear with respect to unknown parameters. A new approach is presented to solve the identification problem in the framework of the optimal control theory. A numerical algorithm based on the dynamic programming method is suggested to identify the unknown parameters. Results of simulations are exposed.展开更多
In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic no...In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic nonlinear systems. The idea is to use an iterative adaptive dynamic programming(ADP) technique to construct the iterative control law which optimizes the iterative Q function. When the optimal Q function is obtained, the optimal control law can be achieved by directly minimizing the optimal Q function, where the mathematical model of the system is not necessary. Convergence property is analyzed to show that the iterative Q function is monotonically non-increasing and converges to the solution of the optimality equation. It is also proven that any of the iterative control laws is a stable control law. Neural networks are employed to implement the policy iteration based deterministic Q-learning algorithm, by approximating the iterative Q function and the iterative control law, respectively. Finally, two simulation examples are presented to illustrate the performance of the developed algorithm.展开更多
Electrohydrostatic actuator(EHA) is a type of power-by-wire actuator that is widely implemented in the aerospace industry for flight control, landing gears, thrust reversers, thrust vector control, and space robots....Electrohydrostatic actuator(EHA) is a type of power-by-wire actuator that is widely implemented in the aerospace industry for flight control, landing gears, thrust reversers, thrust vector control, and space robots. This paper presents the development and evaluation of positionbased impedance control(PBIC) for an EHA. Impedance control provides the actuator with compliance and facilitates the interaction with the environment. Most impedance control applications utilize electrical or valve-controlled hydraulic actuators, whereas this work realizes impedance control via a compact and efficient EHA. The structures of the EHA and PBIC are firstly introduced. A mathematical model of the actuation system is established, and values of its coefficients are identified by particle swarm optimization. This model facilitates the development of a position controller and the selection of target impedance parameters. A nonlinear proportional-integral position controller is developed for the EHA to achieve the accurate positioning requirement of PBIC. The controller compensates for the adverse effect of stiction, and a position accuracy of 0.08 mm is attained.Various experimental results are presented to verify the applicability of PBIC to the EHA. The compliance of the actuator is demonstrated in an impact test.展开更多
基于微分代数控制系统的反馈线性化方法,进一步研究了具有非线性负荷的电力系统中静止无功补偿器(Static var compensator,SVC)和发电机三阶模型的励磁控制,表明具有非线性负荷和SVC装置的NDAS(3)仍可以通过状态反馈精确线性化,从而得...基于微分代数控制系统的反馈线性化方法,进一步研究了具有非线性负荷的电力系统中静止无功补偿器(Static var compensator,SVC)和发电机三阶模型的励磁控制,表明具有非线性负荷和SVC装置的NDAS(3)仍可以通过状态反馈精确线性化,从而得到具有代数方程的Brunovsky标准型。提出了具有非线性负荷的电力系统SVC与发电机励磁控制的完全精确线性化设计。该控制方法可以同时满足发电机功角稳定和SVC节点处电压。仿真结果表明该方法具有很好的效果和优越性。展开更多
基金Supported by National High Technology Research and Development Program of China (863 Program) (2006AA04Z183), National Nat- ural Science Foundation of China (60621001, 60534010, 60572070, 60774048, 60728307), and the Program for Changjiang Scholars and Innovative Research Groups of China (60728307, 4031002)
基金The project supported by the National Natural Science Foundation of China (10332030)Research Fund for Doctoral Program of Higher Education of China(20060335125)
文摘In this paper two different control strategies designed to alleviate the response of quasi partially integrable Hamiltonian systems subjected to stochastic excitation are proposed. First, by using the stochastic averaging method for quasi partially integrable Hamiltonian systems, an n-DOF controlled quasi partially integrable Hamiltonian system with stochastic excitation is converted into a set of partially averaged It^↑o stochastic differential equations. Then, the dynamical programming equation associated with the partially averaged It^↑o equations is formulated by applying the stochastic dynamical programming principle. In the first control strategy, the optimal control law is derived from the dynamical programming equation and the control constraints without solving the dynamical programming equation. In the second control strategy, the optimal control law is obtained by solving the dynamical programming equation. Finally, both the responses of controlled and uncontrolled systems are predicted through solving the Fokker-Plank-Kolmogorov equation associated with fully averaged It^↑o equations. An example is worked out to illustrate the application and effectiveness of the two proposed control strategies.
基金supported in part by the National Natural Science Foundation of China(62222301, 62073085, 62073158, 61890930-5, 62021003)the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5)Beijing Natural Science Foundation (JQ19013)。
文摘Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
文摘Drug treatment, snail control, cercariae control, improved sanitation and health education are the effective strategies which are used to control the schistosomiasis. In this paper, we consider a deterministic model for schistosomiasis transmission dynamics in order to explore the role of the several control strategies. The global stability of a schistosomiasis infection model that involves mating structure including male schistosomes, female schistosomes, paired schistosomes and snails is studied by constructing appropriate Lyapunov functions. We derive the basic reproduction number R0 for the deterministic model, and establish that the global dynamics are completely determined by the values of R0. We show that the disease can be eradicated when R0?≤1;otherwise, the system is persistent. In the case where ?R0?>1, we prove the existence, uniqueness and global asymptotic stability of an endemic steady state. Sensitivity analysis and simulations are carried out in order to determine the relative importance of different control strategies for disease transmission and prevalence. Next, optimal control theory is applied to investigate the control strategies for eliminating schistosomiasis using time dependent controls. The characterization of the optimal control is carried out via the Pontryagins Maximum Principle. The simulation results demonstrate that the insecticide is important in the control of schistosomiasis.
文摘In this paper, a new approach using linear combination property of intervals and discretization is proposed to solve a class of nonlinear optimal control problems, containing a nonlinear system and linear functional, in three phases. In the first phase, using linear combination property of intervals, changes nonlinear system to an equivalent linear system, in the second phase, using discretization method, the attained problem is converted to a linear programming problem, and in the third phase, the latter problem will be solved by linear programming methods. In addition, efficiency of our approach is confirmed by some numerical examples.
文摘Start-up working condition is the key to the research of optimal engagementof automatic clutch for AMT. In order to guarantee an ideal dynamic performance of the clutchengagement, an optimal controller is designed by considering throttle angle, engine speed, gearratio, vehicle acceleration and road condition. The minimum value principle is also introduced toachieve an optimal dynamic performance of the nonlinear system compromised in friction plate wearand vehicle drive quality. The optimal trajectory of the clutch engagement can be described in theform of explicit and analytical expressions and characterized by the deterministic and accuratecontrol strategy in stead of indeterministic and soft control techniques which need thousands ofexperiments. For validation of the controller, test work is carried out for the automated clutchengagements in a commercial car with an traditional mechanical transmission, a hydraulic actuator, agroup of sensors and a portable computer system. It is shown through experiments that dynamicbehaviors of the clutch engagement operated by the optimal control are more effective and efficientthan those by fuzzy control.
文摘The paper introduces a new method for finding optimal control of algebraic dynamic systems. The structure of algebraic dynamical systems is nonlinear with quadratic and bilinear terms. A new hybrid extended Fourier series is introduced, and state and control variables of the system are expanded by this series. Moreover, properties of new series are presented, and integration and product operational matrices are obtained. Using operational matrices, optimal control of the systems is converted to a set of simultaneous nonlinear algebraic relations. An illustrative example is included to compare our results with those in the literature.
文摘Suppression of the dynamic oscillations of tie-line power exchanges and frequency in the affected interconnected power systems due to loading-condition changes has been assigned as a prominent duty of automatic generation control(AGC). To alleviate the system oscillation resulting from such load changes, implementation of flexible AC transmission systems(FACTSs) can be considered as one of the practical and effective solutions. In this paper, a thyristor-controlled series compensator(TCSC), which is one series type of the FACTS family, is used to augment the overall dynamic performance of a multi-area multi-source interconnected power system. To this end, we have used a hierarchical adaptive neuro-fuzzy inference system controller-TCSC(HANFISC-TCSC) to abate the two important issues in multi-area interconnected power systems, i.e., low-frequency oscillations and tie-line power exchange deviations. For this purpose, a multi-objective optimization technique is inevitable. Multi-objective particle swarm optimization(MOPSO) has been chosen for this optimization problem, owing to its high performance in untangling non-linear objectives. The efficiency of the suggested HANFISC-TCSC has been precisely evaluated and compared with that of the conventional MOPSO-TCSC in two different multi-area interconnected power systems, i.e., two-area hydro-thermal-diesel and three-area hydro-thermal power systems. The simulation results obtained from both power systems have transparently certified the high performance of HANFISC-TCSC compared to the conventional MOPSO-TCSC.
文摘In this study,an adaptive neuro-observer-based optimal control(ANOPC)policy is introduced for unknown nonaffine nonlinear systems with control input constraints.Hamilton–Jacobi–Bellman(HJB)framework is employed to minimize a non-quadratic cost function corresponding to the constrained control input.ANOPC consists of both analytical and algebraic parts.In the analytical part,first,an observer-based neural network(NN)approximates uncertain system dynamics,and then another NN structure solves the HJB equation.In the algebraic part,the optimal control input that does not exceed the saturation bounds is generated.The weights of two NNs associated with observer and controller are simultaneously updated in an online manner.The ultimately uniformly boundedness(UUB)of all signals of the whole closed-loop system is ensured through Lyapunov’s direct method.Finally,two numerical examples are provided to confirm the effectiveness of the proposed control strategy.
文摘An identification problem is considered as inaccurate measurements of dynamics on a time interval are given. The model has the form of ordinary differential equations which are linear with respect to unknown parameters. A new approach is presented to solve the identification problem in the framework of the optimal control theory. A numerical algorithm based on the dynamic programming method is suggested to identify the unknown parameters. Results of simulations are exposed.
基金supported in part by National Natural Science Foundation of China(Grant Nos.6137410561233001+1 种基金61273140)in part by Beijing Natural Science Foundation(Grant No.4132078)
文摘In this paper, a novel iterative Q-learning algorithm, called "policy iteration based deterministic Qlearning algorithm", is developed to solve the optimal control problems for discrete-time deterministic nonlinear systems. The idea is to use an iterative adaptive dynamic programming(ADP) technique to construct the iterative control law which optimizes the iterative Q function. When the optimal Q function is obtained, the optimal control law can be achieved by directly minimizing the optimal Q function, where the mathematical model of the system is not necessary. Convergence property is analyzed to show that the iterative Q function is monotonically non-increasing and converges to the solution of the optimality equation. It is also proven that any of the iterative control laws is a stable control law. Neural networks are employed to implement the policy iteration based deterministic Q-learning algorithm, by approximating the iterative Q function and the iterative control law, respectively. Finally, two simulation examples are presented to illustrate the performance of the developed algorithm.
基金completed in the Fluid Power and Tele-Robotics Research Laboratory at the University of Manitobathe supports of the Natural Sciences and Engineering Research Council(NSERC)of Canada+1 种基金China Scholarship Council(CSC)the National Natural Science Foundation of China(Nos.51275021 and 61327807)
文摘Electrohydrostatic actuator(EHA) is a type of power-by-wire actuator that is widely implemented in the aerospace industry for flight control, landing gears, thrust reversers, thrust vector control, and space robots. This paper presents the development and evaluation of positionbased impedance control(PBIC) for an EHA. Impedance control provides the actuator with compliance and facilitates the interaction with the environment. Most impedance control applications utilize electrical or valve-controlled hydraulic actuators, whereas this work realizes impedance control via a compact and efficient EHA. The structures of the EHA and PBIC are firstly introduced. A mathematical model of the actuation system is established, and values of its coefficients are identified by particle swarm optimization. This model facilitates the development of a position controller and the selection of target impedance parameters. A nonlinear proportional-integral position controller is developed for the EHA to achieve the accurate positioning requirement of PBIC. The controller compensates for the adverse effect of stiction, and a position accuracy of 0.08 mm is attained.Various experimental results are presented to verify the applicability of PBIC to the EHA. The compliance of the actuator is demonstrated in an impact test.
文摘基于微分代数控制系统的反馈线性化方法,进一步研究了具有非线性负荷的电力系统中静止无功补偿器(Static var compensator,SVC)和发电机三阶模型的励磁控制,表明具有非线性负荷和SVC装置的NDAS(3)仍可以通过状态反馈精确线性化,从而得到具有代数方程的Brunovsky标准型。提出了具有非线性负荷的电力系统SVC与发电机励磁控制的完全精确线性化设计。该控制方法可以同时满足发电机功角稳定和SVC节点处电压。仿真结果表明该方法具有很好的效果和优越性。