Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, anothe...Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, another free form cost function was introduced to express the physical need plainly and optimize weights of LQ cost function using the search algorithms. As an instance, DLQR was applied in determining the control input in the front steering angle compensation control (FSAC) model for heavy duty vehicles. The brief simulations show that DLQR is powerful enough to specify the engineering requirements correctly and balance many factors effectively. The concept and applicable field of LQR are expanded by DLQR to optimize the system with a free form cost function.展开更多
This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state...This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state pair are unique respectively, the optimal control can be synthesized as state feedback and the closed-loop system is regular, stable and impulse-free.展开更多
The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this...The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this signal is used to identify the CARMA model parameters by recursive forgetting factors least square method. The linear quadratic integral (LQI) control method for the active suspension is presented. The LQI control algorithm is fit for vehicle suspension control, for the control performance index can comprise multi controlled variables. The simulation results show that the vertical acceleration and suspension travel both are decreased with the LQI control in the low frequency band, and the suspension travel is increased with the LQI control in the middle or high frequency band. The suspension travel is very small in the middle or high frequency band, the suspension bottoming stop will not happen, so the vehicle ride quality can be improved apparently by the LQI control.展开更多
Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation...Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation is presented. It gives a uniform way to solve the linear quadratic control (LQ control) problems for linear timevarying systems accurately and efficiently, whose key points are solutions of differential Riccati equation (DRE) with variable coefficients and the state feedback equation. The method is symplectic conservative and has a good numerical stability and high precision. Numerical examples demonstrate the effectiveness of the proposed method.展开更多
In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with...In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with multiple players is presented and analytical solution is given for a type of differential games in which the system matrixcan be diagonalizable. As the special cases, the Nash equilibria for some type of differential games with particular structure is studied also, and some results in previous literatures are extended. Finally, a numerical example is given to illustrate the effectiveness of the solution procedure.展开更多
This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and ana...This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and analyses the performance of Linear Quadratic Regulator(LQR)with Kalman filter and H∞filter as two filter configurations.The LQR is a technique for developing practical feedback,in addition the desired x shows the vector of desirable states and is used as the external input to the closed-loop system.The effectiveness of the two filters in FP or RIP are measured and contrasted with rise time,peak time,settling time and maximum peak overshoot for time domain performance.The filters are also tested with gain margin,phase margin,disk stability margins for frequency domain performance and worst case stability margins for performance due to uncertainties.The H-infinity filter reduces the estimate error to a minimum,making it resilient in the worst case than the standard Kalman filter.Further,when theβrestriction value lowers,the H∞filter becomes more robust.The worst case gain performance is also focused for the two filter configurations and tested where H∞filter is found to outperform towards robust stability and performance.Also the switchover between the two filters is dependent upon a user-specified co-efficient that gives the flexibility in the design of non-linear systems.The non-linear process is tested for set point tracking,disturbance rejection,un-modelled noise dynamics and uncertainties,which records robust performance towards stability.展开更多
This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discret...This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discretization-then-continuousization is proposed in this paper to cope with the infinite-dimensional nature of PDE systems.The contributions of this paper consist of the following aspects:(1)The differential Riccati equations and the solvability condition of the LQ optimal control problems are obtained via the discretization-then-continuousization method.(2)A numerical calculation way of the differential Riccati equations and a practical design way of the optimal controller are proposed.Meanwhile,the relationship between the optimal costate and the optimal state is established by solving a set of forward and backward partial difference equations(FBPDEs).(3)The correctness of the method used in this paper is verified by a complementary continuous method and the comparative analysis with the existing operator results is presented.It is shown that the proposed results not only contain the classic results of the standard LQ control problem of systems governed by ordinary differential equations as a special case,but also support the existing operator results and give a more convenient form of computation.展开更多
In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimen...In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimensional backward stochastic differential equations (BSDEs) driven by Levy pro- cesses, the authors proceed to study a stochastic linear quadratic (LQ) optimal control problem with a Levy process, where the cost weighting matrices of the state and control are allowed to be indefinite. One kind of new stochastic Riccati equation that involves equality and inequality constraints is derived from the idea of square completion and its solvability is proved to be sufficient for the well-posedness and the existence of optimal control which can be of either state feedback or open-loop form of the LQ problems. Moreover, the authors obtain the existence and uniqueness of the solution to the Riccati equation for some special cases. Finally, two examples are presented to illustrate these theoretical results.展开更多
In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single tar...In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single target is modeled as a two-pursuit single-evader non-zero-sum linear quadratic differential game. The intercept space is decomposed into three subspaces which are mutually disjoint and their union covers the entire intercept space. The effect of adding the second interceptor arises in the intercept space of both P and Q (PQ-intercept space). A guidance law is derived from the Nash equilibrium strategy set (NESS) of the game. Simulation studies are focused on the PQ-intercept space. It is indicated that 1) increasing the target's maneuverability will enlarge PQ-intercept space; 2) the handover conditions will be released if the initial zero-effort-miss (ZEM) of both interceptors has opposite sign; 3) overvaluation of the target's maneuverability by choosing a small weight coefficient will generate robust performance with respect to the target maneuvering command switch time and decrease the fuel requirement; and 4) cooperation between interceptors increases the interception probability.展开更多
We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajecto...We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajectory generation algorithm,and trajectory tracking controller design.A lattice planner,which searches over a 3D(x,y,θ) configuration space,is adopted to generate the global trajectory.The ILQR method is used to design a local trajectory tracking controller.The effectiveness of the proposed method is demonstrated in simulation and experiment with a significantly asymmetric differential drive robot.The performance of the local controller is analyzed and compared with that of the existing linear quadratic regulator(LQR) method.According to the experiments,the new controller improves the control sequences(v,ω) iteratively and produces slightly better results.Specifically,two trajectories,'S' and '8' courses,are followed with sufficient accuracy using the proposed controller.展开更多
This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling app...This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation(ODE)system,which characterizes a necessary and sufficient condition for asymptotic solvability.The rescaling technique is further used for performance estimates,establishing an O(1/N)-Nash equilibrium for the obtained decentralized strategies.展开更多
The selection of weighting matrix in design of the linear quadratic optimal controller is an important topic in the control theory. In this paper, an approach based on genetic algorithm is presented for selecting the ...The selection of weighting matrix in design of the linear quadratic optimal controller is an important topic in the control theory. In this paper, an approach based on genetic algorithm is presented for selecting the weighting matrix for the optimal controller. Genetic algorithm is adaptive heuristic search algorithm premised on the evolutionary ideas of natural selection and genetic. In this algorithm, the fitness function is used to evaluate individuals and reproductive success varies with fitness. In the design of the linear quadratic optimal controller, the fitness function has relation to the anticipated step response of the system. Not only can the controller designed by this approach meet the demand of the performance indexes of linear quadratic controller, but also satisfy the anticipated step response of close-loop system. The method possesses a higher calculating efficiency and provides technical support for the optimal controller in engineering application. The simulation of a three-order single-input single-output (SISO) system has demonstrated the feasibility and validity of the approach.展开更多
This paper studies linear quadratic games problem for stochastic Volterra integral equations(SVIEs in short) where necessary and sufficient conditions for the existence of saddle points are derived in two different wa...This paper studies linear quadratic games problem for stochastic Volterra integral equations(SVIEs in short) where necessary and sufficient conditions for the existence of saddle points are derived in two different ways.As a consequence,the open problems raised by Chen and Yong(2007) are solved.To characterize the saddle points more clearly,coupled forward-backward stochastic Volterra integral equations and stochastic Fredholm-Volterra integral equations are introduced.Compared with deterministic game problems,some new terms arising from the procedure of deriving the later equations reflect well the essential nature of stochastic systems.Moreover,our representations and arguments are even new in the classical SDEs case.展开更多
In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy o...In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.展开更多
Purpose The purpose of this paper is to study a new method to improve the performance of the magnet power supply in the experimental ring of HIRFL-CSR.Methods A hybrid genetic particle swarm optimization algorithm is ...Purpose The purpose of this paper is to study a new method to improve the performance of the magnet power supply in the experimental ring of HIRFL-CSR.Methods A hybrid genetic particle swarm optimization algorithm is introduced,and the algorithm is applied to the optimal design of the LQR controller of pulse width modulated power supply.The fitness function of hybrid genetic particle swarm optimization is a multi-objective function,which combined the current and voltage,so that the dynamic performance of the closed-loop system can be better.The hybrid genetic particle swarm algorithm is applied to determine LQR controlling matrices Q and R.Results The simulation results show that adoption of this method leads to good transient responses,and the computational time is shorter than in the traditional trial and error methods.Conclusions The results presented in this paper show that the proposed method is robust,efficient and feasible,and the dynamic and static performance of the accelerator PWM power supply has been considerably improved.展开更多
We consider the optimal control problem for a linear conditional McKeanVlasov equation with quadratic cost functional.The coefficients of the system and the weighting matrices in the cost functional are allowed to be ...We consider the optimal control problem for a linear conditional McKeanVlasov equation with quadratic cost functional.The coefficients of the system and the weighting matrices in the cost functional are allowed to be adapted processes with respect to the common noise filtration.Semi closed-loop strategies are introduced,and following the dynamic programming approach in(Pham and Wei,Dynamic programming for optimal control of stochastic McKean-Vlasov dynamics,2016),we solve the problem and characterize time-consistent optimal control by means of a system of decoupled backward stochastic Riccati differential equations.We present several financial applications with explicit solutions,and revisit,in particular,optimal tracking problems with price impact,and the conditional mean-variance portfolio selection in an incomplete market model.展开更多
A switched linear quadratic(LQ) differential game over finite-horizon is investigated in this paper. The switching signal is regarded as a non-conventional player, afterwards the definition of Pareto efficiency is e...A switched linear quadratic(LQ) differential game over finite-horizon is investigated in this paper. The switching signal is regarded as a non-conventional player, afterwards the definition of Pareto efficiency is extended to dynamics switching situations to characterize the solutions of this multi-objective problem. Furthermore, the switched differential game is equivalently transformed into a family of parameterized single-objective optimal problems by introducing preference information and auxiliary variables. This transformation reduces the computing complexity such that the Pareto frontier of the switched LQ differential game can be constructed by dynamic programming. Finally, a numerical example is provided to illustrate the effectiveness.展开更多
An optimal control problem is studied for a linear mean-field stochastic differential equation with a quadratic cost functional.The coefficients and the weighting matrices in the cost functional are all assumed to be ...An optimal control problem is studied for a linear mean-field stochastic differential equation with a quadratic cost functional.The coefficients and the weighting matrices in the cost functional are all assumed to be deterministic.Closedloop strategies are introduced,which require to be independent of initial states;and such a nature makes it very useful and convenient in applications.In this paper,the existence of an optimal closed-loop strategy for the system(also called the closedloop solvability of the problem)is characterized by the existence of a regular solution to the coupled two(generalized)Riccati equations,together with some constraints on the adapted solution to a linear backward stochastic differential equation and a linear terminal value problem of an ordinary differential equation.展开更多
One of the fundamental issues in Control Theory is to design feedback controls.It is well-known that,the purpose of introducing Riccati equations in the study of deterministic linear quadratic control problems is exac...One of the fundamental issues in Control Theory is to design feedback controls.It is well-known that,the purpose of introducing Riccati equations in the study of deterministic linear quadratic control problems is exactly to construct the desired feedbacks.To date,the same problem in the stochastic setting is only partially well-understood.In this paper,we establish the equivalence between the existence of optimal feedback controls for the stochastic linear quadratic control problems with random coefficients and the solvability of the corresponding backward stochastic Riccati equations in a suitable sense.We also give a counterexample showing the nonexistence of feedback controls to a solvable stochastic linear quadratic control problem.This is a new phenomenon in the stochastic setting,significantly different from its deterministic counterpart.展开更多
In this paper,a leader-follower stochastic differential game is studied for a linear stochastic differential equation with quadratic cost functionals.The coefficients in the state equation and the weighting matrices i...In this paper,a leader-follower stochastic differential game is studied for a linear stochastic differential equation with quadratic cost functionals.The coefficients in the state equation and the weighting matrices in the cost functionals are all deterministic.Closed-loop strategies are introduced,which require to be independent of initial states;and such a nature makes it very useful and convenient in applications.The follower first solves a stochastic linear quadratic optimal control problem,and his optimal closed-loop strategy is characterized by a Riccati equation,together with an adapted solution to a linear backward stochastic differential equation.Then the leader turns to solve a stochastic linear quadratic optimal control problem of a forward-backward stochastic differential equation,necessary conditions for the existence of the optimal closed-loop strategy for the leader is given by a Riccati equation.Some examples are also given.展开更多
文摘Double cost function linear quadratic regulator (DLQR) is developed from LQR theory to solve an optimal control problem with a general nonlinear cost function. In addition to the traditional LQ cost function, another free form cost function was introduced to express the physical need plainly and optimize weights of LQ cost function using the search algorithms. As an instance, DLQR was applied in determining the control input in the front steering angle compensation control (FSAC) model for heavy duty vehicles. The brief simulations show that DLQR is powerful enough to specify the engineering requirements correctly and balance many factors effectively. The concept and applicable field of LQR are expanded by DLQR to optimize the system with a free form cost function.
基金This work was supported by Natural Science Foundation of Shandong Province (No. Y2004A05, Y2004A07)Science Technology Planning Project of Shandong Provincial Education Department(No. J05P51) and Science Research Foundation of Shandong Economic University
文摘This paper deals with the problem of singular linear quadratic performance with the worst-disturbance rejection for descriptor systems. Under the conditions we give, the worst-disturbance and the optimal control-state pair are unique respectively, the optimal control can be synthesized as state feedback and the closed-loop system is regular, stable and impulse-free.
文摘The quarter model of an active suspension is established in the form of controllable autoregressive moving average (CARMA) model. An accelerometer can be mounted on the wheel hub for measuring road disturbance; this signal is used to identify the CARMA model parameters by recursive forgetting factors least square method. The linear quadratic integral (LQI) control method for the active suspension is presented. The LQI control algorithm is fit for vehicle suspension control, for the control performance index can comprise multi controlled variables. The simulation results show that the vertical acceleration and suspension travel both are decreased with the LQI control in the low frequency band, and the suspension travel is increased with the LQI control in the middle or high frequency band. The suspension travel is very small in the middle or high frequency band, the suspension bottoming stop will not happen, so the vehicle ride quality can be improved apparently by the LQI control.
基金Project supported by the National Natural Science Foundation of China (No.10202004)
文摘Optimal control system of state space is a conservative system, whose approximate method should be symplectic conservation. Based on the precise integration method, an algorithm of symplectic conservative perturbation is presented. It gives a uniform way to solve the linear quadratic control (LQ control) problems for linear timevarying systems accurately and efficiently, whose key points are solutions of differential Riccati equation (DRE) with variable coefficients and the state feedback equation. The method is symplectic conservative and has a good numerical stability and high precision. Numerical examples demonstrate the effectiveness of the proposed method.
基金This work was supported by the National Natural Science Foundation of China(No.60474029)China Postdoctoral Science Foundation (No.2005038558)
文摘In this paper, the Nash equilibria for differential games with multiple players is studied. A method for solving the Riccati-type matrix differential equations for open-loop Nash strategy in linear quadratic game with multiple players is presented and analytical solution is given for a type of differential games in which the system matrixcan be diagonalizable. As the special cases, the Nash equilibria for some type of differential games with particular structure is studied also, and some results in previous literatures are extended. Finally, a numerical example is given to illustrate the effectiveness of the solution procedure.
文摘This paper deals with Furuta Pendulum(FP)or Rotary Inverted Pendulum(RIP),which is an under-actuated non-minimum unstable non-linear process.The process considered along with uncertainties which are unmodelled and analyses the performance of Linear Quadratic Regulator(LQR)with Kalman filter and H∞filter as two filter configurations.The LQR is a technique for developing practical feedback,in addition the desired x shows the vector of desirable states and is used as the external input to the closed-loop system.The effectiveness of the two filters in FP or RIP are measured and contrasted with rise time,peak time,settling time and maximum peak overshoot for time domain performance.The filters are also tested with gain margin,phase margin,disk stability margins for frequency domain performance and worst case stability margins for performance due to uncertainties.The H-infinity filter reduces the estimate error to a minimum,making it resilient in the worst case than the standard Kalman filter.Further,when theβrestriction value lowers,the H∞filter becomes more robust.The worst case gain performance is also focused for the two filter configurations and tested where H∞filter is found to outperform towards robust stability and performance.Also the switchover between the two filters is dependent upon a user-specified co-efficient that gives the flexibility in the design of non-linear systems.The non-linear process is tested for set point tracking,disturbance rejection,un-modelled noise dynamics and uncertainties,which records robust performance towards stability.
基金supported by the National Natural Science Foundation of China under Grant Nos.61821004 and 62250056the Natural Science Foundation of Shandong Province under Grant Nos.ZR2021ZD14 and ZR2021JQ24+1 种基金Science and Technology Project of Qingdao West Coast New Area under Grant Nos.2019-32,2020-20,2020-1-4,High-level Talent Team Project of Qingdao West Coast New Area under Grant No.RCTDJC-2019-05Key Research and Development Program of Shandong Province under Grant No.2020CXGC01208.
文摘This paper focuses on linear-quadratic(LQ)optimal control for a class of systems governed by first-order hyperbolic partial differential equations(PDEs).Different from most of the previous works,an approach of discretization-then-continuousization is proposed in this paper to cope with the infinite-dimensional nature of PDE systems.The contributions of this paper consist of the following aspects:(1)The differential Riccati equations and the solvability condition of the LQ optimal control problems are obtained via the discretization-then-continuousization method.(2)A numerical calculation way of the differential Riccati equations and a practical design way of the optimal controller are proposed.Meanwhile,the relationship between the optimal costate and the optimal state is established by solving a set of forward and backward partial difference equations(FBPDEs).(3)The correctness of the method used in this paper is verified by a complementary continuous method and the comparative analysis with the existing operator results is presented.It is shown that the proposed results not only contain the classic results of the standard LQ control problem of systems governed by ordinary differential equations as a special case,but also support the existing operator results and give a more convenient form of computation.
基金This work was supported by the National Basic Research Program of China (973 Program) under Grant No. 2007CB814904the Natural Science Foundation of China under Grant No. 10671112+1 种基金Shandong Province under Grant No. Z2006A01Research Fund for the Doctoral Program of Higher Education of China under Grant No. 20060422018
文摘In this paper, tile authors first study two kinds of stochastic differential equations (SDEs) with Levy processes as noise source. Based on the existence and uniqueness of the solutions of these SDEs and multi-dimensional backward stochastic differential equations (BSDEs) driven by Levy pro- cesses, the authors proceed to study a stochastic linear quadratic (LQ) optimal control problem with a Levy process, where the cost weighting matrices of the state and control are allowed to be indefinite. One kind of new stochastic Riccati equation that involves equality and inequality constraints is derived from the idea of square completion and its solvability is proved to be sufficient for the well-posedness and the existence of optimal control which can be of either state feedback or open-loop form of the LQ problems. Moreover, the authors obtain the existence and uniqueness of the solution to the Riccati equation for some special cases. Finally, two examples are presented to illustrate these theoretical results.
文摘In order to intercept the future targets that are characterized by high maneuverability, multiple interceptors may be launched and aimed at single target. The scenario of two missiles P and Q intercepting a single target is modeled as a two-pursuit single-evader non-zero-sum linear quadratic differential game. The intercept space is decomposed into three subspaces which are mutually disjoint and their union covers the entire intercept space. The effect of adding the second interceptor arises in the intercept space of both P and Q (PQ-intercept space). A guidance law is derived from the Nash equilibrium strategy set (NESS) of the game. Simulation studies are focused on the PQ-intercept space. It is indicated that 1) increasing the target's maneuverability will enlarge PQ-intercept space; 2) the handover conditions will be released if the initial zero-effort-miss (ZEM) of both interceptors has opposite sign; 3) overvaluation of the target's maneuverability by choosing a small weight coefficient will generate robust performance with respect to the target maneuvering command switch time and decrease the fuel requirement; and 4) cooperation between interceptors increases the interception probability.
基金Project (Nos. 90920304 and 91120015) supported by the National Natural Science Foundation of China
文摘We present an iterative linear quadratic regulator(ILQR) method for trajectory tracking control of a wheeled mobile robot system.The proposed scheme involves a kinematic model linearization technique,a global trajectory generation algorithm,and trajectory tracking controller design.A lattice planner,which searches over a 3D(x,y,θ) configuration space,is adopted to generate the global trajectory.The ILQR method is used to design a local trajectory tracking controller.The effectiveness of the proposed method is demonstrated in simulation and experiment with a significantly asymmetric differential drive robot.The performance of the local controller is analyzed and compared with that of the existing linear quadratic regulator(LQR) method.According to the experiments,the new controller improves the control sequences(v,ω) iteratively and produces slightly better results.Specifically,two trajectories,'S' and '8' courses,are followed with sufficient accuracy using the proposed controller.
基金Natural Sciences and Engineering Research Council(NSERC)of Canada。
文摘This paper studies an asymptotic solvability problem for linear quadratic(LQ)mean field games with controlled diffusions and indefinite weights for the state and control in the costs.The authors employ a rescaling approach to derive a low dimensional Riccati ordinary differential equation(ODE)system,which characterizes a necessary and sufficient condition for asymptotic solvability.The rescaling technique is further used for performance estimates,establishing an O(1/N)-Nash equilibrium for the obtained decentralized strategies.
文摘The selection of weighting matrix in design of the linear quadratic optimal controller is an important topic in the control theory. In this paper, an approach based on genetic algorithm is presented for selecting the weighting matrix for the optimal controller. Genetic algorithm is adaptive heuristic search algorithm premised on the evolutionary ideas of natural selection and genetic. In this algorithm, the fitness function is used to evaluate individuals and reproductive success varies with fitness. In the design of the linear quadratic optimal controller, the fitness function has relation to the anticipated step response of the system. Not only can the controller designed by this approach meet the demand of the performance indexes of linear quadratic controller, but also satisfy the anticipated step response of close-loop system. The method possesses a higher calculating efficiency and provides technical support for the optimal controller in engineering application. The simulation of a three-order single-input single-output (SISO) system has demonstrated the feasibility and validity of the approach.
基金supported by National Basic Research Program of China(973 Program)(Grant No.2011CB808002)National Natural Science Foundation of China(Grant Nos.11231007,11301298,11471231,11401404,11371226,11071145 and 11231005)+2 种基金China Postdoctoral Science Foundation(Grant No.2014M562321)Foundation for Innovative Research Groups of National Natural Science Foundation of China(Grant No.11221061)the Program for Introducing Talents of Discipline to Universities(the National 111Project of China's Higher Education)(Grant No.B12023)
文摘This paper studies linear quadratic games problem for stochastic Volterra integral equations(SVIEs in short) where necessary and sufficient conditions for the existence of saddle points are derived in two different ways.As a consequence,the open problems raised by Chen and Yong(2007) are solved.To characterize the saddle points more clearly,coupled forward-backward stochastic Volterra integral equations and stochastic Fredholm-Volterra integral equations are introduced.Compared with deterministic game problems,some new terms arising from the procedure of deriving the later equations reflect well the essential nature of stochastic systems.Moreover,our representations and arguments are even new in the classical SDEs case.
文摘In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.
文摘Purpose The purpose of this paper is to study a new method to improve the performance of the magnet power supply in the experimental ring of HIRFL-CSR.Methods A hybrid genetic particle swarm optimization algorithm is introduced,and the algorithm is applied to the optimal design of the LQR controller of pulse width modulated power supply.The fitness function of hybrid genetic particle swarm optimization is a multi-objective function,which combined the current and voltage,so that the dynamic performance of the closed-loop system can be better.The hybrid genetic particle swarm algorithm is applied to determine LQR controlling matrices Q and R.Results The simulation results show that adoption of this method leads to good transient responses,and the computational time is shorter than in the traditional trial and error methods.Conclusions The results presented in this paper show that the proposed method is robust,efficient and feasible,and the dynamic and static performance of the accelerator PWM power supply has been considerably improved.
基金work is part of the ANR project CAESARS(ANR-15-CE05-0024)lso supported by FiME(Finance for Energy Market Research Centre)and the“Finance et Developpement Durable-Approches Quantitatives”EDF-CACIB Chair。
文摘We consider the optimal control problem for a linear conditional McKeanVlasov equation with quadratic cost functional.The coefficients of the system and the weighting matrices in the cost functional are allowed to be adapted processes with respect to the common noise filtration.Semi closed-loop strategies are introduced,and following the dynamic programming approach in(Pham and Wei,Dynamic programming for optimal control of stochastic McKean-Vlasov dynamics,2016),we solve the problem and characterize time-consistent optimal control by means of a system of decoupled backward stochastic Riccati differential equations.We present several financial applications with explicit solutions,and revisit,in particular,optimal tracking problems with price impact,and the conditional mean-variance portfolio selection in an incomplete market model.
基金supported by the National Natural Science Foundation of China under Grant No.61773098the 111 Project under Grant No.B16009
文摘A switched linear quadratic(LQ) differential game over finite-horizon is investigated in this paper. The switching signal is regarded as a non-conventional player, afterwards the definition of Pareto efficiency is extended to dynamics switching situations to characterize the solutions of this multi-objective problem. Furthermore, the switched differential game is equivalently transformed into a family of parameterized single-objective optimal problems by introducing preference information and auxiliary variables. This transformation reduces the computing complexity such that the Pareto frontier of the switched LQ differential game can be constructed by dynamic programming. Finally, a numerical example is provided to illustrate the effectiveness.
基金supported by Hong Kong RGC under grants 519913,15209614 and 15224215Jingrui Sun was partially supported by the National Natural Science Foundation of China(11401556)+1 种基金the Fundamental Research Funds for the Central Universities(WK 2040000012)Jiongmin Yong was partially supported by NSF DMS-1406776.
文摘An optimal control problem is studied for a linear mean-field stochastic differential equation with a quadratic cost functional.The coefficients and the weighting matrices in the cost functional are all assumed to be deterministic.Closedloop strategies are introduced,which require to be independent of initial states;and such a nature makes it very useful and convenient in applications.In this paper,the existence of an optimal closed-loop strategy for the system(also called the closedloop solvability of the problem)is characterized by the existence of a regular solution to the coupled two(generalized)Riccati equations,together with some constraints on the adapted solution to a linear backward stochastic differential equation and a linear terminal value problem of an ordinary differential equation.
基金supported by the NSF of China under grants 11471231,11221101,11231007,11301298 and 11401404the PCSIRT under grant IRT 16R53 and the Chang Jiang Scholars Program from Chinese Education Ministry+1 种基金the Fundamental Research Funds for the Central Universities in China under grant 2015SCU04A02the NSFC-CNRS Joint Research Project under grant 11711530142。
文摘One of the fundamental issues in Control Theory is to design feedback controls.It is well-known that,the purpose of introducing Riccati equations in the study of deterministic linear quadratic control problems is exactly to construct the desired feedbacks.To date,the same problem in the stochastic setting is only partially well-understood.In this paper,we establish the equivalence between the existence of optimal feedback controls for the stochastic linear quadratic control problems with random coefficients and the solvability of the corresponding backward stochastic Riccati equations in a suitable sense.We also give a counterexample showing the nonexistence of feedback controls to a solvable stochastic linear quadratic control problem.This is a new phenomenon in the stochastic setting,significantly different from its deterministic counterpart.
基金This work was supported by National Key Research&Development Program of China under Grant No.2022YFA1006104National Natural Science Foundations of China under Grant Nos.11971266,11831010Shandong Provincial Natural Science Foundations under Grant Nos.ZR2022JQ01,ZR2020ZD24,ZR2019ZD42.
文摘In this paper,a leader-follower stochastic differential game is studied for a linear stochastic differential equation with quadratic cost functionals.The coefficients in the state equation and the weighting matrices in the cost functionals are all deterministic.Closed-loop strategies are introduced,which require to be independent of initial states;and such a nature makes it very useful and convenient in applications.The follower first solves a stochastic linear quadratic optimal control problem,and his optimal closed-loop strategy is characterized by a Riccati equation,together with an adapted solution to a linear backward stochastic differential equation.Then the leader turns to solve a stochastic linear quadratic optimal control problem of a forward-backward stochastic differential equation,necessary conditions for the existence of the optimal closed-loop strategy for the leader is given by a Riccati equation.Some examples are also given.