In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy o...In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.展开更多
文摘In this paper,the problem of inverse quadratic optimal control over fnite time-horizon for discrete-time linear systems is considered.Our goal is to recover the corresponding quadratic objective function using noisy observations.First,the identifability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifable.Next,we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown,yet the exact observations on the initial states are available.We formulate the problem as a risk minimization problem and approximate the problem using empirical average.It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees.We then study the case where the exact observations on the initial states are not available,yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian(with unknown mean and covariance).EM-algorithm is used to estimate the parameters in the objective function.The efectiveness of our results are demonstrated by numerical examples.