Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-f...Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-form solu-tion due to the nonlinearity of HJI equation,and many iterative algorithms are proposed to solve the HJI equation.Simultane-ous policy updating algorithm(SPUA)is an effective algorithm for solving HJI equation,but it is an on-policy integral reinforce-ment learning(IRL).For online implementation of SPUA,the dis-turbance signals need to be adjustable,which is unrealistic.In this paper,an off-policy IRL algorithm based on SPUA is pro-posed without making use of any knowledge of the systems dynamics.Then,a neural-network based online adaptive critic implementation scheme of the off-policy IRL algorithm is pre-sented.Based on the online off-policy IRL method,a computa-tional intelligence interception guidance(CIIG)law is developed for intercepting high-maneuvering target.As a model-free method,intercepting targets can be achieved through measur-ing system data online.The effectiveness of the CIIG is verified through two missile and target engagement scenarios.展开更多
This paper centers on the integrated learning of English and law in China.Firstly,it outlines the importance of English in the solution of the ever increasing legal disputes between China and the outside world,which i...This paper centers on the integrated learning of English and law in China.Firstly,it outlines the importance of English in the solution of the ever increasing legal disputes between China and the outside world,which inevitably involves an integrated learning of English and law.Secondly,it points out that the content of legal English reflects a combination of legal knowledge and English skills.Thirdly,it expounds on the difficulties that Chinese English majors are facing in the process of learning English and law simultaneously and furnishes some practical suggestions.展开更多
An infinite number of semi-discrete and continuous conservation laws for the differential-difference KP equation were obtained by using a solvable generalized Riccati equation.
The concept of reward is fundamental in reinforcement learning with a wide range of applications in natural and social sciences.Seeking an interpretable reward for decision-making that largely shapes the system's ...The concept of reward is fundamental in reinforcement learning with a wide range of applications in natural and social sciences.Seeking an interpretable reward for decision-making that largely shapes the system's behavior has always been a challenge in reinforcement learning.In this work,we explore a discrete-time reward for reinforcement learning in continuous time and action spaces that represent many phenomena captured by applying physical laws.We find that the discrete-time reward leads to the extraction of the unique continuous-time decision law and improved computational efficiency by dropping the integrator operator that appears in classical results with integral rewards.We apply this finding to solve output-feedback design problems in power systems.The results reveal that our approach removes an intermediate stage of identifying dynamical models.Our work suggests that the discrete-time reward is efficient in search of the desired decision law,which provides a computational tool to understand and modify the behavior of large-scale engineering systems using the optimal learned decision.展开更多
This paper presents an Iterative Learning Control design applied to homing guidance of missiles against maneuvering targets. According to numerical experiments, although an increase of the control energies is apprecia...This paper presents an Iterative Learning Control design applied to homing guidance of missiles against maneuvering targets. According to numerical experiments, although an increase of the control energies is appreciated with respect to a previous published base controller for comparison, this strategy, which is simple to realize, is able to reduce the time to reach the head-on condition to target destruction. This fact is important to minimize the missile lateral force-level to fulfill engaging in hyper-sonic target persecutions.展开更多
This paper explores the adaptive iterative learning control method in the control of fractional order systems for the first time. An adaptive iterative learning control(AILC) scheme is presented for a class of commens...This paper explores the adaptive iterative learning control method in the control of fractional order systems for the first time. An adaptive iterative learning control(AILC) scheme is presented for a class of commensurate high-order uncertain nonlinear fractional order systems in the presence of disturbance.To facilitate the controller design, a sliding mode surface of tracking errors is designed by using sufficient conditions of linear fractional order systems. To relax the assumption of the identical initial condition in iterative learning control(ILC), a new boundary layer function is proposed by employing MittagLeffler function. The uncertainty in the system is compensated for by utilizing radial basis function neural network. Fractional order differential type updating laws and difference type learning law are designed to estimate unknown constant parameters and time-varying parameter, respectively. The hyperbolic tangent function and a convergent series sequence are used to design robust control term for neural network approximation error and bounded disturbance, simultaneously guaranteeing the learning convergence along iteration. The system output is proved to converge to a small neighborhood of the desired trajectory by constructing Lyapnov-like composite energy function(CEF)containing new integral type Lyapunov function, while keeping all the closed-loop signals bounded. Finally, a simulation example is presented to verify the effectiveness of the proposed approach.展开更多
Cement-based grouting has been widely used in mining engineering;its constitutive law has not been comprehensively studied.In this study,a novel constitutive law of cement-grouted coal specimens(CGCS)was developed usi...Cement-based grouting has been widely used in mining engineering;its constitutive law has not been comprehensively studied.In this study,a novel constitutive law of cement-grouted coal specimens(CGCS)was developed using hybrid machine learning(ML)algorithms.Shear tests were performed on CGCS for the analysis of stress-strain curves and the preparation of the dataset.To maintain the interpretation of the trained ML models,regression tree(RT)was used as the main technique.The effect of maximum RT depth(Maxdepth)on its performance was studied,and the hyperparameters of RT were tuned using the genetic algorithm(GA).The RT performance was also compared with ensemble learning techniques.The optimum correlation coefficient on the training set was determined as 0.835,0.946,0.981,and 0.985 for RT models with Maxdepth=3,5,7,and 9,respectively.The overall correlation coefficient was over 0.9 when the Maxdepth≥5,indicating that the constitutive law of CGCS can be well described.However,the failure type of CGCS could not be captured using the trained RT models.Random forest was found to be the optimum algorithm for the constitutive modeling of CGCS,while RT with the Maxdepth=3 performed the worst.展开更多
This paper proposes an y2-y∞ learning law as a new learning method for dynamic neural networks with external disturbance. Based on linear matrix inequality (LMI) formulation, the y2-y∞ learning law is presented to...This paper proposes an y2-y∞ learning law as a new learning method for dynamic neural networks with external disturbance. Based on linear matrix inequality (LMI) formulation, the y2-y∞ learning law is presented to not only guarantee asymptotical stability of dynamic neural networks but also reduce the effect of external disturbance to an y2-y∞ induced norm constraint. It is shown that the design of the y2-y∞ learning law for such neural networks can be achieved by solving LMIs, which can be easily facilitated by using some standard numerical packages. A numerical example is presented to demonstrate the validity of the proposed learning law.展开更多
The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation.A novel guidance law is presented by exploiting the deep reinforcement learning(DRL)with the hierarchic...The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation.A novel guidance law is presented by exploiting the deep reinforcement learning(DRL)with the hierarchical deep deterministic policy gradient(DDPG)algorithm.The reward functions are constructed to minimize the line-of-sight(LOS)angle rate and avoid the threat caused by the opposed obstacles.To attenuate the chattering of the acceleration,a hierarchical reinforcement learning structure and an improved reward function with action penalty are put forward.The simulation results validate that the missile under the proposed method can hit the target successfully and keep away from the threatened areas effectively.展开更多
A learning controller of nonhonolomic robot in real-time based on support vector machine(SVM)is presented.The controller includes two parts:one is kinematic controller based on nonlinear law,and the other is dynamic c...A learning controller of nonhonolomic robot in real-time based on support vector machine(SVM)is presented.The controller includes two parts:one is kinematic controller based on nonlinear law,and the other is dynamic controller based on SVM.The kinematic controller is aimed to provide desired velocity which can make the steering system stable.The dynamic controller is aimed to transform the desired velocity to control torque.The parameters of the dynamic system of the robot are estimated through SVM learning algorithm according to the training data of sliding windows in real time.The proposed controller can adapt to the changes in the robot model and uncertainties in the environment.Compared with artificial neural network(ANN)controller,SVM controller can converge to the reference trajectory more quickly and the tracking error is smaller.The simulation results verify the effectiveness of the method proposed.展开更多
文摘Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-form solu-tion due to the nonlinearity of HJI equation,and many iterative algorithms are proposed to solve the HJI equation.Simultane-ous policy updating algorithm(SPUA)is an effective algorithm for solving HJI equation,but it is an on-policy integral reinforce-ment learning(IRL).For online implementation of SPUA,the dis-turbance signals need to be adjustable,which is unrealistic.In this paper,an off-policy IRL algorithm based on SPUA is pro-posed without making use of any knowledge of the systems dynamics.Then,a neural-network based online adaptive critic implementation scheme of the off-policy IRL algorithm is pre-sented.Based on the online off-policy IRL method,a computa-tional intelligence interception guidance(CIIG)law is developed for intercepting high-maneuvering target.As a model-free method,intercepting targets can be achieved through measur-ing system data online.The effectiveness of the CIIG is verified through two missile and target engagement scenarios.
文摘This paper centers on the integrated learning of English and law in China.Firstly,it outlines the importance of English in the solution of the ever increasing legal disputes between China and the outside world,which inevitably involves an integrated learning of English and law.Secondly,it points out that the content of legal English reflects a combination of legal knowledge and English skills.Thirdly,it expounds on the difficulties that Chinese English majors are facing in the process of learning English and law simultaneously and furnishes some practical suggestions.
文摘An infinite number of semi-discrete and continuous conservation laws for the differential-difference KP equation were obtained by using a solvable generalized Riccati equation.
基金supported by the Guangdong Basic and Applied Basic Research Foundation(2024A1515011936)the National Natural Science Foundation of China(62320106008)
文摘The concept of reward is fundamental in reinforcement learning with a wide range of applications in natural and social sciences.Seeking an interpretable reward for decision-making that largely shapes the system's behavior has always been a challenge in reinforcement learning.In this work,we explore a discrete-time reward for reinforcement learning in continuous time and action spaces that represent many phenomena captured by applying physical laws.We find that the discrete-time reward leads to the extraction of the unique continuous-time decision law and improved computational efficiency by dropping the integrator operator that appears in classical results with integral rewards.We apply this finding to solve output-feedback design problems in power systems.The results reveal that our approach removes an intermediate stage of identifying dynamical models.Our work suggests that the discrete-time reward is efficient in search of the desired decision law,which provides a computational tool to understand and modify the behavior of large-scale engineering systems using the optimal learned decision.
基金partially supported by the Spanish Ministry of Economy and Competitiveness under grant number DPI2015-64170-R(MINECO/FEDER)
文摘This paper presents an Iterative Learning Control design applied to homing guidance of missiles against maneuvering targets. According to numerical experiments, although an increase of the control energies is appreciated with respect to a previous published base controller for comparison, this strategy, which is simple to realize, is able to reduce the time to reach the head-on condition to target destruction. This fact is important to minimize the missile lateral force-level to fulfill engaging in hyper-sonic target persecutions.
基金supported by the National Natural Science Foundation of China(60674090)Shandong Natural Science Foundation(ZR2017QF016)
文摘This paper explores the adaptive iterative learning control method in the control of fractional order systems for the first time. An adaptive iterative learning control(AILC) scheme is presented for a class of commensurate high-order uncertain nonlinear fractional order systems in the presence of disturbance.To facilitate the controller design, a sliding mode surface of tracking errors is designed by using sufficient conditions of linear fractional order systems. To relax the assumption of the identical initial condition in iterative learning control(ILC), a new boundary layer function is proposed by employing MittagLeffler function. The uncertainty in the system is compensated for by utilizing radial basis function neural network. Fractional order differential type updating laws and difference type learning law are designed to estimate unknown constant parameters and time-varying parameter, respectively. The hyperbolic tangent function and a convergent series sequence are used to design robust control term for neural network approximation error and bounded disturbance, simultaneously guaranteeing the learning convergence along iteration. The system output is proved to converge to a small neighborhood of the desired trajectory by constructing Lyapnov-like composite energy function(CEF)containing new integral type Lyapunov function, while keeping all the closed-loop signals bounded. Finally, a simulation example is presented to verify the effectiveness of the proposed approach.
基金financially supported by Fundamental Research Funds for the Central Universities(No.2020ZDPY0221)State Key Laboratory for Geo Mechanics and Deep Underground Engineering,China University of Mining&Technology(No.SKLGDUEK2002)+1 种基金Fundamental Research Funds for the Central Universities(No.2021QN1003)National Natural Science Foundation of China(Nos.52104106,52174089)。
文摘Cement-based grouting has been widely used in mining engineering;its constitutive law has not been comprehensively studied.In this study,a novel constitutive law of cement-grouted coal specimens(CGCS)was developed using hybrid machine learning(ML)algorithms.Shear tests were performed on CGCS for the analysis of stress-strain curves and the preparation of the dataset.To maintain the interpretation of the trained ML models,regression tree(RT)was used as the main technique.The effect of maximum RT depth(Maxdepth)on its performance was studied,and the hyperparameters of RT were tuned using the genetic algorithm(GA).The RT performance was also compared with ensemble learning techniques.The optimum correlation coefficient on the training set was determined as 0.835,0.946,0.981,and 0.985 for RT models with Maxdepth=3,5,7,and 9,respectively.The overall correlation coefficient was over 0.9 when the Maxdepth≥5,indicating that the constitutive law of CGCS can be well described.However,the failure type of CGCS could not be captured using the trained RT models.Random forest was found to be the optimum algorithm for the constitutive modeling of CGCS,while RT with the Maxdepth=3 performed the worst.
基金Project supported by the Grant of the Korean Ministry of Education, Science and Technology (The Regional Core Research Program/Center for Healthcare Technology Development)
文摘This paper proposes an y2-y∞ learning law as a new learning method for dynamic neural networks with external disturbance. Based on linear matrix inequality (LMI) formulation, the y2-y∞ learning law is presented to not only guarantee asymptotical stability of dynamic neural networks but also reduce the effect of external disturbance to an y2-y∞ induced norm constraint. It is shown that the design of the y2-y∞ learning law for such neural networks can be achieved by solving LMIs, which can be easily facilitated by using some standard numerical packages. A numerical example is presented to demonstrate the validity of the proposed learning law.
基金supported by the National Natural Science Foundation of China(62003021,91212304).
文摘The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation.A novel guidance law is presented by exploiting the deep reinforcement learning(DRL)with the hierarchical deep deterministic policy gradient(DDPG)algorithm.The reward functions are constructed to minimize the line-of-sight(LOS)angle rate and avoid the threat caused by the opposed obstacles.To attenuate the chattering of the acceleration,a hierarchical reinforcement learning structure and an improved reward function with action penalty are put forward.The simulation results validate that the missile under the proposed method can hit the target successfully and keep away from the threatened areas effectively.
基金Project(60910005)supported by the National Natural Science Foundation of China
文摘A learning controller of nonhonolomic robot in real-time based on support vector machine(SVM)is presented.The controller includes two parts:one is kinematic controller based on nonlinear law,and the other is dynamic controller based on SVM.The kinematic controller is aimed to provide desired velocity which can make the steering system stable.The dynamic controller is aimed to transform the desired velocity to control torque.The parameters of the dynamic system of the robot are estimated through SVM learning algorithm according to the training data of sliding windows in real time.The proposed controller can adapt to the changes in the robot model and uncertainties in the environment.Compared with artificial neural network(ANN)controller,SVM controller can converge to the reference trajectory more quickly and the tracking error is smaller.The simulation results verify the effectiveness of the method proposed.