27,253 articles found
AN OPTIMAL CONTROL PROBLEM FOR A LOTKA-VOLTERRA COMPETITION MODEL WITH CHEMO-REPULSION
1
Authors: Diana I. HERNÁNDEZ, Diego A. RUEDA-GOMEZ, Élder J. VILLAMIZAR-ROA. Acta Mathematica Scientia (SCIE, CSCD), 2024, Issue 2, pp. 721-751, 31 pages
In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^N, N = 2, 3. This model describes the competition of two species in which one of them avoids encounters with rivals through a chemo-repulsion mechanism. We prove the existence and uniqueness of weak-strong solutions, and then we analyze the existence of a global optimal solution for a related bilinear optimal control problem, where the control acts on the chemical signal. Subsequently, we derive first-order optimality conditions for local optimal solutions using the Lagrange multipliers theory. Finally, we propose a discrete approximation scheme of the optimality system based on the gradient method, which is validated with some computational experiments.
Keywords: Lotka-Volterra; chemo-repulsion; optimal control; optimality conditions
Vibration Control of the Rail Grinding Vehicle with Abrasive Belt Based on Structural Optimization and Lightweight Design
2
Authors: Wengang Fan, Shuai Zhang, Zhiwei Wu, Yi Liu, Jiangnan Yu. Chinese Journal of Mechanical Engineering (SCIE, EI, CAS, CSCD), 2024, Issue 3, pp. 311-337, 27 pages
As a new grinding and maintenance technology, rail belt grinding shows significant advantages in many applications. The dynamic characteristics of the rail belt grinding vehicle largely determine its grinding performance and service life. In order to explore the vibration control method of the rail grinding vehicle with abrasive belt, the vibration response changes in structural optimization and lightweight design are respectively analyzed through transient response and random vibration simulations in this paper. Firstly, the transient response simulation analysis of the rail grinding vehicle with abrasive belt is carried out under operating conditions and non-operating conditions. Secondly, the vibration control of the grinding vehicle is implemented by setting vibration isolation elements, optimizing the structure, and increasing damping. Thirdly, in order to further explore the dynamic characteristics of the rail grinding vehicle, the random vibration simulation analysis of the grinding vehicle is carried out under the condition of the horizontal irregularity of the American AAR6 track. Finally, by replacing the Q235 steel frame material with 7075 aluminum alloy and LA43M magnesium alloy, both vibration control and lightweight design can be achieved simultaneously. The results of transient dynamic response analysis show that the acceleration of most positions in the two working conditions exceeds the standard value in the GB/T 17426-1998 standard. By optimizing the structure of the grinding vehicle in three ways, the average vibration acceleration of the whole car is reduced by about 55.1%, from 15.6 m/s^2 to 7.0 m/s^2. The results of random vibration analysis show that the grinding vehicle with Q235 steel frame does not meet the safety conditions of 3σ. By changing the frame material, the maximum vibration stress of the vehicle can be reduced from 240.7 MPa to 160.0 MPa and the weight of the grinding vehicle is reduced by about 21.7%, from 1500 kg to 1175 kg. The modal analysis results indicate that the vibration control of the grinding vehicle can be realized by optimizing the structure and replacing the materials with lower stiffness under the premise of ensuring the overall strength. The study provides the basis for the development of lightweight, diversified and efficient rail grinding equipment.
Keywords: Vibration control; Dynamic characteristics; Structural optimization; Lightweight design; Modal analysis
Optimal and robust control of population transfer in asymmetric quantum-dot molecules
3
Authors: 郭裕, 马松山, 束传存. Chinese Physics B (SCIE, EI, CAS, CSCD), 2024, Issue 2, pp. 353-359, 7 pages
We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules. We derive a long-duration control scheme that allows for highly efficient population transfer by accurately controlling the amplitude of a narrow-bandwidth pulse. To overcome fluctuations in control field parameters, we employ a frequency-domain quantum optimal control theory method to optimize the spectral phase of a single pulse with broad bandwidth while preserving the spectral amplitude. It is shown that this spectral-phase-only optimization approach can successfully identify robust and optimal control fields, leading to efficient population transfer to the target state while concurrently suppressing population transfer to undesired states. The method demonstrates resilience to fluctuations in control field parameters, making it a promising approach for reliable and efficient population transfer in practical applications.
Keywords: population transfer; quantum optimal control theory; quantum-dot molecules
Sequential Inverse Optimal Control of Discrete-Time Systems
4
Authors: Sheng Cao, Zhiwei Luo, Changqin Quan. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, Issue 3, pp. 608-621, 14 pages
This paper presents a novel sequential inverse optimal control (SIOC) method for discrete-time systems, which calculates the unknown weight vectors of the cost function in real time using the input and output of an optimally controlled discrete-time system. The proposed method overcomes the limitations of previous approaches by eliminating the need for the invertible Jacobian assumption. It calculates the possible-solution spaces and their intersections sequentially until the dimension of the intersection space decreases to one. The remaining one-dimensional vector of the possible-solution space's intersection represents the SIOC solution. The paper presents clear conditions for convergence and addresses the issue of noisy data by clarifying the conditions for the singular values of the matrices that relate to the possible-solution space. The effectiveness of the proposed method is demonstrated through simulation results.
Keywords: Inverse optimal control; promised calculation step; sequential calculation
Contract Mechanism of Water Environment Regulation for Small and Medium Sized Enterprises Based on Optimal Control Theory
5
Authors: Shuang Zhao, Hongbin Gu, Lianfang Xue, Dongsheng Wang, Bin Huang. Journal of Water Resource and Protection (CAS), 2024, Issue 7, pp. 538-556, 20 pages
The small and scattered enterprise pattern in the county economy has formed numerous sporadic pollution sources, hindering the centralized treatment of the water environment and increasing the cost and difficulty of treatment. How enterprises can make reasonable decisions on their water environment behavior based on the external environment and their own factors is of great significance for scientifically and effectively designing water environment regulation mechanisms. Based on optimal control theory, this study investigates the design of contractual mechanisms for water environmental regulation for small and medium-sized enterprises. The enterprise is regarded as an independent economic entity that can adopt optimal control strategies to maximize its own interests. Based on the participation of multiple subjects including the government, enterprises, and the public, an optimal control strategy model for enterprises under contractual water environmental regulation is constructed using optimal control theory, and a method for calculating the amount of unit pollutant penalties is derived. The water pollutant treatment cost data of a paper company is selected to conduct empirical numerical analysis on the model. The results show that the increase in the probability of government regulation and public participation, as well as the decrease in local government protection for enterprises, can achieve the same regulatory effect while reducing the number of administrative penalties per unit. Finally, the implementation process of contractual water environmental regulation for small and medium-sized enterprises is designed.
Keywords: Optimal Control Theory; Small and Medium-Sized Enterprises; Water Environment Regulation; Contract Mechanism
A Loss-model-based Efficiency Optimization Control Method for Induction Traction System of High-speed Train under Emergency Self-propelled Mode
6
Authors: Yutong Zhu, Yaohua Li. CES Transactions on Electrical Machines and Systems (EI, CSCD), 2024, Issue 2, pp. 227-239, 13 pages
Increasing attention has been paid to the efficiency improvement of the induction traction system of high-speed trains due to the high demand for energy saving. In emergency self-propelled mode, however, the dc-link voltage and the traction power of the motor are significantly reduced, resulting in decreased traction efficiency due to the low load and low speed operations. Aiming to tackle this problem, a novel efficiency-improved control method is introduced to the emergency mode of the high-speed train traction system in this paper. In the proposed method, a total loss model of the induction motor considering the behaviors of both iron and copper loss is established. An improved iterative algorithm with decreased computational burden is then introduced, allowing the optimal flux reference for loss minimization to be solved quickly at each control period. In addition, considering the parameter variation problem due to the low load and low speed operations, a parameter estimation method is integrated to improve the controller's robustness. The effectiveness of the proposed method on efficiency improvement at low voltage and low load conditions is demonstrated by simulated and experimental results.
Keywords: Efficiency optimization; Induction motor; Loss model control; Motor drives; Traction system
Multi-Time Scale Optimal Scheduling of a Photovoltaic Energy Storage Building System Based on Model Predictive Control
7
Authors: Ximin Cao, Xinglong Chen, He Huang, Yanchi Zhang, Qifan Huang. Energy Engineering (EI), 2024, Issue 4, pp. 1067-1089, 23 pages
Building emission reduction is an important way to achieve China's carbon peaking and carbon neutrality goals. Aiming at the problem of low carbon economic operation of a photovoltaic energy storage building system, a multi-time scale optimal scheduling strategy based on model predictive control (MPC) is proposed under the consideration of load optimization. First, load optimization is achieved by controlling the charging time of electric vehicles as well as adjusting the air conditioning operation temperature, and the photovoltaic energy storage building system model is constructed to propose a day-ahead scheduling strategy with the lowest daily operation cost. Second, considering inter-day to intra-day source-load prediction error, an intraday rolling optimal scheduling strategy based on MPC is proposed that dynamically corrects the day-ahead dispatch results to stabilize system power fluctuations and promote photovoltaic consumption. Finally, taking an office building on a summer work day as an example, the effectiveness of the proposed scheduling strategy is verified. The results of the example show that the strategy reduces the total operating cost of the photovoltaic energy storage building system by 17.11%, improves the carbon emission reduction by 7.99%, and brings the photovoltaic consumption rate to 98.57%, improving the system's low-carbon and economic performance.
Keywords: Load optimization; model predictive control; multi-time scale optimal scheduling; photovoltaic consumption; photovoltaic energy storage building
Matrix Riccati Equations in Optimal Control
8
Author: Malick Ndiaye. Applied Mathematics, 2024, Issue 3, pp. 199-213, 15 pages
In this paper, the matrix Riccati equation is considered. There is no general way of solving the matrix Riccati equation despite the many fields to which it applies. While the scalar Riccati equation has been studied thoroughly, the matrix Riccati equation, of which the scalar Riccati equation is a particular case, is much less investigated. This article proposes a change of variable that allows an explicit solution of the matrix Riccati equation to be found. We then apply this solution to optimal control.
Keywords: Optimal Control; Matrix Riccati Equation; Change of Variable
Identification of time-varying system and energy-based optimization of adaptive control in seismically excited structure
9
Authors: Elham Aghabarari, Fereidoun Amini, Pedram Ghaderi. Earthquake Engineering and Engineering Vibration (SCIE, EI, CSCD), 2024, Issue 1, pp. 227-240, 14 pages
The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures. While synthetic algorithms have been proposed, adaptive control that is compatible with changing conditions still needs to be used, and time-varying systems are required to be simultaneously estimated with the application of adaptive control. In this research, the identification of structural time-varying dynamic characteristics and optimized simple adaptive control are integrated. First, reduced variations of physical parameters are estimated online using the multiple forgetting factor recursive least squares (MFRLS) method. Then, the energy from the structural vibration is simultaneously specified to optimize the control force with the identified parameters to be operational. Optimization is also performed based on the probability density function of the energy under the seismic excitation at any time. Finally, the optimal control force is obtained by the simple adaptive control (SAC) algorithm and energy coefficient. A numerical example and benchmark structure are employed to investigate the efficiency of the proposed approach. The simulation results revealed the effectiveness of the integrated online identification and optimal adaptive control in systems.
Keywords: integrated online identification; time-varying systems; structural energy; multiple forgetting factor recursive least squares; optimal simple adaptive control algorithm
A new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning
10
Authors: Wendi Chen, Qinglai Wei. Journal of Automation and Intelligence, 2024, Issue 1, pp. 34-39, 6 pages
In this paper, a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented. The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods. To achieve optimal control, an RL algorithm based on critic-actor architecture is considered for the nonlinear system. Due to the significant security risks of network transmission, the system is vulnerable to deception attacks, which can make all the system states unavailable. By using the attacked states to design the coordinate transformation, the harm brought by unknown deception attacks is overcome. The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded. Finally, a simulation experiment is shown to demonstrate the effectiveness of the strategy.
Keywords: Nonlinear systems; Reinforcement learning; Optimal control; Backstepping method
Optimization study of station track utilization in high-speed railroad based on constraints of control in random origin and process
11
Authors: Yajing Zheng, Dekun Zhang. Railway Sciences, 2024, Issue 3, pp. 332-343, 12 pages
Purpose - The purpose of this paper is to eliminate the fluctuations in train arrival and departure times caused by skewed distributions in interval operation times. These fluctuations arise from random origin and process factors during interval operations and can accumulate over multiple intervals. The aim is to enhance the robustness of high-speed rail station arrival and departure track utilization schemes. Design/methodology/approach - To achieve this objective, the paper simulates actual train operations, incorporating the fluctuations in interval operation times into the utilization of arrival and departure tracks at the station. The Monte Carlo simulation method is adopted to solve this problem. This approach transforms a nonlinear model, which includes constraints from probability distribution functions and is difficult to solve directly, into a linear programming model that is easier to handle. The method then linearly weights two objectives to optimize the solution. Findings - Through the application of Monte Carlo simulation, the study successfully converts the complex nonlinear model with probability distribution function constraints into a manageable linear programming model. By continuously adjusting the weighting coefficients of the linear objectives, the method is able to optimize the Pareto solution. Notably, this approach does not require extensive scene data to obtain a satisfactory Pareto solution set. Originality/value - The paper contributes to the field by introducing a novel method for optimizing high-speed rail station arrival and departure track utilization in the presence of fluctuations in interval operation times. The use of Monte Carlo simulation to transform the problem into a tractable linear programming model represents a significant advancement. Furthermore, the method's ability to produce satisfactory Pareto solutions without relying on extensive data sets adds to its practical value and applicability in real-world scenarios.
Keywords: Control in random origin; Control in random process; High-speed railroad station; Arrival and departure track utilization; Optimization. Paper type: Research paper
Stochastic Maximum Principle for Optimal Advertising Models with Delay and Non-Convex Control Spaces
12
Authors: Giuseppina Guatteri, Federica Masiero. Advances in Pure Mathematics, 2024, Issue 6, pp. 442-450, 9 pages
In this paper we study optimal advertising problems that model the introduction of a new product into the market in the presence of carryover effects of the advertisement and with memory effects in the level of goodwill. In particular, we let the dynamics of the product goodwill depend on the past, and also on past advertising efforts. We treat the problem by means of the stochastic Pontryagin maximum principle, which here is considered for a class of problems where in the state equation either the state or the control depends on the past. Moreover, the control acts on the martingale term and the space of controls U can be chosen to be non-convex. The maximum principle is thus formulated using a first-order adjoint backward stochastic differential equation (BSDE), which can be explicitly computed due to the specific characteristics of the model, and a second-order adjoint relation.
Keywords: Stochastic Optimal Control; Delay Equations; Advertisement Models; Stochastic Maximum Principle
The Impact of Optimizing Details in the Operating Room on the Level of Knowledge, Attitude, and Practice of Hospital Infection Prevention and Control by Surgeons, as Well as the Effectiveness of Infection Control
13
Author: Yuanyuan Zhang. Surgical Science, 2024, Issue 7, pp. 421-429, 9 pages
Objective: This paper aims to explore the impact of optimizing details in the operating room on the level of knowledge, attitude, and practice of hospital infection prevention and control by surgeons, as well as the effectiveness of infection control. Methods: From January 2022 to June 2023, a total of 120 patients were screened and randomly divided into a control group (routine care and hospital infection management) and a study group (optimizing details in the operating room). Results: Significant differences were found between the two groups in the data of surgeons' level of knowledge, attitude, and practice in hospital infection prevention and control, infection rates, and nursing satisfaction, with the study group showing better results (P ...). Conclusion: Optimizing details in the operating room can effectively improve surgeons' level of knowledge, attitude, and practice in hospital infection prevention and control and reduce infection occurrence, and is worth promoting.
Keywords: Optimizing Details in the Operating Room; Infection; Level of Knowledge, Attitude, and Practice; Infection Control
Transmission Dynamics and Optimal Control Strategies of a Hand-Foot-Mouth Disease Model with Treatment and Vaccination Interventions
14
Authors: Jianping Wang, Shenghua Zou, Zhicai Guo. Journal of Applied Mathematics and Physics, 2024, Issue 6, pp. 2007-2019, 13 pages
In this article, the transmission dynamics of a Hand-Foot-Mouth disease model with treatment and vaccination interventions are studied. We calculated the basic reproduction number and proved the global stability of the disease-free equilibrium when R0 ... R0 > 1. Meanwhile, we obtained the optimal control strategies that minimize the cost of intervention and the number of infected persons. We also give some numerical simulations to verify our theoretical results.
Keywords: Hand-Foot-Mouth Disease; Optimal Control; Transmission Dynamic; Vaccination Interventions
A Priori Error Analysis for NCVEM Discretization of Elliptic Optimal Control Problem
15
Authors: Shiying Wang, Shuo Liu. Engineering (Scientific Research), 2024, Issue 4, pp. 83-101, 19 pages
In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation of the state equation and the variational discretization of control variables, we construct a virtual element discrete scheme. For the state, adjoint state and control variable, we obtain the corresponding a priori estimates in the H^1 and L^2 norms. Finally, some numerical experiments are carried out to support the theoretical results.
Keywords: Nonconforming Virtual Element Method; Optimal Control Problem; A Priori Error Estimate
An Optimal Control-Based Distributed Reinforcement Learning Framework for A Class of Non-Convex Objective Functionals of the Multi-Agent Network (cited by: 2)
16
Authors: Zhe Chen, Ning Li. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2023, Issue 11, pp. 2081-2093, 13 pages
This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objective of each agent is unknown to others. The above problem involves complexity simultaneously in the time and space aspects. Yet existing works about distributed optimization mainly consider privacy protection in the space aspect where the decision variable is a vector with finite dimensions. In contrast, when the time aspect is considered in this paper, the decision variable is a continuous function concerning time. Hence, the minimization of the overall functional belongs to the calculus of variations. Traditional works usually aim to seek the optimal decision function. Due to privacy protection and non-convexity, the Euler-Lagrange equation of the proposed problem is a complicated partial differential equation. Hence, we seek the optimal decision derivative function rather than the decision function. This manner can be regarded as seeking the control input for an optimal control problem, for which we propose a centralized reinforcement learning (RL) framework. In the space aspect, we further present a distributed reinforcement learning framework to deal with the impact of privacy protection. Finally, rigorous theoretical analysis and simulation validate the effectiveness of our framework.
Keywords: Distributed optimization; multi-agent; optimal control; reinforcement learning (RL)
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems (cited by: 1)
17
Authors: Guangyu Zhu, Xiaolu Li, Ranran Sun, Yiyuan Yang, Peng Zhang. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2023, Issue 3, pp. 781-791, 11 pages
Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems, in this paper, a new iterative adaptive dynamic programming algorithm, which is the discrete-time time-varying policy iteration (DTTV) algorithm, is developed. The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance. The admissibility of the iterative control law is analyzed. The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution. To implement the algorithm, neural networks are employed and a new implementation structure is established, which avoids solving the generalized Bellman equation in each iteration. Finally, the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm, where the mass and pendulum bar length are permitted to be time-varying parameters. The effectiveness of the developed method is illustrated by numerical results and comparisons.
Keywords: Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; optimal control; policy iteration; time-varying
Optimal guidance strategy for flexible load based on hybrid direct load control and time of use (cited by: 1)
18
Authors: Siyang Liu, Yuan Gao, Hejun Yang, Xinghua Xie, Yinghao Ma. Global Energy Interconnection (EI, CSCD), 2023, Issue 3, pp. 297-307, 11 pages
The time-of-use (TOU) strategy can effectively improve the energy consumption mode of customers, reduce the peak-valley difference of the load curve, and optimize the allocation of energy resources. This study presents an optimal guidance mechanism of the flexible load based on strategies of direct load control and time-of-use. First, this study proposes a period partitioning model, which is based on a moving boundary technique with constraint factors, and the Dunn Validity Index (DVI) is used as the objective to solve the period partitioning. Second, a control strategy for the curtailable flexible load is investigated, and a TOU strategy is utilized for further modifying the load curve. Third, a price demand response strategy for adjusting transferable load is proposed in this paper. Finally, through the case study analysis of a typical daily flexible load curve, the efficiency and correctness of the proposed method and model are validated.
Keywords: Flexible load; optimal demand response strategy; Time of use; Period partitioning; Direct load control
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control (cited by: 1)
19
Authors: Ding Wang, Jiangyu Wang, Mingming Zhao, Peng Xin, Junfei Qiao. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2023, Issue 9, pp. 1797-1809, 13 pages
This paper is concerned with a novel integrated multi-step heuristic dynamic programming (MsHDP) algorithm for solving optimal control problems. It is shown that, initialized by the zero cost function, MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman (HJB) equation. Then, the stability of the system is analyzed using control policies generated by MsHDP. Also, a general stability criterion is designed to determine the admissibility of the current control policy. That is, the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP. Further, based on the convergence and the stability criterion, the integrated MsHDP algorithm using immature control policies is developed to accelerate learning efficiency greatly. Besides, actor-critic is utilized to implement the integrated MsHDP scheme, where neural networks are used to evaluate and improve the iterative policy as the parameter architecture. Finally, two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses those of other fixed or integrated methods.
Keywords: Adaptive critic; artificial neural networks; Hamilton-Jacobi-Bellman (HJB) equation; multi-step heuristic dynamic programming; multi-step reinforcement learning; optimal control
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications (cited by: 2)
20
Authors: Ding Wang, Ning Gao, Derong Liu, Jinna Li, Frank L. Lewis. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, Issue 1, pp. 18-36, 19 pages
Reinforcement learning (RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks, showing how they promote ADP formulation significantly. Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has demonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
Keywords: Adaptive dynamic programming (ADP); advanced control; complex environment; data-driven control; event-triggered design; intelligent control; neural networks; nonlinear systems; optimal control; reinforcement learning (RL)