期刊文献+
共找到380篇文章
< 1 2 19 >
每页显示 20 50 100
Determination of AVR System PID Controller Parameters Using Improved Variants of Reptile Search Algorithm and a Novel Objective Function
1
作者 Baran Hekimoglu 《Energy Engineering》 EI 2023年第7期1515-1540,共26页
Two novel improved variants of reptile search algorithm(RSA),RSA with opposition-based learning(ORSA)and hybrid ORSA with pattern search(ORSAPS),are proposed to determine the proportional,integral,and derivative(PID)c... Two novel improved variants of reptile search algorithm(RSA),RSA with opposition-based learning(ORSA)and hybrid ORSA with pattern search(ORSAPS),are proposed to determine the proportional,integral,and derivative(PID)controller parameters of an automatic voltage regulator(AVR)system using a novel objective function with augmented flexibility.In the proposed algorithms,the opposition-based learning technique improves the global search abilities of the original RSA algorithm,while the hybridization with the pattern search(PS)algorithm improves the local search abilities.Both algorithms are compared with the original RSA algorithm and have shown to be highly effective algorithms for tuning the PID controller parameters of an AVR system by getting superior results.Several analyses such as transient,stability,robustness,disturbance rejection,and trajectory tracking are conducted to test the performance of the proposed algorithms,which have validated the good promise of the proposed methods for controller designs.The performances of the proposed design approaches are also compared with the previously reported PID controller parameter tuning approaches to assess their success.It is shown that both proposed approaches obtain excellent and robust results among all compared ones.That is,with the adjustment of the weight factorα,which is introduced by the proposed objective function,for a system with high bandwitdh(α=1),the proposed ORSAPS-PID system has 2.08%more bandwidth than the proposed ORSA-PID system and 5.1%faster than the fastest algorithm from the literature.On the other hand,for a system where high phase and gain margins are desired(α=10),the proposed ORSA-PID system has 0.53%more phase margin and 2.18%more gain margin than the proposed ORSAPS-PID system and has 0.71%more phase margin and 2.25%more gain margin than the best performing algorithm from the literature. 展开更多
关键词 Reptile search algorithm pattern search multidirectional search metaheuristics automatic voltage regulator optimal PID controller
下载PDF
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 被引量:1
2
作者 Ding Wang Ning Gao +2 位作者 Derong Liu Jinna Li Frank L.Lewis 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期18-36,共19页
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ... Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence. 展开更多
关键词 Adaptive dynamic programming(ADP) advanced control complex environment data-driven control event-triggered design intelligent control neural networks nonlinear systems optimal control reinforcement learning(RL)
下载PDF
AN OPTIMAL CONTROL PROBLEM FOR A LOTKA-VOLTERRA COMPETITION MODEL WITH CHEMO-REPULSION
3
作者 Diana I.HERNÁNDEZ Diego A.RUEDA-GOMEZ Élder J.VILLAMIZAR-ROA 《Acta Mathematica Scientia》 SCIE CSCD 2024年第2期721-751,共31页
In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^(ℕ),N=2,3.This model describes the competition of two species in... In this paper we study a bilinear optimal control problem for a diffusive Lotka-Volterra competition model with chemo-repulsion in a bounded domain of ℝ^(ℕ),N=2,3.This model describes the competition of two species in which one of them avoid encounters with rivals through a chemo-repulsion mechanism.We prove the existence and uniqueness of weak-strong solutions,and then we analyze the existence of a global optimal solution for a related bilinear optimal control problem,where the control is acting on the chemical signal.Posteriorly,we derive first-order optimality conditions for local optimal solutions using the Lagrange multipliers theory.Finally,we propose a discrete approximation scheme of the optimality system based on the gradient method,which is validated with some computational experiments. 展开更多
关键词 LOTKA-VOLTERRA chemo-repulsion optimal control optimality conditions
下载PDF
Adaptive Optimal Output Regulation of Interconnected Singularly Perturbed Systems With Application to Power Systems
4
作者 Jianguo Zhao Chunyu Yang +2 位作者 Weinan Gao Linna Zhou Xiaomin Liu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第3期595-607,共13页
This article studies the adaptive optimal output regulation problem for a class of interconnected singularly perturbed systems(SPSs) with unknown dynamics based on reinforcement learning(RL).Taking into account the sl... This article studies the adaptive optimal output regulation problem for a class of interconnected singularly perturbed systems(SPSs) with unknown dynamics based on reinforcement learning(RL).Taking into account the slow and fast characteristics among system states,the interconnected SPS is decomposed into the slow time-scale dynamics and the fast timescale dynamics through singular perturbation theory.For the fast time-scale dynamics with interconnections,we devise a decentralized optimal control strategy by selecting appropriate weight matrices in the cost function.For the slow time-scale dynamics with unknown system parameters,an off-policy RL algorithm with convergence guarantee is given to learn the optimal control strategy in terms of measurement data.By combining the slow and fast controllers,we establish the composite decentralized adaptive optimal output regulator,and rigorously analyze the stability and optimality of the closed-loop system.The proposed decomposition design not only bypasses the numerical stiffness but also alleviates the high-dimensionality.The efficacy of the proposed methodology is validated by a load-frequency control application of a two-area power system. 展开更多
关键词 Adaptive optimal control decentralized control output regulation reinforcement learning(RL) singularly perturbed systems(SPSs)
下载PDF
Optimal and robust control of population transfer in asymmetric quantum-dot molecules
5
作者 郭裕 马松山 束传存 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第2期353-359,共7页
We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules.We derive a long-duration control scheme that allows for highly efficient population... We present an optimal and robust quantum control method for efficient population transfer in asymmetric double quantum-dot molecules.We derive a long-duration control scheme that allows for highly efficient population transfer by accurately controlling the amplitude of a narrow-bandwidth pulse.To overcome fluctuations in control field parameters,we employ a frequency-domain quantum optimal control theory method to optimize the spectral phase of a single pulse with broad bandwidth while preserving the spectral amplitude.It is shown that this spectral-phase-only optimization approach can successfully identify robust and optimal control fields,leading to efficient population transfer to the target state while concurrently suppressing population transfer to undesired states.The method demonstrates resilience to fluctuations in control field parameters,making it a promising approach for reliable and efficient population transfer in practical applications. 展开更多
关键词 population transfer quantum optimal control theory quantum-dot molecules
下载PDF
Sequential Inverse Optimal Control of Discrete-Time Systems
6
作者 Sheng Cao Zhiwei Luo Changqin Quan 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第3期608-621,共14页
This paper presents a novel sequential inverse optimal control(SIOC)method for discrete-time systems,which calculates the unknown weight vectors of the cost function in real time using the input and output of an optim... This paper presents a novel sequential inverse optimal control(SIOC)method for discrete-time systems,which calculates the unknown weight vectors of the cost function in real time using the input and output of an optimally controlled discrete-time system.The proposed method overcomes the limitations of previous approaches by eliminating the need for the invertible Jacobian assumption.It calculates the possible-solution spaces and their intersections sequentially until the dimension of the intersection space decreases to one.The remaining one-dimensional vector of the possible-solution space’s intersection represents the SIOC solution.The paper presents clear conditions for convergence and addresses the issue of noisy data by clarifying the conditions for the singular values of the matrices that relate to the possible-solution space.The effectiveness of the proposed method is demonstrated through simulation results. 展开更多
关键词 Inverse optimal control promised calculation step sequential calculation
下载PDF
Identification of time-varying system and energy-based optimization of adaptive control in seismically excited structure
7
作者 Elham Aghabarari Fereidoun Amini Pedram Ghaderi 《Earthquake Engineering and Engineering Vibration》 SCIE EI CSCD 2024年第1期227-240,共14页
The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures.While synthetic algorithms have been proposed,adaptive control that is compatible ... The combination of structural health monitoring and vibration control is of great importance to provide components of smart structures.While synthetic algorithms have been proposed,adaptive control that is compatible with changing conditions still needs to be used,and time-varying systems are required to be simultaneously estimated with the application of adaptive control.In this research,the identification of structural time-varying dynamic characteristics and optimized simple adaptive control are integrated.First,reduced variations of physical parameters are estimated online using the multiple forgetting factor recursive least squares(MFRLS)method.Then,the energy from the structural vibration is simultaneously specified to optimize the control force with the identified parameters to be operational.Optimization is also performed based on the probability density function of the energy under the seismic excitation at any time.Finally,the optimal control force is obtained by the simple adaptive control(SAC)algorithm and energy coefficient.A numerical example and benchmark structure are employed to investigate the efficiency of the proposed approach.The simulation results revealed the effectiveness of the integrated online identification and optimal adaptive control in systems. 展开更多
关键词 integrated online identification time-varying systems structural energy multiple forgetting factor recursive least squares optimal simple adaptive control algorithm
下载PDF
A new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning
8
作者 Wendi Chen Qinglai Wei 《Journal of Automation and Intelligence》 2024年第1期34-39,共6页
In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied sy... In this paper,a new optimal adaptive backstepping control approach for nonlinear systems under deception attacks via reinforcement learning is presented in this paper.The existence of nonlinear terms in the studied system makes it very difficult to design the optimal controller using traditional methods.To achieve optimal control,RL algorithm based on critic–actor architecture is considered for the nonlinear system.Due to the significant security risks of network transmission,the system is vulnerable to deception attacks,which can make all the system state unavailable.By using the attacked states to design coordinate transformation,the harm brought by unknown deception attacks has been overcome.The presented control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded.Finally,the simulation experiment is shown to prove the effectiveness of the strategy. 展开更多
关键词 Nonlinear systems Reinforcement learning Optimal control Backstepping method
下载PDF
Matrix Riccati Equations in Optimal Control
9
作者 Malick Ndiaye 《Applied Mathematics》 2024年第3期199-213,共15页
In this paper, the matrix Riccati equation is considered. There is no general way for solving the matrix Riccati equation despite the many fields to which it applies. While scalar Riccati equation has been studied tho... In this paper, the matrix Riccati equation is considered. There is no general way for solving the matrix Riccati equation despite the many fields to which it applies. While scalar Riccati equation has been studied thoroughly, matrix Riccati equation of which scalar Riccati equations is a particular case, is much less investigated. This article proposes a change of variable that allows to find explicit solution of the Matrix Riccati equation. We then apply this solution to Optimal Control. 展开更多
关键词 Optimal Control Matrix Riccati Equation Change of Variable
下载PDF
A Priori Error Analysis for NCVEM Discretization of Elliptic Optimal Control Problem
10
作者 Shiying Wang Shuo Liu 《Engineering(科研)》 2024年第4期83-101,共19页
In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation o... In this paper, we propose the nonconforming virtual element method (NCVEM) discretization for the pointwise control constraint optimal control problem governed by elliptic equations. Based on the NCVEM approximation of state equation and the variational discretization of control variables, we construct a virtual element discrete scheme. For the state, adjoint state and control variable, we obtain the corresponding prior estimate in H<sup>1</sup> and L<sup>2</sup> norms. Finally, some numerical experiments are carried out to support the theoretical results. 展开更多
关键词 Nonconforming Virtual Element Method Optimal Control Problem a Priori Error Estimate
下载PDF
Optimized model-based control of main mine ventilation air flows with minimized energy consumption 被引量:4
11
作者 S.Sjostrom E.Klintenas +1 位作者 P.Johansson J.Nyqvist 《International Journal of Mining Science and Technology》 SCIE EI CSCD 2020年第4期533-539,共7页
In early 2018,the Boliden Garpenberg operation implemented an optimized control strategy as an addition to the existing ventilation on demand system.The purpose of the strategy is to further minimize energy use for ma... In early 2018,the Boliden Garpenberg operation implemented an optimized control strategy as an addition to the existing ventilation on demand system.The purpose of the strategy is to further minimize energy use for main and booster fans,whilst also fulfilling airflow setpoints without violating constraints such as min/max differential pressure over fans and interaction of air between areas in mines.Using air flow measurements and a dynamical model of the ventilation system,a mine-wide coordination control of fans can be carried out.The numerical model is data driven and derived from historical operational data or step changes experiments.This makes both initial deployment and lifetime model maintenance,as the mine evolves,a comparably easy operation.The control has been proven to operate in a stable manner over long periods without having to re-calibrate the model.Results prove a 40%decrease in energy use for the fans involved and a greater controllability of air flow.Moreover,a 15%decrease of the total air flow into the mine will give additional proportional heating savings during winter periods.All in all,the multivariable controller shows a correlation between production in the mine and the ventilation system performance superior to all of its predecessors. 展开更多
关键词 Mine ventilation Ventilation on demand optimized model-based control Minimized energy consumption Advanced process control
下载PDF
Output-Feedback Based Simplified Optimized Backstepping Control for Strict-Feedback Systems with Input and State Constraints 被引量:2
12
作者 Jiaxin Zhang Kewen Li Yongming Li 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第6期1119-1132,共14页
In this paper,an adaptive neural-network(NN)output feedback optimal control problem is studied for a class of strict-feedback nonlinear systems with unknown internal dynamics,input saturation and state constraints.Neu... In this paper,an adaptive neural-network(NN)output feedback optimal control problem is studied for a class of strict-feedback nonlinear systems with unknown internal dynamics,input saturation and state constraints.Neural networks are used to approximate unknown internal dynamics and an adaptive NN state observer is developed to estimate immeasurable states.Under the framework of the backstepping design,by employing the actor-critic architecture and constructing the tan-type Barrier Lyapunov function(BLF),the virtual and actual optimal controllers are developed.In order to accomplish optimal control effectively,a simplified reinforcement learning(RL)algorithm is designed by deriving the updating laws from the negative gradient of a simple positive function,instead of employing existing optimal control methods.In addition,to ensure that all the signals in the closed-loop system are bounded and the output can follow the reference signal within a bounded error,all state variables are confined within their compact sets all times.Finally,a simulation example is given to illustrate the effectiveness of the proposed control strategy. 展开更多
关键词 Backstepping design immeasurable states neuralnetworks(NNs) optimal control state constraints
下载PDF
Implementation of Radial Basis Function Artificial Neural Network into an Adaptive Equivalent Consumption Minimization Strategy for Optimized Control of a Hybrid Electric Vehicle 被引量:2
13
作者 Thomas P. Harris Andrew C. Nix +3 位作者 Mario G. Perhinschi W. Scott Wayne Jared A. Diethorn Aaron R. Mull 《Journal of Transportation Technologies》 2021年第4期471-503,共33页
Continued increases in the emission of greenhouse gases by passenger ve<span style="font-family:Verdana;">hicles ha</span><span style="font-family:Verdana;">ve</span><spa... Continued increases in the emission of greenhouse gases by passenger ve<span style="font-family:Verdana;">hicles ha</span><span style="font-family:Verdana;">ve</span><span style="font-family:;" "=""><span style="font-family:Verdana;"> accelerated the production of hybrid electric vehicles. With this increase in production, there has been a parallel demand for continuously improving strategies of hybrid electric vehicle control. The goal of an ideal control strategy is to maximize fuel economy while minimizing emissions. Methods exist by which the globally optimal control strategy may be found. However, these methods are not applicable in real-world driving applications since these methods require </span><i><span style="font-family:Verdana;">a</span></i> <i><span style="font-family:Verdana;">priori</span></i><span style="font-family:Verdana;"> knowledge of the upcoming drive cycle. Real-time control strategies use the global optimal as a benchmark against which performance can be evaluated. The goal of this work is to use a previously defined strategy that has been shown to closely approximate the global optimal and implement a radial basis function (RBF) artificial neural network (ANN) that dynamically adapts the strategy based on past driving conditions. The strate</span><span style="font-family:Verdana;">gy used is the Equivalent Consumption Minimization Strategy (ECMS),</span><span style="font-family:Verdana;"> which uses an equivalence factor to define the control strategy and the power train </span><span style="font-family:Verdana;">component torque split. An equivalence factor that is optimal for a single</span><span style="font-family:Verdana;"> drive cycle can be found offline</span></span><span style="font-family:;" "=""> </span><span style="font-family:;" "=""><span style="font-family:Verdana;">with </span><i><span style="font-family:Verdana;">a</span></i> <i><span style="font-family:Verdana;">priori</span></i><span style="font-family:Verdana;"> knowledge of the drive cycle. The RBF-ANN is used to dynamically update the equivalence factor by examining a past time window of driving characteristics. A total of 30 sets of training data (drive cycles) are used to train the RBF-ANN. For the majority of drive cycles examined, the RBF-ANN implementation is shown to produce fuel economy values that are within ±2.5% of the fuel economy obtained with the optimal equivalence factor. The advantage of the RBF-ANN is that it does not require </span><i><span style="font-family:Verdana;">a</span></i> <i><span style="font-family:Verdana;">priori</span></i><span style="font-family:Verdana;"> drive cycle knowledge and is able to be implemented in real-time while meeting or exceeding the performance of the optimal ECMS. Recommendations are made on how the RBF-ANN could be improved to produce better results across a greater array of driving conditions.</span></span> 展开更多
关键词 Hybrid Electric Vehicle Artificial Neural Network Equivalent Consumption Minimization Strategy (ECMS) Optimal Control Strategy
下载PDF
Optimized pulse for stimulated Raman adiabatic passage on noisy experimental platform
14
作者 王志凌 刘雷轶男 崔健 《Chinese Physics B》 SCIE EI CAS CSCD 2021年第8期18-24,共7页
Stimulated Raman adiabatic passage(STIRAP)is an important technique to manipulate quantum states in quantum simulation and quantum computation.The transformation fidelity is limited in reality due to experimental impe... Stimulated Raman adiabatic passage(STIRAP)is an important technique to manipulate quantum states in quantum simulation and quantum computation.The transformation fidelity is limited in reality due to experimental imperfections.After systematically calculating the influence of dissipation caused by thermal fluctuations and instantaneous decay of the intermediate state,we find optimized control pulses of Rydberg atom in optical tweezer to increase the STIRAP fidelity via optimal control method.All constraints of currently available control lasers have been taken into account.The transition error can be further depressed when control lasers with shorter rise time and accordingly proper total evolution time are applied.Finally,the robustness of the control pulses with respect to random deviations between the theoretical pulse shape and the implemented ones is also enhanced by additional rounds of optimizations based on ensemble averaged fidelity. 展开更多
关键词 STIRAP DISSIPATION optimal control Rydberg atom
下载PDF
An Optimal Control-Based Distributed Reinforcement Learning Framework for A Class of Non-Convex Objective Functionals of the Multi-Agent Network 被引量:1
15
作者 Zhe Chen Ning Li 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第11期2081-2093,共13页
This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objecti... This paper studies a novel distributed optimization problem that aims to minimize the sum of the non-convex objective functionals of the multi-agent network under privacy protection, which means that the local objective of each agent is unknown to others. The above problem involves complexity simultaneously in the time and space aspects. Yet existing works about distributed optimization mainly consider privacy protection in the space aspect where the decision variable is a vector with finite dimensions. In contrast, when the time aspect is considered in this paper, the decision variable is a continuous function concerning time. Hence, the minimization of the overall functional belongs to the calculus of variations. Traditional works usually aim to seek the optimal decision function. Due to privacy protection and non-convexity, the Euler-Lagrange equation of the proposed problem is a complicated partial differential equation.Hence, we seek the optimal decision derivative function rather than the decision function. This manner can be regarded as seeking the control input for an optimal control problem, for which we propose a centralized reinforcement learning(RL) framework. In the space aspect, we further present a distributed reinforcement learning framework to deal with the impact of privacy protection. Finally, rigorous theoretical analysis and simulation validate the effectiveness of our framework. 展开更多
关键词 Distributed optimization MULTI-AGENT optimal control reinforcement learning(RL)
下载PDF
ANALYSIS AND DISCRETIZATION FOR AN OPTIMAL CONTROL PROBLEM OF A VARIABLE-COEFFICIENT RIESZ-FRACTIONAL DIFFUSION EQUATION WITH POINTWISE CONTROL CONSTRAINTS
16
作者 周兆杰 王方圆 郑祥成 《Acta Mathematica Scientia》 SCIE CSCD 2023年第2期640-654,共15页
We present a mathematical and numerical study for a pointwise optimal control problem governed by a variable-coefficient Riesz-fractional diffusion equation.Due to the impact of the variable diffusivity coefficient,ex... We present a mathematical and numerical study for a pointwise optimal control problem governed by a variable-coefficient Riesz-fractional diffusion equation.Due to the impact of the variable diffusivity coefficient,existing regularity results for their constantcoefficient counterparts do not apply,while the bilinear forms of the state(adjoint)equation may lose the coercivity that is critical in error estimates of the finite element method.We reformulate the state equation as an equivalent constant-coefficient fractional diffusion equation with the addition of a variable-coefficient low-order fractional advection term.First order optimality conditions are accordingly derived and the smoothing properties of the solutions are analyzed by,e.g.,interpolation estimates.The weak coercivity of the resulting bilinear forms are proven via the Garding inequality,based on which we prove the optimal-order convergence estimates of the finite element method for the(adjoint)state variable and the control variable.Numerical experiments substantiate the theoretical predictions. 展开更多
关键词 Riesz-fractional diffusion equation variable coefficient optimal control finite element method Garding inequality optimal-order error estimate
下载PDF
Reduced differential transform and Sumudu transform methods for solving fractional financial models of awareness
17
作者 A.M.S.Mahdy K.A.Gepreel +1 位作者 Kh.Lotfy A.El-Bary 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2023年第3期338-356,共19页
In that paper,we new study has been carried out on previous studies of one of the most important mathematical models that describe the global economic movement,and that is described as a non-linear fractional financia... In that paper,we new study has been carried out on previous studies of one of the most important mathematical models that describe the global economic movement,and that is described as a non-linear fractional financial model of awareness,where the studies are represented at the steps following:One:The schematic of the model is suggested.Two:The disease-free equilibrium point(DFE)and the stability of the equilibrium point are discussed.Three:The stability of the model is fulfilled by drawing the Lyapunov exponents and Poincare map.Fourth:The existence of uniformly stable solutions have discussed.Five:The Caputo is described as the fractional derivative.Six:Fractional optimal control for NFFMA is discussed by clarifying the fractional optimal control through drawing before and after control.Seven:Reduced differential transform method(RDTM)and Sumudu Decomposition Method(SDM)are used to take the resolution of an NFFMA.Finally,we display that SDM and RDTM are highly identical. 展开更多
关键词 financial of awareness stability Lyapunov exponents Poincare map fractional optimal control HAMILTONIAN
下载PDF
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
18
作者 Guangyu Zhu Xiaolu Li +2 位作者 Ranran Sun Yiyuan Yang Peng Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第3期781-791,共11页
Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iterati... Aimed at infinite horizon optimal control problems of discrete time-varying nonlinear systems,in this paper,a new iterative adaptive dynamic programming algorithm,which is the discrete-time time-varying policy iteration(DTTV)algorithm,is developed.The iterative control law is designed to update the iterative value function which approximates the index function of optimal performance.The admissibility of the iterative control law is analyzed.The results show that the iterative value function is non-increasingly convergent to the Bellman-equation optimal solution.To implement the algorithm,neural networks are employed and a new implementation structure is established,which avoids solving the generalized Bellman equation in each iteration.Finally,the optimal control laws for torsional pendulum and inverted pendulum systems are obtained by using the DTTV policy iteration algorithm,where the mass and pendulum bar length are permitted to be time-varying parameters.The effectiveness of the developed method is illustrated by numerical results and comparisons. 展开更多
关键词 Adaptive critic designs adaptive dynamic programming approximate dynamic programming optimal control policy iteration TIME-VARYING
下载PDF
Optimal Control of Nonlinear Systems Using Experience Inference Human-Behavior Learning
19
作者 Adolfo Perrusquía Weisi Guo 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第1期90-102,共13页
Safety critical control is often trained in a simulated environment to mitigate risk.Subsequent migration of the biased controller requires further adjustments.In this paper,an experience inference human-behavior lear... Safety critical control is often trained in a simulated environment to mitigate risk.Subsequent migration of the biased controller requires further adjustments.In this paper,an experience inference human-behavior learning is proposed to solve the migration problem of optimal controllers applied to real-world nonlinear systems.The approach is inspired in the complementary properties that exhibits the hippocampus,the neocortex,and the striatum learning systems located in the brain.The hippocampus defines a physics informed reference model of the realworld nonlinear system for experience inference and the neocortex is the adaptive dynamic programming(ADP)or reinforcement learning(RL)algorithm that ensures optimal performance of the reference model.This optimal performance is inferred to the real-world nonlinear system by means of an adaptive neocortex/striatum control policy that forces the nonlinear system to behave as the reference model.Stability and convergence of the proposed approach is analyzed using Lyapunov stability theory.Simulation studies are carried out to verify the approach. 展开更多
关键词 Experience inference hippocampus learning system linear time-variant(LTV)systems neocortex/striatum learning systems nonlinear systems optimal control
下载PDF
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control
20
作者 Ding Wang Jiangyu Wang +2 位作者 Mingming Zhao Peng Xin Junfei Qiao 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第9期1797-1809,共13页
This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge t... This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman(HJB)equation.Then,the stability of the system is analyzed using control policies generated by MsHDP.Also,a general stability criterion is designed to determine the admissibility of the current control policy.That is,the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP.Further,based on the convergence and the stability criterion,the integrated MsHDP algorithm using immature control policies is developed to accelerate learning efficiency greatly.Besides,actor-critic is utilized to implement the integrated MsHDP scheme,where neural networks are used to evaluate and improve the iterative policy as the parameter architecture.Finally,two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses those of other fixed or integrated methods. 展开更多
关键词 Adaptive critic artificial neural networks Hamilton-Jacobi-Bellman(HJB)equation multi-step heuristic dynamic programming multi-step reinforcement learning optimal control
下载PDF
上一页 1 2 19 下一页 到第
使用帮助 返回顶部