An adaptive iterative learning control scheme is presented for a class of strict-feedback nonlinear time-delay systems, with unknown nonlinearly parameterised and time-varying disturbed functions of known periods. Rad...An adaptive iterative learning control scheme is presented for a class of strict-feedback nonlinear time-delay systems, with unknown nonlinearly parameterised and time-varying disturbed functions of known periods. Radial basis function neural network and Fourier series expansion (FSE) are combined into a new function approximator to model each suitable disturbed function in systems. The requirement of the traditional iterative learning control algorithm on the nonlinear functions (such as global Lipschitz condition) is relaxed. Furthermore, by using appropriate Lyapunov-Krasovskii functionals, all signs in the closed loop system are guaranteed to be semiglobally uniformly ultimately bounded, and the output of the system is proved to converge to the desired trajectory. A simulation example is provided to illustrate the effectiveness of the control scheme.展开更多
In this paper,the problem of adaptive iterative learning based consensus control for periodically time-varying multi-agent systems is studied,in which the dynamics of each follower are driven by nonlinearly parameteri...In this paper,the problem of adaptive iterative learning based consensus control for periodically time-varying multi-agent systems is studied,in which the dynamics of each follower are driven by nonlinearly parameterized terms with periodic disturbances.Neural networks and Fourier base expansions are introduced to describe the periodically time-varying dynamic terms.On this basis,an adaptive learning parameter with a positively convergent series term is constructed,and a distributed control protocol based on local signals between agents is designed to ensure accurate consensus of the closed-loop systems.Furthermore,consensus algorithm is generalized to solve the formation control problem.Finally,simulation experiments are implemented through MATLAB to demonstrate the effectiveness of the method used.展开更多
In this paper, the stability of iterative learning control with data dropouts is discussed. By the super vector formulation, an iterative learning control (ILC) system with data dropouts can be modeled as an asynchr...In this paper, the stability of iterative learning control with data dropouts is discussed. By the super vector formulation, an iterative learning control (ILC) system with data dropouts can be modeled as an asynchronous dynamical system with rate constraints on events in the iteration domain. The stability condition is provided in the form of linear matrix inequalities (LMIS) depending on the stability of asynchronous dynamical systems. The analysis is supported by simulations.展开更多
Iterative Learning Control (ILC) captures interests of many scholars because of its capability of high precision control implement without identifying plant mathematical models, and it is widely applied in control e...Iterative Learning Control (ILC) captures interests of many scholars because of its capability of high precision control implement without identifying plant mathematical models, and it is widely applied in control engineering. Presently, most ILC algorithms still follow the original ideas of ARIMOTO, in which the iterative-learning-rate is composed by the control error with its derivative and integral values. This kind of algorithms will result in inevitable problems such as huge computation, big storage capacity for algorithm data, and also weak robust. In order to resolve these problems, an improved iterative learning control algorithm with fixed step is proposed here which breaks the primary thought of ARIMOTO. In this algorithm, the control step is set only according to the value of the control error, which could enormously reduce the computation and storage size demanded, also improve the robust of the algorithm by not using the differential coefficient of the iterative learning error. In this paper, the convergence conditions of this proposed fixed step iterative learning algorithm is theoretically analyzed and testified. Then the algorithm is tested through simulation researches on a time-variant object with randomly set disturbance through calculation of step threshold value, algorithm robustness testing,and evaluation of the relation between convergence speed and step size. Finally the algorithm is validated on a valve-serving-cylinder system of a joint robot with time-variant parameters. Experiment results demonstrate the stability of the algorithm and also the relationship between step value and convergence rate. Both simulation and experiment testify the feasibility and validity of the new algorithm proposed here. And it is worth to noticing that this algorithm is simple but with strong robust after improvements, which provides new ideas to the research of iterative learning control algorithms.展开更多
Industrial robot system is a kind of dynamic system w ith strong nonlinear coupling and high position precision. A lot of control ways , such as nonlinear feedbackdecomposition motion and adaptive control and so o n, ...Industrial robot system is a kind of dynamic system w ith strong nonlinear coupling and high position precision. A lot of control ways , such as nonlinear feedbackdecomposition motion and adaptive control and so o n, have been used to control this kind of system, but there are some deficiencie s in those methods: some need accurate and some need complicated operation and e tc. In recent years, in need of controlling the industrial robots, aiming at com pletely tracking the ideal input for the controlled subject with repetitive character, a new research area, ILC (iterative learning control), has been devel oped in the control technology and theory. The iterative learning control method can make the controlled subject operate as desired in a definite time span, merely making use of the prior control experie nce of the system and searching for the desired control signal according to the practical and desired output signal. The process of searching is equal to that o f learning, during which we only need to measure the output signal to amend the control signal, not like the adaptive control strategy, which on line assesses t he complex parameters of the system. Besides, since the iterative learning contr ol relies little on the prior message of the subject, it has been well used in a lot of areas, especially the dynamic systems with strong non-linear coupling a nd high repetitive position precision and the control system with batch producti on. Since robot manipulator has the above-mentioned character, ILC can be very well used in robot manipulator. In the ILC, since the operation always begins with a certain initial state, init ial condition has been required in almost all convergence verification. Therefor e, in designing the controller, the initial state has to be restricted with some condition to guarantee the convergence of the algorithm. The settle of initial condition problem has long been pursued in the ILC. There are commonly two kinds of initial condition problems: one is zero initial error problem, another is non-zero initial error problem. In practice, the repe titive operation will invariably produce excursion of the iterative initial stat e from the desired initial state. As a result, the research on the second in itial problem has more practical meaning. In this paper, for the non-zero initial error problem, one novel robust ILC alg orithms, respectively combining PD type iterative learning control algorithm wit h the robust feedback control algorithm, has been presented. This novel robust ILC algorithm contain two parts: feedforward ILC algorithm and robust feedback algorithm, which can be used to restrain disturbance from param eter variation, mechanical nonlinearities and unmodeled dynamics and to achieve good performance as well. The feedforward ILC algorithm can be used to improve the tracking error and perf ormance of the system through iteratively learning from the previous operation, thus performing the tracking task very fast. The robust feedback algorithm could mainly be applied to make the real output of the system not deviate too much fr om the desired tracking trajectory, and guarantee the system’s robustness w hen there are exterior noises and variations of the system parameter. In this paper, in order to analyze the convergence of the algorithm, Lyapunov st ability theory has been used through properly selecting the Lyapunov function. T he result of the verification shows the feasibility of the novel robust iterativ e learning control in theory. Finally, aiming at the two-freedom rate robot, simulation has been made with th e MATLAB software. Furthermore, two groups of parameters are selected to validat e the robustness of the algorithm.展开更多
This paper presents an innovative data-integration that uses an iterative-learning method,a deep neural network(DNN)coupled with a stacked autoencoder(SAE)to solve issues encountered with many-objective history matchi...This paper presents an innovative data-integration that uses an iterative-learning method,a deep neural network(DNN)coupled with a stacked autoencoder(SAE)to solve issues encountered with many-objective history matching.The proposed method consists of a DNN-based inverse model with SAE-encoded static data and iterative updates of supervised-learning data are based on distance-based clustering schemes.DNN functions as an inverse model and results in encoded flattened data,while SAE,as a pre-trained neural network,successfully reduces dimensionality and reliably reconstructs geomodels.The iterative-learning method can improve the training data for DNN by showing the error reduction achieved with each iteration step.The proposed workflow shows the small mean absolute percentage error below 4%for all objective functions,while a typical multi-objective evolutionary algorithm fails to significantly reduce the initial population uncertainty.Iterative learning-based manyobjective history matching estimates the trends in water cuts that are not reliably included in dynamicdata matching.This confirms the proposed workflow constructs more plausible geo-models.The workflow would be a reliable alternative to overcome the less-convergent Pareto-based multi-objective evolutionary algorithm in the presence of geological uncertainty and varying objective functions.展开更多
An optimal iterative learning control (ILC) strategy of improving endpoint products in semi-batch processes is presented by combining a neural network model. Control affine feed-forward neural network (CAFNN) is p...An optimal iterative learning control (ILC) strategy of improving endpoint products in semi-batch processes is presented by combining a neural network model. Control affine feed-forward neural network (CAFNN) is proposed to build a model of semi-batch process. The main advantage of CAFNN is to obtain analytically its gradient of endpoint products with respect to input. Therefore, an optimal ILC law with direct error feedback is obtained explicitly, and the convergence of tracking error can be analyzed theoretically. It has been proved that the tracking errors may converge to small values. The proposed modeling and control strategy is illustrated on a simulated isothermal semi-batch reactor, and the results show that the endpoint products can be improved gradually from batch to batch.展开更多
This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge t...This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman(HJB)equation.Then,the stability of the system is analyzed using control policies generated by MsHDP.Also,a general stability criterion is designed to determine the admissibility of the current control policy.That is,the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP.Further,based on the convergence and the stability criterion,the integrated MsHDP algorithm using immature control policies is developed to accelerate learning efficiency greatly.Besides,actor-critic is utilized to implement the integrated MsHDP scheme,where neural networks are used to evaluate and improve the iterative policy as the parameter architecture.Finally,two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses those of other fixed or integrated methods.展开更多
Since the existing single-layer networked control systems have some inherent limitations and cannot effectively handle the problems associated with unreliable networks, a novel two-layer networked learning control sys...Since the existing single-layer networked control systems have some inherent limitations and cannot effectively handle the problems associated with unreliable networks, a novel two-layer networked learning control system (NLCS) is proposed in this paper. Its lower layer has a number of local controllers that are operated independently, and its upper layer has a learning agent that communicates with the independent local controllers in the lower layer. To implement such a system, a packet-discard strategy is firstly developed to deal with network-induced delay and data packet loss. A cubic spline interpolator is then employed to compensate the lost data. Finally, the output of the learning agent based on a novel radial basis function neural network (RBFNN) is used to update the parameters of fuzzy controllers. A nonlinear heating, ventilation and air-conditioning (HVAC) system is used to demonstrate the feasibility and effectiveness of the proposed system.展开更多
Norm optimal iterative learning control(NOILC) has recently been applied to iterative learning control(ILC) problems in which tracking is only required at a subset of isolated time points along the trial duration. Thi...Norm optimal iterative learning control(NOILC) has recently been applied to iterative learning control(ILC) problems in which tracking is only required at a subset of isolated time points along the trial duration. This problem addresses the practical needs of many applications, including industrial automation, crane control, satellite positioning and motion control within a medical stroke rehabilitation context. This paper provides a substantial generalization of this framework by providing a solution to the problem of convergence at intermediate points with simultaneous tracking of subsets of outputs to reference trajectories on subintervals. This formulation enables the NOILC paradigm to tackle tasks which mix "point to point" movements with linear tracking requirements and hence substantially broadens the application domain to include automation tasks which include welding or cutting movements, or human motion control where the movement is restricted by the task to straight line and/or planar segments. A solution to the problem is presented in the framework of NOILC and inherits NOILC s well-defined convergence properties. Design guidelines and supporting experimental results are included.展开更多
The autonomous mobile robots must be flexible to learn the new complex control behaviours in order to adapt effectively to a dynamic and varying environment. The proposed approach of this paper is to create a controll...The autonomous mobile robots must be flexible to learn the new complex control behaviours in order to adapt effectively to a dynamic and varying environment. The proposed approach of this paper is to create a controller that learns the complex behaviours incorporating the learning from demonstration to reduce the search space and to improve the demonstrated task geometry by trial and corrections. The task faced by the robot has uncertainty that must be learned. Simulation results indicate that after the handful of trials, robot has learned the right policies and avoided the obstacles and reached the goal.展开更多
This paper introduces a self-learning control approach based on approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950's for solving optimal control problems of nonlinear dynami...This paper introduces a self-learning control approach based on approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950's for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, the applications of dynamic programming have been limited to simple and small problems. The key step in finding approximate solutions to dynamic programming is to estimate the performance index in dynamic programming. The optimal control signal can then be determined by minimizing (or maximizing) the performance index. Artificial neural networks are very efficient tools in representing the performance index in dynamic programming. This paper assumes the use of neural networks for estimating the performance index in dynamic programming and for generating optimal control signals, thus to achieve optimal control through self-learning.展开更多
Nonlinear loads in the power distribution system cause non-sinusoidal currents and voltages with harmonic components.Shunt active filters(SAF) with current controlled voltage source inverters(CCVSI) are usually used t...Nonlinear loads in the power distribution system cause non-sinusoidal currents and voltages with harmonic components.Shunt active filters(SAF) with current controlled voltage source inverters(CCVSI) are usually used to obtain balanced and sinusoidal source currents by injecting compensation currents.However,CCVSI with traditional controllers have a limited transient and steady state performance.In this paper,we propose an adaptive dynamic programming(ADP) controller with online learning capability to improve transient response and harmonics.The proposed controller works alongside existing proportional integral(PI) controllers to efficiently track the reference currents in the d-q domain.It can generate adaptive control actions to compensate the PI controller.The proposed system was simulated under different nonlinear(three-phase full wave rectifier) load conditions.The performance of the proposed approach was compared with the traditional approach.We have also included the simulation results without connecting the traditional PI control based power inverter for reference comparison.The online learning based ADP controller not only reduced average total harmonic distortion by 18.41%,but also outperformed traditional PI controllers during transients.展开更多
Several applications of machine learning and artificial intelligence,have acquired importance and come to the fore as a result of recent advances and improvements in these approaches.Autonomous cars are one such appli...Several applications of machine learning and artificial intelligence,have acquired importance and come to the fore as a result of recent advances and improvements in these approaches.Autonomous cars are one such application.This is expected to have a significant and revolutionary influence on society.Integration with smart cities,new infrastructure and urban planning with sophisticated cyber-security are some of the current ramifications of self-driving automobiles.The autonomous automobile,often known as selfdriving systems or driverless vehicles,is a vehicle that can perceive its surroundings and navigate predetermined routes without human involvement.Cars are on the verge of evolving into autonomous robots,thanks to significant breakthroughs in artificial intelligence and related technologies,and this will have a wide range of socio-economic implications.However,in order for these automobiles to become a reality,they must be endowed with the perception and cognition necessary to deal with high-pressure real-life events and make proper judgments and take appropriate action.The majority of self-driving car technologies are based on computer systems that automate vehicle control parts.From forward-collision warning and antilock brakes to lane-keeping and adaptive drive control,to fully automated driving,these technological components have a wide range of capabilities.A self-driving car combines a wide range of sensors,actuators,and cameras.Recent researches on computer vision and deep learning are used to control autonomous driving systems.For self-driving automobiles,lane-keeping is crucial.This study presents a deep learning approach to obtain the proper steering angle to maintain the robot in the lane.We propose an advanced control for a selfdriving robot by using two controllers simultaneously.Convolutional neural networks(CNNs)are employed,to predict the car’and a proportionalintegral-derivative(PID)controller is designed for speed and steering control.This study uses a Raspberry PI based camera to control the robot car.展开更多
Considering the wheeled mobile robot(WMR)tracking problem with velocity saturation,we developed a data‐driven iterative learning double loop control method with constraints.First,the authors designed an outer loop co...Considering the wheeled mobile robot(WMR)tracking problem with velocity saturation,we developed a data‐driven iterative learning double loop control method with constraints.First,the authors designed an outer loop controller to provide virtual velocity for the inner loop according to the position and pose tracking error of the WMR kinematic model.Second,the authors employed dynamic linearisation to transform the dynamic model into an online data‐driven model along the iterative domain.Based on the measured input and output data of the dynamic model,the authors identified the parameters of the inner loop controller.The authors considered the velocity saturation constraints;we adjusted the output velocity of the WMR online,providing effective solutions to the problem of velocity saltation and the saturation constraint in the tracking process.Notably,the inner loop controller only uses the output data and input of the dynamic model,which not only enables the reliable control of WMR trajectory tracking,but also avoids the influence of inaccurate model identification processes on the tracking performance.The authors analysed the algorithm's convergence in theory,and the results show that the tracking errors of position,angle and velocity can converge to zero in the iterative domain.Finally,the authors used a simulation to demonstrate the effectiveness of the algorithm.展开更多
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ...Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.展开更多
In this paper,we propose a novel algorithm based on Zidan’s quantum computing model for remotely controlling the direction of a quantumcontrolledmobile robot equippedwith n-movements.The proposed algorithm is based o...In this paper,we propose a novel algorithm based on Zidan’s quantum computing model for remotely controlling the direction of a quantumcontrolledmobile robot equippedwith n-movements.The proposed algorithm is based on the measurement of concurrence value for the different movements of the robot.Consider a faraway robot that moves in the deep space(e.g.,moves toward a galaxy),and it is required to control the direction of this robot from a ground station by some person Alice.She sends an unknown qubitα|0)+β|1)via the teleportation protocol to the robot.Then,the proposed algorithm decodes the received unknown qubit into an angleθ,that determines the motion direction of the robot,based on the concurrence value.The proposed algorithm has been tested for four and eight movements.Two simulators have been tested;IBM Quantum composer and IBM’s system,The two simulators achieved the same result approximately.The motion of any part of the robot is considered,if it has a pre-existing sensor system and a locomotive system,.We can use this technique in many places like in space robots(16 directions).The results show that the proposed technique can be easily used for a huge number of movements.However,increasing the number of movements of the robot will increase the number of qubits.展开更多
Organizations are adopting the Bring Your Own Device(BYOD)concept to enhance productivity and reduce expenses.However,this trend introduces security challenges,such as unauthorized access.Traditional access control sy...Organizations are adopting the Bring Your Own Device(BYOD)concept to enhance productivity and reduce expenses.However,this trend introduces security challenges,such as unauthorized access.Traditional access control systems,such as Attribute-Based Access Control(ABAC)and Role-Based Access Control(RBAC),are limited in their ability to enforce access decisions due to the variability and dynamism of attributes related to users and resources.This paper proposes a method for enforcing access decisions that is adaptable and dynamic,based on multilayer hybrid deep learning techniques,particularly the Tabular Deep Neural Network Tabular DNN method.This technique transforms all input attributes in an access request into a binary classification(allow or deny)using multiple layers,ensuring accurate and efficient access decision-making.The proposed solution was evaluated using the Kaggle Amazon access control policy dataset and demonstrated its effectiveness by achieving a 94%accuracy rate.Additionally,the proposed solution enhances the implementation of access decisions based on a variety of resource and user attributes while ensuring privacy through indirect communication with the Policy Administration Point(PAP).This solution significantly improves the flexibility of access control systems,making themmore dynamic and adaptable to the evolving needs ofmodern organizations.Furthermore,it offers a scalable approach to manage the complexities associated with the BYOD environment,providing a robust framework for secure and efficient access management.展开更多
Feedforward control based on an accurate dynamic model is an effective approach to reduce the dynamic effect of the robot and improve its performance. However, due to the complicated work environment with considerable...Feedforward control based on an accurate dynamic model is an effective approach to reduce the dynamic effect of the robot and improve its performance. However, due to the complicated work environment with considerable uncertainty, it is difficult to obtain a high-precision dynamic model of the robot, which severely deteriorates the achievable control performance. This paper proposes an iterative learning method to accurately design the industrial feedforward controller and compensate for the external uncertain dynamic load of the robot. Based on a standard dynamic model, a complete linear feedforward controller is presented.An iterative design strategy is given to iteratively update the feedforward controller by combining the Moore-Penrose Inverse and the PID learning rate. Experiments are carried out on a 5-DOF industrial hybrid robot to validate the effectiveness of the proposed iterative learning method. The experiment results illustrate that the industrial feedforward controller can rapidly converge to the optimal controller and significantly improve the servo performance by using the proposed method. This paper provides an effective method for applying iterative learning control to an unopened industrial control system. It is very useful for the practical control of hybrid robots in industrial field.展开更多
基金supported by National Natural Science Foundation of China (No. 72103676)partially supported by the Fundamental Research Funds for the Central Universities
文摘An adaptive iterative learning control scheme is presented for a class of strict-feedback nonlinear time-delay systems, with unknown nonlinearly parameterised and time-varying disturbed functions of known periods. Radial basis function neural network and Fourier series expansion (FSE) are combined into a new function approximator to model each suitable disturbed function in systems. The requirement of the traditional iterative learning control algorithm on the nonlinear functions (such as global Lipschitz condition) is relaxed. Furthermore, by using appropriate Lyapunov-Krasovskii functionals, all signs in the closed loop system are guaranteed to be semiglobally uniformly ultimately bounded, and the output of the system is proved to converge to the desired trajectory. A simulation example is provided to illustrate the effectiveness of the control scheme.
基金supported by the National Natural Science Foundation of China(Grant Nos.62203342,62073254,92271101,62106186,and62103136)the Fundamental Research Funds for the Central Universities(Grant Nos.XJS220704,QTZX23003,and ZYTS23046)+1 种基金the Project funded by China Postdoctoral Science Foundation(Grant No.2022M712489)the Natural Science Basic Research Program of Shaanxi(Grant Nos.2023-JC-YB-585 and 2020JM-188)。
文摘In this paper,the problem of adaptive iterative learning based consensus control for periodically time-varying multi-agent systems is studied,in which the dynamics of each follower are driven by nonlinearly parameterized terms with periodic disturbances.Neural networks and Fourier base expansions are introduced to describe the periodically time-varying dynamic terms.On this basis,an adaptive learning parameter with a positively convergent series term is constructed,and a distributed control protocol based on local signals between agents is designed to ensure accurate consensus of the closed-loop systems.Furthermore,consensus algorithm is generalized to solve the formation control problem.Finally,simulation experiments are implemented through MATLAB to demonstrate the effectiveness of the method used.
基金supported by General Program (No. 60774022)State Key Program (No. 60834001) of National Natural Science Foundation of China
文摘In this paper, the stability of iterative learning control with data dropouts is discussed. By the super vector formulation, an iterative learning control (ILC) system with data dropouts can be modeled as an asynchronous dynamical system with rate constraints on events in the iteration domain. The stability condition is provided in the form of linear matrix inequalities (LMIS) depending on the stability of asynchronous dynamical systems. The analysis is supported by simulations.
基金supported by Specialized Research Fund for Doctoral Program of Higher Education of China (Grant No. 20091102120038)
文摘Iterative Learning Control (ILC) captures interests of many scholars because of its capability of high precision control implement without identifying plant mathematical models, and it is widely applied in control engineering. Presently, most ILC algorithms still follow the original ideas of ARIMOTO, in which the iterative-learning-rate is composed by the control error with its derivative and integral values. This kind of algorithms will result in inevitable problems such as huge computation, big storage capacity for algorithm data, and also weak robust. In order to resolve these problems, an improved iterative learning control algorithm with fixed step is proposed here which breaks the primary thought of ARIMOTO. In this algorithm, the control step is set only according to the value of the control error, which could enormously reduce the computation and storage size demanded, also improve the robust of the algorithm by not using the differential coefficient of the iterative learning error. In this paper, the convergence conditions of this proposed fixed step iterative learning algorithm is theoretically analyzed and testified. Then the algorithm is tested through simulation researches on a time-variant object with randomly set disturbance through calculation of step threshold value, algorithm robustness testing,and evaluation of the relation between convergence speed and step size. Finally the algorithm is validated on a valve-serving-cylinder system of a joint robot with time-variant parameters. Experiment results demonstrate the stability of the algorithm and also the relationship between step value and convergence rate. Both simulation and experiment testify the feasibility and validity of the new algorithm proposed here. And it is worth to noticing that this algorithm is simple but with strong robust after improvements, which provides new ideas to the research of iterative learning control algorithms.
文摘Industrial robot system is a kind of dynamic system w ith strong nonlinear coupling and high position precision. A lot of control ways , such as nonlinear feedbackdecomposition motion and adaptive control and so o n, have been used to control this kind of system, but there are some deficiencie s in those methods: some need accurate and some need complicated operation and e tc. In recent years, in need of controlling the industrial robots, aiming at com pletely tracking the ideal input for the controlled subject with repetitive character, a new research area, ILC (iterative learning control), has been devel oped in the control technology and theory. The iterative learning control method can make the controlled subject operate as desired in a definite time span, merely making use of the prior control experie nce of the system and searching for the desired control signal according to the practical and desired output signal. The process of searching is equal to that o f learning, during which we only need to measure the output signal to amend the control signal, not like the adaptive control strategy, which on line assesses t he complex parameters of the system. Besides, since the iterative learning contr ol relies little on the prior message of the subject, it has been well used in a lot of areas, especially the dynamic systems with strong non-linear coupling a nd high repetitive position precision and the control system with batch producti on. Since robot manipulator has the above-mentioned character, ILC can be very well used in robot manipulator. In the ILC, since the operation always begins with a certain initial state, init ial condition has been required in almost all convergence verification. Therefor e, in designing the controller, the initial state has to be restricted with some condition to guarantee the convergence of the algorithm. The settle of initial condition problem has long been pursued in the ILC. There are commonly two kinds of initial condition problems: one is zero initial error problem, another is non-zero initial error problem. In practice, the repe titive operation will invariably produce excursion of the iterative initial stat e from the desired initial state. As a result, the research on the second in itial problem has more practical meaning. In this paper, for the non-zero initial error problem, one novel robust ILC alg orithms, respectively combining PD type iterative learning control algorithm wit h the robust feedback control algorithm, has been presented. This novel robust ILC algorithm contain two parts: feedforward ILC algorithm and robust feedback algorithm, which can be used to restrain disturbance from param eter variation, mechanical nonlinearities and unmodeled dynamics and to achieve good performance as well. The feedforward ILC algorithm can be used to improve the tracking error and perf ormance of the system through iteratively learning from the previous operation, thus performing the tracking task very fast. The robust feedback algorithm could mainly be applied to make the real output of the system not deviate too much fr om the desired tracking trajectory, and guarantee the system’s robustness w hen there are exterior noises and variations of the system parameter. In this paper, in order to analyze the convergence of the algorithm, Lyapunov st ability theory has been used through properly selecting the Lyapunov function. T he result of the verification shows the feasibility of the novel robust iterativ e learning control in theory. Finally, aiming at the two-freedom rate robot, simulation has been made with th e MATLAB software. Furthermore, two groups of parameters are selected to validat e the robustness of the algorithm.
基金supported by the basic science research program through the National Research Foundation of Korea(NRF)(2020R1F1A1073395)the basic research project of the Korea Institute of Geoscience and Mineral Resources(KIGAM)(GP2021-011,GP2020-031,21-3117)funded by the Ministry of Science and ICT,Korea。
文摘This paper presents an innovative data-integration that uses an iterative-learning method,a deep neural network(DNN)coupled with a stacked autoencoder(SAE)to solve issues encountered with many-objective history matching.The proposed method consists of a DNN-based inverse model with SAE-encoded static data and iterative updates of supervised-learning data are based on distance-based clustering schemes.DNN functions as an inverse model and results in encoded flattened data,while SAE,as a pre-trained neural network,successfully reduces dimensionality and reliably reconstructs geomodels.The iterative-learning method can improve the training data for DNN by showing the error reduction achieved with each iteration step.The proposed workflow shows the small mean absolute percentage error below 4%for all objective functions,while a typical multi-objective evolutionary algorithm fails to significantly reduce the initial population uncertainty.Iterative learning-based manyobjective history matching estimates the trends in water cuts that are not reliably included in dynamicdata matching.This confirms the proposed workflow constructs more plausible geo-models.The workflow would be a reliable alternative to overcome the less-convergent Pareto-based multi-objective evolutionary algorithm in the presence of geological uncertainty and varying objective functions.
基金Supported by the National Natural Science Foundation of China (Grant Nos. 60404012, 60874049)the National High-Tech Research & Development Program of China (Grant No. 2007AA041402)+1 种基金the New Star of Science and Technology of Beijing City (Grant No. 2006A62)the IBM China Research Lab 2008 UR-Program
文摘An optimal iterative learning control (ILC) strategy of improving endpoint products in semi-batch processes is presented by combining a neural network model. Control affine feed-forward neural network (CAFNN) is proposed to build a model of semi-batch process. The main advantage of CAFNN is to obtain analytically its gradient of endpoint products with respect to input. Therefore, an optimal ILC law with direct error feedback is obtained explicitly, and the convergence of tracking error can be analyzed theoretically. It has been proved that the tracking errors may converge to small values. The proposed modeling and control strategy is illustrated on a simulated isothermal semi-batch reactor, and the results show that the endpoint products can be improved gradually from batch to batch.
基金the National Key Research and Development Program of China(2021ZD0112302)the National Natural Science Foundation of China(62222301,61890930-5,62021003)the Beijing Natural Science Foundation(JQ19013).
文摘This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control problems.It is shown that,initialized by the zero cost function,MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman(HJB)equation.Then,the stability of the system is analyzed using control policies generated by MsHDP.Also,a general stability criterion is designed to determine the admissibility of the current control policy.That is,the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP.Further,based on the convergence and the stability criterion,the integrated MsHDP algorithm using immature control policies is developed to accelerate learning efficiency greatly.Besides,actor-critic is utilized to implement the integrated MsHDP scheme,where neural networks are used to evaluate and improve the iterative policy as the parameter architecture.Finally,two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses those of other fixed or integrated methods.
基金Supported by National Natural Science Foundation of China(60774059)Project of Science &Technology Commission of Shanghaiunicipality(061107031,06DZ22011,06ZR14131)+1 种基金the Sunlight Plan Following Project of Shanghai Municipal Education Commission and Shanghai Edu-ational Development Foundation(06GG10)Shanghai Leading Academic Disciplines(T0103)
文摘Since the existing single-layer networked control systems have some inherent limitations and cannot effectively handle the problems associated with unreliable networks, a novel two-layer networked learning control system (NLCS) is proposed in this paper. Its lower layer has a number of local controllers that are operated independently, and its upper layer has a learning agent that communicates with the independent local controllers in the lower layer. To implement such a system, a packet-discard strategy is firstly developed to deal with network-induced delay and data packet loss. A cubic spline interpolator is then employed to compensate the lost data. Finally, the output of the learning agent based on a novel radial basis function neural network (RBFNN) is used to update the parameters of fuzzy controllers. A nonlinear heating, ventilation and air-conditioning (HVAC) system is used to demonstrate the feasibility and effectiveness of the proposed system.
文摘Norm optimal iterative learning control(NOILC) has recently been applied to iterative learning control(ILC) problems in which tracking is only required at a subset of isolated time points along the trial duration. This problem addresses the practical needs of many applications, including industrial automation, crane control, satellite positioning and motion control within a medical stroke rehabilitation context. This paper provides a substantial generalization of this framework by providing a solution to the problem of convergence at intermediate points with simultaneous tracking of subsets of outputs to reference trajectories on subintervals. This formulation enables the NOILC paradigm to tackle tasks which mix "point to point" movements with linear tracking requirements and hence substantially broadens the application domain to include automation tasks which include welding or cutting movements, or human motion control where the movement is restricted by the task to straight line and/or planar segments. A solution to the problem is presented in the framework of NOILC and inherits NOILC s well-defined convergence properties. Design guidelines and supporting experimental results are included.
文摘The autonomous mobile robots must be flexible to learn the new complex control behaviours in order to adapt effectively to a dynamic and varying environment. The proposed approach of this paper is to create a controller that learns the complex behaviours incorporating the learning from demonstration to reduce the search space and to improve the demonstrated task geometry by trial and corrections. The task faced by the robot has uncertainty that must be learned. Simulation results indicate that after the handful of trials, robot has learned the right policies and avoided the obstacles and reached the goal.
基金Supported by the National Science Foundation (U.S.A.) under Grant ECS-0355364
文摘This paper introduces a self-learning control approach based on approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950's for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, the applications of dynamic programming have been limited to simple and small problems. The key step in finding approximate solutions to dynamic programming is to estimate the performance index in dynamic programming. The optimal control signal can then be determined by minimizing (or maximizing) the performance index. Artificial neural networks are very efficient tools in representing the performance index in dynamic programming. This paper assumes the use of neural networks for estimating the performance index in dynamic programming and for generating optimal control signals, thus to achieve optimal control through self-learning.
基金Supported by the Scientific Research Foundation for the Returned 0verseas Chinese Scholars, State Education Ministry, and National Natural Science Foundation of China (60474005)
文摘Nonlinear loads in the power distribution system cause non-sinusoidal currents and voltages with harmonic components.Shunt active filters(SAF) with current controlled voltage source inverters(CCVSI) are usually used to obtain balanced and sinusoidal source currents by injecting compensation currents.However,CCVSI with traditional controllers have a limited transient and steady state performance.In this paper,we propose an adaptive dynamic programming(ADP) controller with online learning capability to improve transient response and harmonics.The proposed controller works alongside existing proportional integral(PI) controllers to efficiently track the reference currents in the d-q domain.It can generate adaptive control actions to compensate the PI controller.The proposed system was simulated under different nonlinear(three-phase full wave rectifier) load conditions.The performance of the proposed approach was compared with the traditional approach.We have also included the simulation results without connecting the traditional PI control based power inverter for reference comparison.The online learning based ADP controller not only reduced average total harmonic distortion by 18.41%,but also outperformed traditional PI controllers during transients.
文摘Several applications of machine learning and artificial intelligence,have acquired importance and come to the fore as a result of recent advances and improvements in these approaches.Autonomous cars are one such application.This is expected to have a significant and revolutionary influence on society.Integration with smart cities,new infrastructure and urban planning with sophisticated cyber-security are some of the current ramifications of self-driving automobiles.The autonomous automobile,often known as selfdriving systems or driverless vehicles,is a vehicle that can perceive its surroundings and navigate predetermined routes without human involvement.Cars are on the verge of evolving into autonomous robots,thanks to significant breakthroughs in artificial intelligence and related technologies,and this will have a wide range of socio-economic implications.However,in order for these automobiles to become a reality,they must be endowed with the perception and cognition necessary to deal with high-pressure real-life events and make proper judgments and take appropriate action.The majority of self-driving car technologies are based on computer systems that automate vehicle control parts.From forward-collision warning and antilock brakes to lane-keeping and adaptive drive control,to fully automated driving,these technological components have a wide range of capabilities.A self-driving car combines a wide range of sensors,actuators,and cameras.Recent researches on computer vision and deep learning are used to control autonomous driving systems.For self-driving automobiles,lane-keeping is crucial.This study presents a deep learning approach to obtain the proper steering angle to maintain the robot in the lane.We propose an advanced control for a selfdriving robot by using two controllers simultaneously.Convolutional neural networks(CNNs)are employed,to predict the car’and a proportionalintegral-derivative(PID)controller is designed for speed and steering control.This study uses a Raspberry PI based camera to control the robot car.
基金supported by the Innovation Project of Guangxi Graduate Education(Grant No.YCSW2022436).
文摘Considering the wheeled mobile robot(WMR)tracking problem with velocity saturation,we developed a data‐driven iterative learning double loop control method with constraints.First,the authors designed an outer loop controller to provide virtual velocity for the inner loop according to the position and pose tracking error of the WMR kinematic model.Second,the authors employed dynamic linearisation to transform the dynamic model into an online data‐driven model along the iterative domain.Based on the measured input and output data of the dynamic model,the authors identified the parameters of the inner loop controller.The authors considered the velocity saturation constraints;we adjusted the output velocity of the WMR online,providing effective solutions to the problem of velocity saltation and the saturation constraint in the tracking process.Notably,the inner loop controller only uses the output data and input of the dynamic model,which not only enables the reliable control of WMR trajectory tracking,but also avoids the influence of inaccurate model identification processes on the tracking performance.The authors analysed the algorithm's convergence in theory,and the results show that the tracking errors of position,angle and velocity can converge to zero in the iterative domain.Finally,the authors used a simulation to demonstrate the effectiveness of the algorithm.
基金supported in part by the National Natural Science Foundation of China(62222301, 62073085, 62073158, 61890930-5, 62021003)the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5)Beijing Natural Science Foundation (JQ19013)。
文摘Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
文摘In this paper,we propose a novel algorithm based on Zidan’s quantum computing model for remotely controlling the direction of a quantumcontrolledmobile robot equippedwith n-movements.The proposed algorithm is based on the measurement of concurrence value for the different movements of the robot.Consider a faraway robot that moves in the deep space(e.g.,moves toward a galaxy),and it is required to control the direction of this robot from a ground station by some person Alice.She sends an unknown qubitα|0)+β|1)via the teleportation protocol to the robot.Then,the proposed algorithm decodes the received unknown qubit into an angleθ,that determines the motion direction of the robot,based on the concurrence value.The proposed algorithm has been tested for four and eight movements.Two simulators have been tested;IBM Quantum composer and IBM’s system,The two simulators achieved the same result approximately.The motion of any part of the robot is considered,if it has a pre-existing sensor system and a locomotive system,.We can use this technique in many places like in space robots(16 directions).The results show that the proposed technique can be easily used for a huge number of movements.However,increasing the number of movements of the robot will increase the number of qubits.
基金partly supported by the University of Malaya Impact Oriented Interdisci-plinary Research Grant under Grant IIRG008(A,B,C)-19IISS.
文摘Organizations are adopting the Bring Your Own Device(BYOD)concept to enhance productivity and reduce expenses.However,this trend introduces security challenges,such as unauthorized access.Traditional access control systems,such as Attribute-Based Access Control(ABAC)and Role-Based Access Control(RBAC),are limited in their ability to enforce access decisions due to the variability and dynamism of attributes related to users and resources.This paper proposes a method for enforcing access decisions that is adaptable and dynamic,based on multilayer hybrid deep learning techniques,particularly the Tabular Deep Neural Network Tabular DNN method.This technique transforms all input attributes in an access request into a binary classification(allow or deny)using multiple layers,ensuring accurate and efficient access decision-making.The proposed solution was evaluated using the Kaggle Amazon access control policy dataset and demonstrated its effectiveness by achieving a 94%accuracy rate.Additionally,the proposed solution enhances the implementation of access decisions based on a variety of resource and user attributes while ensuring privacy through indirect communication with the Policy Administration Point(PAP).This solution significantly improves the flexibility of access control systems,making themmore dynamic and adaptable to the evolving needs ofmodern organizations.Furthermore,it offers a scalable approach to manage the complexities associated with the BYOD environment,providing a robust framework for secure and efficient access management.
基金the National Key Research and Development Program of China(Grant No.2017YFE0111300)EU H2020-MSCA-RISEECSASDPE(Grant No.734272)the National Natural Science Foundation of China(Grant No.51975321)。
文摘Feedforward control based on an accurate dynamic model is an effective approach to reduce the dynamic effect of the robot and improve its performance. However, due to the complicated work environment with considerable uncertainty, it is difficult to obtain a high-precision dynamic model of the robot, which severely deteriorates the achievable control performance. This paper proposes an iterative learning method to accurately design the industrial feedforward controller and compensate for the external uncertain dynamic load of the robot. Based on a standard dynamic model, a complete linear feedforward controller is presented.An iterative design strategy is given to iteratively update the feedforward controller by combining the Moore-Penrose Inverse and the PID learning rate. Experiments are carried out on a 5-DOF industrial hybrid robot to validate the effectiveness of the proposed iterative learning method. The experiment results illustrate that the industrial feedforward controller can rapidly converge to the optimal controller and significantly improve the servo performance by using the proposed method. This paper provides an effective method for applying iterative learning control to an unopened industrial control system. It is very useful for the practical control of hybrid robots in industrial field.