The underwater monopulse space-time adaptive track-before-detect method, which combines a space-time adaptive detector (STAD) with the track-before-detect algorithm based on dynamic programming (DP-TBD) and is denoted STAD-DP-TBD, can effectively detect low-speed weak targets. However, because the underwater environment is complex and variable, sufficient secondary data are difficult to obtain, which seriously degrades detection and tracking performance and leaves the algorithm with poor robustness. In this paper, based on the adaptive matched filter (AMF) test and the RAO test, underwater monopulse AMF-DP-TBD and RAO-DP-TBD algorithms that incorporate persymmetry and a symmetric spectrum, denoted PSAMF-DP-TBD and PS-RAO-DP-TBD, are proposed and compared with the AMF-DP-TBD and RAO-DP-TBD algorithms based on a persymmetric array, denoted P-AMF-DP-TBD and P-RAO-DP-TBD. The simulation results show that all four methods work normally with sufficient or slightly insufficient secondary data, but when the secondary data are severely insufficient, the P-AMF-DP-TBD and P-RAO-DP-TBD algorithms fail, while the PSAMF-DP-TBD and PS-RAO-DP-TBD algorithms retain good detection and tracking capabilities.
To improve the rejection of mismatched interferer signals, a new two-stage detector is proposed for homogeneous scenarios with an unknown covariance matrix. It is obtained by cascading the adaptive matched filter (AMF) detector and the enhanced RAO (EnRAO) detector. The new detector has the constant false alarm rate property, and closed-form expressions for its probability of false alarm and probability of detection are derived. Its performance is assessed and compared with that of other detectors. The results show that the proposed detector provides enhanced rejection capability under mismatch, at the cost of a slight performance loss under matched conditions.
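For orientation, the first stage of such a cascade can be written out. The block below gives the standard AMF decision statistic; the symbols (steering vector s, primary data z, secondary data z_k, sample covariance S, threshold eta) are notation assumed here and are not taken from the abstract. The EnRAO stage is not reproduced because the abstract does not state it.

```latex
% Standard AMF test (notation assumed for illustration):
%   s   : nominal steering vector
%   z   : primary data from the cell under test
%   z_k : K secondary (training) data vectors
t_{\mathrm{AMF}}
  = \frac{\left| \mathbf{s}^{H} \mathbf{S}^{-1} \mathbf{z} \right|^{2}}
         {\mathbf{s}^{H} \mathbf{S}^{-1} \mathbf{s}}
  \;\underset{H_0}{\overset{H_1}{\gtrless}}\; \eta_{\mathrm{AMF}},
\qquad
\mathbf{S} = \frac{1}{K} \sum_{k=1}^{K} \mathbf{z}_k \mathbf{z}_k^{H}
```

In a two-stage scheme of this kind, a target is typically declared only when both stage statistics exceed their respective thresholds, and the pair of thresholds trades matched detection performance against rejection of mismatched signals.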
For slowly varying, range-dependent environmental non-homogeneity, a new statistical space-time adaptive processing algorithm is proposed. It uses statistical methods, such as the Bayes or likelihood criterion, to estimate an approximate covariance matrix under non-homogeneous conditions. Based on the statistical characteristics of the space-time snapshot data, the aggregate snapshot data and the corresponding events are defined, the conditional probability that a space-time snapshot is effective training data is given, and the weighting coefficients for the weighting method are then obtained. Theoretical analysis indicates that estimating the covariance matrix with the Bayes and likelihood criteria is more reasonable than methods that estimate it from the training data with the detected outliers excluded. Simulations confirm that the proposed algorithms can accurately estimate the covariance under non-homogeneous conditions and have favorable characteristics.
A convenient implementation approach to space-time adaptive processing for airborne radar is proposed, which augments the 2-D Capon approach with some auxiliary array elements in the main-lobe clutter region. It is of practical use because of its small computational load, and it achieves ideal performance in the main-lobe clutter region. In addition, a variant that adds auxiliary beams in the main-lobe clutter region is also discussed.
This paper proposes a unified clutter model incorporating the effects of range walk and array rotation for space-time adaptive processing (STAP) in airborne multi-channel early-warning radar. Based on this clutter model, STAP performance is then analyzed from the perspective of covariance matrix tapering (CMT). For STAP performance degradation due to array rotation, a deterministic compensation method is proposed based on the CMT method. Numerical examples are provided to verify the analysis and the proposed compensation method.
In a non-homogeneous environment, traditional space-time adaptive processing does not effectively suppress interference and detect targets, because the secondary data do not exactly reflect the statistical characteristics of the range cell under test. A novel methodology that applies the direct data domain approach to space-time adaptive processing (STAP) in non-homogeneous airborne radar environments is presented. The deterministic least squares adaptive signal processing technique operates on a “snapshot-by-snapshot” basis to determine the adaptive weights for nulling interference and estimating the signal of interest (SOI). Furthermore, this approach removes the need to estimate the covariance from the data of neighboring range cells, which in turn avoids computing the inverse of the covariance, and it can be implemented to operate in real time. Simulation results illustrate the efficiency of interference suppression in a non-homogeneous environment.
This paper introduces preconditioned methods for Space-Time Adaptive Processing (STAP). Using the Block-Toeplitz-Toeplitz-Block (BTTB) structure of the clutter-plus-noise covariance matrix, a Block-Circulant-Circulant-Block (BCCB) preconditioner is constructed. Based on the preconditioner, a Preconditioned Multistage Wiener Filter (PMWF), which can be implemented by the Preconditioned Conjugate Gradient (PCG) method, is proposed. Simulation results show that the PMWF has a faster convergence rate and a lower processing rank than the MWF.
Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are presented, and the main results for discrete-time systems and continuous-time systems are surveyed in turn. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environments is discussed, covering event-based design, robust stabilization, and game design. Moreover, the extensions of ADP for addressing control problems in complex environments have attracted enormous attention. The ADP architecture is revisited from the perspective of data-driven and RL frameworks, showing how they significantly promote ADP formulation. Finally, several typical control applications of RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, this comprehensive survey on ADP and RL for advanced control applications demonstrates their remarkable potential in the artificial intelligence era, as well as their vital role in promoting environmental protection and industrial intelligence.
This paper investigates the adaptive trajectory tracking control problem and the unknown parameter identification problem for a class of rotor-missiles with parametric system uncertainties. First, considering the uncertainty of structural and aerodynamic parameters, the six-degree-of-freedom (6DoF) nonlinear equations describing the position and attitude dynamics of the rotor-missile are established in the inertial and body-fixed reference frames, respectively. Next, a hierarchical adaptive trajectory tracking controller that guarantees closed-loop stability is proposed according to the cascade characteristics of the 6DoF dynamics. Then, a memory-augmented update rule for the unknown parameters is proposed by integrating all historical data of the regression matrix. As long as the finite excitation condition is satisfied, precise identification of the unknown parameters can be achieved. Finally, the validity of the proposed trajectory tracking controller and parameter identification method is established through Lyapunov stability theory and numerical simulations.
The 2.5 m wide-field and high-resolution solar telescope (WeHoST) is currently under development for solar observations. WeHoST aims to achieve high-resolution observations over a super-wide field of view (FOV) of 5′×5′, with a desired resolution of 0.3″. To meet the scientific requirements of WeHoST, ground-layer adaptive optics (GLAO) with a specially designed wavefront sensing system is the primary consideration. We introduce the GLAO configuration, particularly the wavefront sensing scheme. Using an analytic method, we simulate the performance of both classical AO and GLAO systems, optimize the wavefront sensing system, and evaluate GLAO performance in terms of PSF uniformity and correction improvement across the whole FOV. The results indicate that classical AO will achieve diffraction-limited resolution, while the suggested GLAO configuration will uniformly improve the seeing across the full 5′×5′ FOV, reducing the FWHM across the axis FOV to less than 0.3″ (λ≥705 nm, r0≥11 cm), which is more than a twofold improvement. The specially designed wavefront sensing scheme offers new potential for WeHoST’s GLAO, particularly multi-FOV GLAO and the flexibility to select the detected area. These capabilities will significantly enhance the scientific output of the telescope.
An adaptive control approach is presented in this paper for tracking desired trajectories in interactive manipulators. The controller design incorporates prescribed performance functions (PPFs) to improve dynamic performance. Notably, the output error is confined within an envelope characterized by exponential convergence, leading to convergence to zero. This feature ensures a prompt response from admittance control and establishes a reliable safety framework for interactions. Simulation results provide practical insights, demonstrating the viability of the control scheme proposed in this paper.
We propose a fast, adaptive multiscale resolution spectral measurement method based on compressed sensing. The method can apply variable measurement resolution over the entire spectral range, reducing the measurement time by over 75% compared with a global high-resolution measurement. Mimicking the characteristics of the human retina, the resolution distribution decreases gradually. The system allows the spectral peaks of interest to be captured dynamically or to be specified a priori by a user. The system was tested by measuring single and dual spectral peaks, and the results for the spectral peaks are consistent with those of global high-resolution measurements.
Dynamic matrix square root (DMSR) problems are frequently encountered in many scientific and engineering fields. Although the original zeroing neural network is powerful for solving the DMSR, it cannot eliminate the influence of noise perturbations, and its constant-coefficient design scheme cannot accelerate convergence. Therefore, a noise-tolerant and adaptive coefficient zeroing neural network (NTACZNN) is proposed to enhance noise immunity and accelerate convergence simultaneously. The global convergence and robustness of the proposed NTACZNN are then analysed theoretically under an ideal environment and under noise-perturbed circumstances. Furthermore, some illustrative simulation examples are designed and performed to substantiate the efficacy and advantage of the NTACZNN for solving the DMSR problem. Compared with some existing ZNNs, the proposed NTACZNN performs better in terms of noise tolerance, solution accuracy, and convergence rate.
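To make the zeroing-neural-network idea concrete, here is a minimal sketch of the original constant-coefficient design, reduced to the scalar case x(t)^2 = a(t). It is not the NTACZNN of the abstract: the scalar reduction, the forward-Euler integration, the test signal, and the names `znn_sqrt` and `gamma` are all assumptions made for illustration.

```python
import math

def znn_sqrt(a, a_dot, x0, gamma=10.0, dt=1e-3, t_end=2.0):
    """Track x(t) with x(t)**2 = a(t) by Euler-integrating scalar ZNN dynamics.

    a, a_dot : callables giving a(t) and its time derivative (assumed known)
    gamma    : constant design coefficient (the quantity NTACZNN makes adaptive)
    """
    x, t = x0, 0.0
    for _ in range(int(round(t_end / dt))):
        e = x * x - a(t)  # error function e(t) = x^2 - a(t)
        # ZNN design formula de/dt = -gamma * e, with de/dt = 2*x*x_dot - a_dot
        x_dot = (a_dot(t) - gamma * e) / (2.0 * x)
        x += dt * x_dot   # forward-Euler step
        t += dt
    return x, t

# Hypothetical time-varying signal, strictly positive so a real root exists
a = lambda t: 2.0 + math.sin(t)
a_dot = lambda t: math.cos(t)

x_final, t_final = znn_sqrt(a, a_dot, x0=1.0)
residual = abs(x_final ** 2 - a(t_final))
print(f"x = {x_final:.4f}, residual = {residual:.2e}")
```

With a fixed `gamma`, the error decays at a fixed exponential rate and any additive noise in the dynamics passes straight into the solution, which are the two limitations the abstract attributes to the original design.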
Passive detection of low-slow-small (LSS) targets is easily disturbed by the direct signal and multipath clutter, and the traditional clutter suppression method suffers from a trade-off between step size and convergence rate. In this paper, a frequency-domain clutter suppression algorithm based on sparse adaptive filtering is proposed. The pulse compression operation between the error signal and the input reference signal is added to the cost function as a sparsity constraint, and the criterion for updating the filter weights is improved to obtain a purer echo signal. At the same time, the step size and penalty factor are brought into the adaptive iteration process, so that the input data drive the adaptive changes of parameters such as the step size. The proposed algorithm has a small computational load, improves robustness to parameters such as the step size, reduces the weight error of the filter, and achieves good clutter suppression performance.
This paper mainly focuses on the development of a learning-based controller for a class of uncertain mechanical systems modeled by the Euler-Lagrange formulation. The considered system can depict the behavior of a large class of engineering systems, such as vehicular systems, robot manipulators, and satellites. These systems are often characterized by highly nonlinear behavior, heavy modeling uncertainties, and unknown perturbations, so nonlinear control approaches that rely on an accurate model become inapplicable. Motivated by this challenge, a reinforcement learning (RL) adaptive control methodology based on the actor-critic framework is investigated to compensate for the uncertain mechanical dynamics. The approximation inaccuracies caused by RL and the exogenous unknown disturbances are handled via a continuous robust integral of the sign of the error (RISE) control approach. Unlike a classical RISE control law, a tanh(·) function is used instead of a sign(·) function to obtain a smoother control signal. The developed controller requires very little prior knowledge of the dynamic model, is robust to unknown dynamics and exogenous disturbances, and can achieve asymptotic output tracking. Finally, co-simulations through ADAMS and MATLAB/Simulink on a three degrees-of-freedom (3-DOF) manipulator and experiments on a real-time electromechanical servo system are performed to verify the performance of the proposed approach.
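The effect of replacing sign(·) with tanh(·) can be seen in a few lines. The sketch below compares only the switching term, not the full RISE law or the actor-critic compensator; the gain `beta`, the sharpness parameter `k`, and both function names are hypothetical choices made for illustration.

```python
import math

def sign_term(e, beta):
    """Discontinuous robust term beta * sign(e), as in a classical design."""
    return beta * ((e > 0) - (e < 0))

def tanh_term(e, beta, k=50.0):
    """Smooth surrogate beta * tanh(k * e); k is an assumed sharpness gain."""
    return beta * math.tanh(k * e)

beta = 2.0
# Jump in the control term as the error crosses zero by a tiny amount
jump_sign = abs(sign_term(1e-6, beta) - sign_term(-1e-6, beta))
jump_tanh = abs(tanh_term(1e-6, beta) - tanh_term(-1e-6, beta))
# Far from zero the two terms agree almost exactly
far_gap = abs(sign_term(1.0, beta) - tanh_term(1.0, beta))
print(jump_sign, jump_tanh, far_gap)
```

The sign term jumps by 2*beta across e = 0, which is what excites chattering, while the tanh term changes continuously; a larger `k` tracks the sign function more closely at the cost of a steeper slope near zero.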
Backlash nonlinearity inevitably exists in the actuators of the bidirectional stabilization system of an all-electric tank, and its effect becomes more drastic in high-maneuvering environments. In this work, the accurate tracking control problem for the bidirectional stabilization system of a moving all-electric tank with actuator backlash and unmodeled disturbance is solved. Using a smooth adaptive backlash inverse model, a nonlinear robust adaptive feedback control scheme is presented. The unknown parameters and the unmodeled disturbance are addressed separately through the derived parametric adaptive function and a continuous nonlinear robust term. Because the unknown backlash parameters are updated via the adaptive function, the backlash effect can be suppressed by the inverse operation, which ensures system stability. Meanwhile, the system disturbance in the high-maneuvering environment can be estimated online with the constructed adaptive law, which improves engineering practicality. Finally, Lyapunov-based analysis proves that the developed controller ensures that the tracking error converges asymptotically to zero even with unmodeled disturbance and unknown actuator backlash. Comparative co-simulations and experiments illustrate the advantages of the proposed approach.
This paper presents a novel cooperative value iteration (VI)-based adaptive dynamic programming method for multi-player differential game models, together with a convergence proof. The players are divided into two groups in the learning process and adapt their policies sequentially. Our method removes the dependence on admissible initial policies, which is one of the main drawbacks of policy iteration (PI)-based frameworks. Furthermore, the algorithm enables the players to adapt their control policies without full knowledge of the others’ system parameters or control laws. The efficacy of our method is illustrated by three examples.
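The point about admissible initial policies can be illustrated on a far simpler problem than the one in the abstract. The sketch below runs value iteration for a single-player, scalar, discrete-time linear-quadratic regulator, which is an assumed stand-in and not the cooperative multi-player differential-game algorithm; the system numbers and the name `value_iteration_lqr` are hypothetical.

```python
def value_iteration_lqr(a, b, q, r, tol=1e-12, max_iter=10_000):
    """Value iteration for x[k+1] = a*x[k] + b*u[k] with cost sum(q*x^2 + r*u^2).

    Starts from the zero value function, so no stabilizing (admissible)
    initial policy has to be supplied, unlike policy iteration.
    """
    p = 0.0
    for _ in range(max_iter):
        # Bellman backup on the quadratic value function V(x) = p * x^2
        p_next = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break
    k = a * b * p / (r + b * b * p)  # greedy feedback gain, u = -k * x
    return p, k

# Open-loop unstable plant (|a| > 1): the zero policy u = 0 is not admissible
a_sys, b_sys = 1.2, 1.0
p, k = value_iteration_lqr(a_sys, b_sys, q=1.0, r=1.0)
closed_loop = a_sys - b_sys * k
print(f"p = {p:.5f}, k = {k:.5f}, closed-loop pole = {closed_loop:.5f}")
```

Policy iteration on the same plant would need an initial gain that already stabilizes it, whereas the iteration above starts from zero and still converges to the stabilizing solution of the Riccati equation.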
To solve the problem of fusing prior information, such as track information and train status, in train positioning, an adaptive H∞ filtering algorithm with combination constraints is proposed, which fuses prior information with other sensor information in the form of constraints. First, a precise track constraint method for the train is proposed, and the plane position constraint and the train motion state constraints are analysed. A model for combining prior information with constraints is established. Then, an adaptive H∞ filter with combination constraints is derived based on an adaptive adjustment method for the robustness factor. Finally, the positioning performance of the proposed algorithm is simulated and analysed on a straight track and a curved track. The results show that the positioning accuracy of the algorithm with constrained filtering is significantly better than that of the algorithm without it, and that constrained filtering performs better still when combined with track and condition information, significantly reducing the train positioning error. The effectiveness of the proposed algorithm is thus verified.
This paper proposes an adaptive neural network sliding mode control based on a fractional-order ultra-local model for an n-DOF upper-limb exoskeleton in the presence of uncertainties, external disturbances, and input deadzone. Considering the model complexity and input deadzone, a fractional-order ultra-local model is proposed to reformulate the original dynamic system for simple controller design. First, the control gain of the ultra-local model is treated as a constant. A fractional-order sliding mode technique is designed to stabilize the closed-loop system, while fractional-order time-delay estimation is combined with a neural network to estimate the lumped disturbance. Correspondingly, a fractional-order ultra-local model-based neural network sliding mode controller (FO-NNSMC) is proposed. Second, to avoid the adverse effect of improper gain selection on control performance, the control gain of the ultra-local model is treated as an unknown parameter. The Nussbaum technique is then introduced into the FO-NNSMC to deal with the stability problem under unknown gain. Correspondingly, a fractional-order ultra-local model-based adaptive neural network sliding mode controller (FO-ANNSMC) is proposed. Moreover, the stability of the closed-loop system with the proposed method is analysed using Lyapunov theory. Finally, co-simulations on a virtual prototype of the 7-DOF iReHave upper-limb exoskeleton and experiments on a 2-DOF upper-limb exoskeleton yield comparative results that illustrate the effectiveness and superiority of the proposed method.
As government agencies continue to tighten emissions regulations due to the continued increase in greenhouse gas production, automotive industries are seeking to produce increasingly efficient vehicle technology. Hybrid electric vehicles (HEVs) have been introduced to mitigate these problems while improving fuel economy. HEVs have created demand for more advanced controls software that considers multiple components for propulsive power in a vehicle. A large part of the software development process is the implementation of an optimal energy management strategy meant to improve the overall fuel efficiency of the vehicle. Optimal strategies can be implemented when driving conditions are known a priori. The Equivalent Consumption Minimization Strategy (ECMS) is an optimal control strategy that uses an equivalence factor to equate electrical to mechanical power when determining the torque split between the internal combustion engine and the electric motor for propulsive and regenerative torque. This equivalence factor is determined from offline vehicle simulations using a sensitivity analysis to provide optimal fuel economy results while maintaining predetermined high voltage battery state of charge (SOC) constraints. When the control hierarchy is modified or different driving styles are applied, the analysis must be redone to update the equivalence factor. The goal of this work is to implement a fuzzy logic controller that dynamically updates the equivalence factor to improve fuel economy, maintain a strict charge-sustaining window of operation for the high voltage battery, and reduce the computational time required during algorithm development.
The adaptive algorithm is validated against global optimum fuel economy and charge-sustaining results from a sensitivity analysis performed for multiple drive cycles. Results show a maximum fuel economy improvement of 9.82% when using a mild driving style and a 95% success rate in maintaining an ending SOC within 5% of the desired SOC regardless of starting SOC.
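The role of the equivalence factor can be sketched with a toy power split. The example below is a rough illustration under strong assumptions: a constant engine efficiency, a lossless battery path, propulsion only, and a proportional SOC correction standing in for the fuzzy logic controller described in the abstract. All names and numeric values (`Q_LHV`, `eta_engine`, `gain`, the power request) are hypothetical.

```python
Q_LHV = 43.0e6  # J/kg, assumed lower heating value of the fuel

def ecms_split(p_req, s, eta_engine=0.35, n_grid=11):
    """Return the battery power share u in [0, 1] that minimizes the
    ECMS equivalent fuel rate for a propulsive power request p_req (W)."""
    best_u, best_cost = 0.0, float("inf")
    for i in range(n_grid):
        u = i / (n_grid - 1)
        p_batt = u * p_req               # power drawn from the battery
        p_eng = (1.0 - u) * p_req        # power delivered by the engine
        fuel_rate = p_eng / (eta_engine * Q_LHV)     # kg/s actually burned
        equiv_rate = fuel_rate + s * p_batt / Q_LHV  # plus electrical equivalent
        if equiv_rate < best_cost:
            best_u, best_cost = u, equiv_rate
    return best_u

def adapt_equivalence_factor(s, soc, soc_ref, gain=2.0):
    """Proportional stand-in for the fuzzy update: raise s when SOC is low."""
    return s + gain * (soc_ref - soc)

u_cheap_electric = ecms_split(20e3, s=2.5)  # electricity looks cheap
u_dear_electric = ecms_split(20e3, s=3.5)   # electricity looks expensive
s_new = adapt_equivalence_factor(2.5, soc=0.4, soc_ref=0.6)
u_after_update = ecms_split(20e3, s_new)
print(u_cheap_electric, u_dear_electric, s_new, u_after_update)
```

With a constant engine efficiency the split is all-or-nothing around s = 1/eta_engine; a real efficiency map gives intermediate splits. The update step shows the mechanism the abstract relies on: a low SOC raises the equivalence factor, which pushes the split back toward the engine and keeps operation charge sustaining.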
Funding (underwater monopulse STAD-DP-TBD paper): supported by the National Natural Science Foundation of China (No. 61971412).
Funding (two-stage AMF/EnRAO detector paper): supported by the National Natural Science Foundation of China (No. 61971412).
Funding (statistical STAP for range-dependent non-homogeneity paper): supported by the National Post-doctor Foundation (No. 20090451251) and the Shaanxi Industry Surmount Foundation (2009K08-31) of China.
Funding (auxiliary-element 2-D Capon STAP paper): National Nature Science Foundation; National Defense Research Funds.
Funding (unified clutter model and CMT paper): supported by the National Natural Science Foundation of China (60901056).
Funding (preconditioned multistage Wiener filter paper): the Innovation Foundation of NUDT for Ph.D. graduates.
Funding (ADP and RL survey): supported in part by the National Natural Science Foundation of China (62222301, 62073085, 62073158, 61890930-5, 62021003), the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5), and the Beijing Natural Science Foundation (JQ19013).
Funding: Partially supported by the Natural Science Foundation of China (Grant Nos. 62103052, 52272358) and by the Beijing Institute of Technology Research Fund Program for Young Scholars.
Abstract: This paper investigates the adaptive trajectory tracking control problem and the unknown parameter identification problem for a class of rotor-missiles with parametric system uncertainties. First, considering the uncertainty of structural and aerodynamic parameters, the six-degree-of-freedom (6-DoF) nonlinear equations describing the position and attitude dynamics of the rotor-missile are established in the inertial and body-fixed reference frames, respectively. Next, a hierarchical adaptive trajectory tracking controller that can guarantee closed-loop stability is proposed according to the cascade characteristics of the 6-DoF dynamics. Then, a memory-augmented update rule for the unknown parameters is proposed by integrating all historical data of the regression matrix. As long as the finite excitation condition is satisfied, precise identification of the unknown parameters can be achieved. Finally, the validity of the proposed trajectory tracking controller and the parameter identification method is verified through Lyapunov stability theory and numerical simulations.
Funding: Supported by the National Natural Science Foundation of China (12103057, 12127901), the Frontier Research Fund of the Institute of Optics and Electronics, Chinese Academy of Sciences (C21K002), the Youth Innovation Promotion Association, Chinese Academy of Sciences (2021378), and the National Natural Science Foundation of China (U2031148).
Abstract: The 2.5 m wide-field and high-resolution solar telescope (WeHoST) is currently under development for solar observations. WeHoST aims to achieve high-resolution observations over a super-wide field of view (FOV) of 5′×5′, with a desired resolution of 0.3″. To meet the scientific requirements of WeHoST, ground-layer adaptive optics (GLAO) with a specially designed wavefront sensing system is the primary consideration. We introduce the GLAO configuration, particularly the wavefront sensing scheme. Using an analytic method, we simulate the performance of both classical AO and GLAO systems, optimize the wavefront sensing system, and evaluate GLAO performance in terms of PSF uniformity and correction improvement across the whole FOV. The results indicate that classical AO will achieve diffraction-limited resolution; the suggested GLAO configuration will uniformly improve the seeing across the full 5′×5′ FOV, reducing the FWHM across the axis FOV to less than 0.3″ (λ≥705 nm, r0≥11 cm), which is more than a twofold improvement. The specially designed wavefront sensor scheme offers new potential for WeHoST's GLAO, particularly multi-FOV GLAO and the flexibility to select the detected area. These capabilities will significantly enhance the scientific output of the telescope.
Funding: Supported by the National Natural Science Foundation of China (62073197, 61933006) and the National International Science and Technology Cooperation Base on Railway Vehicle Operation Engineering of Beijing Jiaotong University (BMRV20KF08).
Abstract: An adaptive control approach is presented in this paper for tracking desired trajectories in interactive manipulators. The controller design incorporates prescribed performance functions (PPFs) to improve dynamic performance. Notably, the output error is confined within an envelope characterized by exponential convergence, so that the error converges to zero. This feature ensures a prompt response from admittance control and establishes a reliable safety framework for interactions. Simulation results provide practical insights, demonstrating the viability of the control scheme proposed in this paper.
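An exponentially converging error envelope of this kind can be sketched numerically. The abstract does not give the paper's PPF, so the form below, rho(t) = (rho0 - rho_inf) * exp(-decay * t) + rho_inf, is the one commonly used in the prescribed performance literature and is an assumption here, as are all parameter values and the synthetic error signal.

```python
import numpy as np

def performance_bound(t, rho0=1.0, rho_inf=0.05, decay=2.0):
    """Assumed PPF: decays exponentially from rho0 toward rho_inf.

    Setting rho_inf = 0 makes the envelope itself shrink to zero.
    """
    return (rho0 - rho_inf) * np.exp(-decay * t) + rho_inf

t = np.linspace(0.0, 5.0, 501)
rho = performance_bound(t)

# Hypothetical tracking error, used only to show the containment check.
error = 0.8 * np.exp(-3.0 * t) * np.cos(4.0 * t)

# Prescribed performance requires -rho(t) < e(t) < rho(t) at every instant.
inside = bool(np.all(np.abs(error) < rho))
print("error stays inside the envelope:", inside)
```

The decay rate bounds how fast the transient must die out, while `rho_inf` sets the largest steady-state error that is tolerated.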
Funding: Project supported by the Natural Science Foundation of Shandong Province, China (Grant Nos. ZR2020MF119 and ZR2020MA082), the National Natural Science Foundation of China (Grant No. 62002208), and the National Key Research and Development Program of China (Grant No. 2018YFB0504302).
Abstract: We propose a fast, adaptive multiscale resolution spectral measurement method based on compressed sensing. The method can apply variable measurement resolution over the entire spectral range to reduce the measurement time by over 75% compared to a global high-resolution measurement. Mimicking the characteristics of the human retina, the resolution distribution follows a gradually decreasing profile. The system allows the spectral peaks of interest to be captured dynamically or to be specified a priori by a user. The system was tested by measuring single and dual spectral peaks, and the measured peaks are consistent with those of global high-resolution measurements.
Funding: Natural Science Foundation of Guangdong Province, Grant/Award Number: 2021A1515011847; Special Project in Key Fields of Universities in Department of Education of Guangdong Province, Grant/Award Number: 2019KZDZX1036; Demonstration Bases for Joint Training of Postgraduates of Department of Education of Guangdong Province, Grant/Award Number: 202205; Key Lab of Digital Signal and Image Processing of Guangdong Province, Grant/Award Number: 2019GDDSIPL-01; Innovation and Entrepreneurship Training Program for College Students of Guangdong Ocean University, Grant/Award Number: 202210566028; Postgraduate Education Innovation Plan Project of Guangdong Ocean University, Grant/Award Numbers: 202214, 202250, 202251, 202160.
Abstract: Dynamic matrix square root (DMSR) problems are frequently encountered in many scientific and engineering fields. Although the original zeroing neural network is powerful for solving the DMSR, it cannot eliminate the influence of noise perturbations, and its constant-coefficient design scheme cannot accelerate the convergence speed. Therefore, a noise-tolerate and adaptive coefficient zeroing neural network (NTACZNN) is proposed to enhance noise immunity and accelerate convergence simultaneously. Then, the global convergence and robustness of the proposed NTACZNN are theoretically analysed under an ideal environment and under noise-perturbed circumstances. Furthermore, some illustrative simulation examples are designed and performed to substantiate the efficacy and advantage of the NTACZNN for solving the DMSR problem. Compared with some existing ZNNs, the proposed NTACZNN shows better performance in terms of noise tolerance, solution accuracy, and convergence rate.
Abstract: Passive detection of low-slow-small (LSS) targets is easily disturbed by the direct signal and multipath clutter, and the traditional clutter suppression method suffers from a trade-off between step size and convergence rate. In this paper, a frequency-domain clutter suppression algorithm based on sparse adaptive filtering is proposed. The pulse compression operation between the error signal and the input reference signal is added to the cost function as a sparsity constraint, and the criterion for filter weight updating is improved to obtain a purer echo signal. At the same time, the step size and penalty factor are brought into the adaptive iteration process, and the input data is used to drive the adaptive changes of parameters such as the step size. The proposed algorithm has a low computational cost, improves robustness to parameters such as the step size, reduces the weight error of the filter, and has good clutter suppression performance.
Funding: Supported in part by the National Key R&D Program of China under Grant 2021YFB2011300 and the National Natural Science Foundation of China under Grant 52075262.
Abstract: This paper mainly focuses on the development of a learning-based controller for a class of uncertain mechanical systems modeled by the Euler-Lagrange formulation. The considered system can depict the behavior of a large class of engineering systems, such as vehicular systems, robot manipulators and satellites. All these systems are often characterized by highly nonlinear characteristics, heavy modeling uncertainties and unknown perturbations; therefore, accurate-model-based nonlinear control approaches become unavailable. Motivated by this challenge, a reinforcement learning (RL) adaptive control methodology based on the actor-critic framework is investigated to compensate for the uncertain mechanical dynamics. The approximation inaccuracies caused by RL and the exogenous unknown disturbances are circumvented via a continuous robust integral of the sign of the error (RISE) control approach. Different from a classical RISE control law, a tanh(·) function is utilized instead of a sign(·) function to obtain a smoother control signal. The developed controller requires very little prior knowledge of the dynamic model, is robust to unknown dynamics and exogenous disturbances, and can achieve asymptotic output tracking. Finally, co-simulations through ADAMS and MATLAB/Simulink on a three degrees-of-freedom (3-DOF) manipulator and experiments on a real-time electromechanical servo system are performed to verify the performance of the proposed approach.
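Why tanh(·) yields a smoother signal than sign(·) can be seen by evaluating both near zero error. The snippet below is only an illustration of the two functions, not the paper's RISE law; the gain `beta`, the slope `k`, and the sample points are hypothetical.

```python
import numpy as np

def sign_term(e, beta=1.0):
    """Discontinuous robust term: jumps between -beta and +beta at e = 0."""
    return beta * np.sign(e)

def tanh_term(e, beta=1.0, k=10.0):
    """Smooth replacement: continuous through e = 0, saturating at +/-beta."""
    return beta * np.tanh(k * e)

# Small errors on both sides of zero, where chattering would originate.
e = np.linspace(-0.01, 0.01, 5)

# Largest change between neighbouring samples for each term.
jump_sign = float(np.max(np.abs(np.diff(sign_term(e)))))
jump_tanh = float(np.max(np.abs(np.diff(tanh_term(e)))))

print("largest step with sign:", jump_sign)
print("largest step with tanh:", jump_tanh)
```

A larger `k` makes the tanh term approach the sign function, so the slope trades smoothness against how closely the original robust term is reproduced.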
Funding: The National Natural Science Foundation of China (Nos. 52275062 and 52075262).
Abstract: Backlash nonlinearity inevitably exists in the actuators of the bidirectional stabilization system of an all-electric tank, and it behaves more drastically in high-maneuvering environments. In this work, the accurate tracking control problem for the bidirectional stabilization system of a moving all-electric tank with actuator backlash and unmodeled disturbance is solved. By utilizing a smooth adaptive backlash inverse model, a nonlinear robust adaptive feedback control scheme is presented. The unknown parameters and the unmodeled disturbance are addressed separately through the derived parametric adaptive function and the continuous nonlinear robust term. Because the unknown backlash parameters are updated via the adaptive function and the backlash effect is suppressed by the inverse operation, system stability is ensured. Meanwhile, the system disturbance in the high-maneuvering environment can be estimated online with the constructed adaptive law, improving engineering practicality. Finally, Lyapunov-based analysis proves that the developed controller ensures that the tracking error asymptotically converges to zero even with unmodeled disturbance and unknown actuator backlash. Comparative co-simulations and experiments illustrate the advantages of the proposed approach.
Funding: Supported by the Industry-University-Research Cooperation Fund Project of the Eighth Research Institute of China Aerospace Science and Technology Corporation (USCAST2022-11) and the Aeronautical Science Foundation of China (20220001057001).
Abstract: This paper presents a novel cooperative value iteration (VI)-based adaptive dynamic programming method for multi-player differential game models, with a convergence proof. The players are divided into two groups in the learning process and adapt their policies sequentially. Our method removes the dependence on admissible initial policies, which is one of the main drawbacks of PI-based frameworks. Furthermore, this algorithm enables the players to adapt their control policies without full knowledge of the others' system parameters or control laws. The efficacy of our method is illustrated by three examples.
Funding: The National Natural Science Fund of China (61471080) and the Training Plan for Young Backbone Teachers in Colleges and Universities of Henan Province (2018GGJS171).
Abstract: To solve the problem of fusing prior information, such as track information and train status, in train positioning, an adaptive H∞ filtering algorithm with combination constraints is proposed, which fuses prior information with other sensor information in the form of constraints. Firstly, a precise track constraint method for the train is proposed, and the plane position constraint and the train motion state constraints are analysed. A model for combining prior information with constraints is established. Then an adaptive H∞ filter with combination constraints is derived based on an adaptive adjustment method for the robustness factor. Finally, the positioning performance of the proposed algorithm is simulated and analysed for a straight track and a curved track. The results show that the positioning accuracy of the algorithm with constrained filtering is significantly better than that of the algorithm without constrained filtering, and that the algorithm with constrained filtering achieves better performance when combined with track and condition information, significantly reducing the train positioning error. The effectiveness of the proposed algorithm is thus verified.
Funding: Supported in part by the National Natural Science Foundation of China (62173182, 61773212) and the Intergovernmental International Science and Technology Innovation Cooperation Key Project of Chinese National Key R&D Program (2021YFE0102700).
Abstract: This paper proposes an adaptive neural network sliding mode control based on a fractional-order ultra-local model for an n-DOF upper-limb exoskeleton in the presence of uncertainties, external disturbances and input deadzone. Considering the model complexity and input deadzone, a fractional-order ultra-local model is proposed to formulate the original dynamic system for simple controller design. Firstly, the control gain of the ultra-local model is considered as a constant. The fractional-order sliding mode technique is designed to stabilize the closed-loop system, while fractional-order time-delay estimation is combined with a neural network to estimate the lumped disturbance. Correspondingly, a fractional-order ultra-local model-based neural network sliding mode controller (FO-NNSMC) is proposed. Secondly, to avoid the adverse effect of improper gain selection on the control performance, the control gain of the ultra-local model is considered as an unknown parameter. Then, the Nussbaum technique is introduced into the FO-NNSMC to deal with the stability problem with unknown gain. Correspondingly, a fractional-order ultra-local model-based adaptive neural network sliding mode controller (FO-ANNSMC) is proposed. Moreover, the stability analysis of the closed-loop system with the proposed method is presented using Lyapunov theory. Finally, co-simulations on a virtual prototype of the 7-DOF iReHave upper-limb exoskeleton and experiments on a 2-DOF upper-limb exoskeleton are carried out, and the comparative results illustrate the effectiveness and superiority of the proposed method.
Abstract: As government agencies continue to tighten emissions regulations due to the continued increase in greenhouse gas production, automotive industries are seeking to produce increasingly efficient vehicle technology. Hybrid electric vehicles (HEVs) have been introduced to mitigate these problems while improving fuel economy. HEVs have created demand for more advanced control software that considers multiple components for propulsive power in a vehicle. A large part of the software development process is the implementation of an optimal energy management strategy meant to improve the overall fuel efficiency of the vehicle. Optimal strategies can be implemented when driving conditions are known a priori. The Equivalent Consumption Minimization Strategy (ECMS) is an optimal control strategy that uses an equivalence factor to equate electrical to mechanical power when performing torque split determination between the internal combustion engine and electric motor for propulsive and regenerative torque. This equivalence factor is determined from offline vehicle simulations using a sensitivity analysis to provide optimal fuel economy results while maintaining predetermined high voltage battery state of charge (SOC) constraints. When the control hierarchy is modified or different driving styles are applied, the analysis must be redone to update the equivalence factor. The goal of this work is to implement a fuzzy logic controller that dynamically updates the equivalence factor to improve fuel economy, maintain a strict charge sustaining window of operation for the high voltage battery, and reduce computational time required during algorithm development. The adaptive algorithm is validated against global optimum fuel economy and charge sustaining results from a sensitivity analysis performed for multiple drive cycles.
Results show a maximum fuel economy improvement of 9.82% when using a mild driving style and a 95% success rate when maintaining an ending SOC within 5% of the desired SOC regardless of starting SOC.
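The role of the equivalence factor in the torque split can be sketched with a toy cost comparison. This is not the paper's controller: it assumes the common ECMS cost form (fuel rate plus battery power scaled by the equivalence factor `s` and divided by the fuel's lower heating value), an approximate heating value for gasoline, and made-up operating points. All names and numbers are hypothetical.

```python
# Assumed lower heating value of gasoline, roughly 43 MJ/kg (43,000 J/g).
LHV_J_PER_G = 43_000.0

def equivalent_fuel_rate(fuel_gps, batt_w, s):
    """Engine fuel rate plus battery power expressed as an equivalent fuel rate (g/s)."""
    return fuel_gps + s * batt_w / LHV_J_PER_G

def best_candidate(candidates, s):
    """Pick the torque split with the lowest equivalent fuel rate."""
    return min(candidates, key=lambda c: equivalent_fuel_rate(c["fuel_gps"], c["batt_w"], s))

# Hypothetical ways to meet the same power demand at one time step.
CANDIDATES = [
    {"engine_share": 0.0, "fuel_gps": 0.0, "batt_w": 20_000.0},  # electric only
    {"engine_share": 0.5, "fuel_gps": 0.7, "batt_w": 10_000.0},  # blended
    {"engine_share": 1.0, "fuel_gps": 1.5, "batt_w": 0.0},       # engine only
]

# A low equivalence factor makes battery energy look cheap; a high one makes it expensive.
low_s_choice = best_candidate(CANDIDATES, s=2.5)["engine_share"]
high_s_choice = best_candidate(CANDIDATES, s=4.0)["engine_share"]

print("engine share chosen with s=2.5:", low_s_choice)
print("engine share chosen with s=4.0:", high_s_choice)
```

Because the chosen split flips as `s` changes, a controller that adjusts `s` online (for example from the SOC deviation, as a fuzzy logic controller might) can steer the battery back toward its charge-sustaining window.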