Journal literature: 183 articles found
1. Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications (cited by: 4)
Authors: Ding Wang, Ning Gao, Derong Liu, Jinna Li, Frank L. Lewis. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2024, No. 1, pp. 18-36 (19 pages).
Abstract: Reinforcement learning (RL) has roots in dynamic programming and is called adaptive/approximate dynamic programming (ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are presented, where the main results for discrete-time systems and continuous-time systems are surveyed, respectively. Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environments is discussed, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environments attract enormous attention. The ADP architecture is revisited from the perspective of data-driven and RL frameworks, showing how they promote ADP formulation significantly. Finally, several typical control applications of RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has demonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
Keywords: adaptive dynamic programming (ADP); advanced control; complex environment; data-driven control; event-triggered design; intelligent control; neural networks; nonlinear systems; optimal control; reinforcement learning (RL)
2. Adaptable and Dynamic Access Control Decision-Enforcement Approach Based on Multilayer Hybrid Deep Learning Techniques in BYOD Environment
Authors: Aljuaid Turkea Ayedh M, Ainuddin Wahid Abdul Wahab, Mohd Yamani Idna Idris. Computers, Materials & Continua (SCIE, EI), 2024, No. 9, pp. 4663-4686 (24 pages).
Abstract: Organizations are adopting the Bring Your Own Device (BYOD) concept to enhance productivity and reduce expenses. However, this trend introduces security challenges, such as unauthorized access. Traditional access control systems, such as Attribute-Based Access Control (ABAC) and Role-Based Access Control (RBAC), are limited in their ability to enforce access decisions due to the variability and dynamism of attributes related to users and resources. This paper proposes a method for enforcing access decisions that is adaptable and dynamic, based on multilayer hybrid deep learning techniques, particularly the Tabular Deep Neural Network (TabularDNN) method. This technique transforms all input attributes in an access request into a binary classification (allow or deny) using multiple layers, ensuring accurate and efficient access decision-making. The proposed solution was evaluated using the Kaggle Amazon access control policy dataset and demonstrated its effectiveness by achieving a 94% accuracy rate. Additionally, the proposed solution enhances the implementation of access decisions based on a variety of resource and user attributes while ensuring privacy through indirect communication with the Policy Administration Point (PAP). This solution significantly improves the flexibility of access control systems, making them more dynamic and adaptable to the evolving needs of modern organizations. Furthermore, it offers a scalable approach to manage the complexities associated with the BYOD environment, providing a robust framework for secure and efficient access management.
Keywords: BYOD; security; access control; access control decision-enforcement; deep learning; neural network techniques; TabularDNN; multilayer; dynamic; adaptable; flexibility; bottlenecks; performance; policy conflict
3. Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control (cited by: 3)
Authors: Ding Wang, Jiangyu Wang, Mingming Zhao, Peng Xin, Junfei Qiao. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2023, No. 9, pp. 1797-1809 (13 pages).
Abstract: This paper is concerned with a novel integrated multi-step heuristic dynamic programming (MsHDP) algorithm for solving optimal control problems. It is shown that, initialized by the zero cost function, MsHDP can converge to the optimal solution of the Hamilton-Jacobi-Bellman (HJB) equation. Then, the stability of the system is analyzed using control policies generated by MsHDP. Also, a general stability criterion is designed to determine the admissibility of the current control policy. That is, the criterion is applicable not only to traditional value iteration and policy iteration but also to MsHDP. Further, based on the convergence and the stability criterion, the integrated MsHDP algorithm using immature control policies is developed to greatly accelerate learning. In addition, an actor-critic structure is utilized to implement the integrated MsHDP scheme, where neural networks are used as the parametric architecture to evaluate and improve the iterative policy. Finally, two simulation examples are given to demonstrate that the learning effectiveness of the integrated MsHDP scheme surpasses that of other fixed or integrated methods.
Keywords: adaptive critic; artificial neural networks; Hamilton-Jacobi-Bellman (HJB) equation; multi-step heuristic dynamic programming; multi-step reinforcement learning; optimal control
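Illustration for entry 3: the statement that an iteration started from the zero cost function converges to the HJB solution can be checked by hand on a scalar linear-quadratic problem, where the HJB equation reduces to the discrete-time algebraic Riccati equation. The block below is only a minimal sketch of plain one-step value iteration, not the MsHDP algorithm of the paper; the function names and the plant parameters `a`, `b`, `q`, `r` are assumptions introduced for illustration.

```python
def value_iteration_lqr(a, b, q, r, iters=200):
    """One-step value iteration for x[k+1] = a*x[k] + b*u[k]
    with stage cost q*x^2 + r*u^2 (hypothetical scalar plant).

    The iterative value function is V_i(x) = p_i * x^2, started from
    the zero cost function (p_0 = 0).
    """
    p = 0.0
    history = [p]
    for _ in range(iters):
        # Minimizing q*x^2 + r*u^2 + p*(a*x + b*u)^2 over u gives
        # u = -(a*b*p / (r + b*b*p)) * x and the update below.
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        history.append(p)
    return p, history


def riccati_residual(p, a, b, q, r):
    """Residual of the scalar discrete-time algebraic Riccati equation."""
    return q + a * a * p - (a * b * p) ** 2 / (r + b * b * p) - p
```

Calling `value_iteration_lqr(1.2, 1.0, 1.0, 1.0)` on this open-loop unstable but stabilizable plant yields a nondecreasing sequence `p_i` whose limit makes `riccati_residual` vanish up to rounding, which is the scalar analogue of converging to the HJB solution.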
4. Adaptive learning tracking control of robotic manipulators with uncertainties
Authors: Keng Peng TEE. Journal of Control Theory and Applications (《控制理论与应用(英文版)》; EI), 2010, No. 2, pp. 160-165 (6 pages).
Abstract: An adaptive learning tracking control scheme is developed for robotic manipulators by a synthesis of adaptive control and learning control approaches. The proposed controller possesses both adaptive and learning properties and is thereby able to handle robotic systems with both time-varying periodic uncertainties and time-invariant parameters. Theoretical proofs are established to show that the proposed controllers ensure asymptotic tracking performance. The effectiveness of the proposed approaches is validated through extensive numerical simulation results.
Keywords: adaptive control; learning control; robotic dynamic systems; uncertainties
5. Mobility-Aware Adaptive Beam Tracking for Vehicles in mmWave Communication Networks (cited by: 1)
Authors: Jin Xu, Ying Zhou, Jian Zhang, Yuchong Tang, Xiaofeng Tao. China Communications (SCIE, CSCD), 2023, No. 3, pp. 161-174 (14 pages).
Abstract: The millimeter wave (mmWave) band is a potential solution for high data rate communication due to its large available bandwidth. However, it is challenging to perform beam tracking in vehicular mmWave communication systems due to high mobility and narrow beams. In this paper, an adaptive beam tracking algorithm is proposed to improve the network throughput performance while reducing the training signal overhead. In particular, based on the mobility prediction at the base station (BS), a novel frame structure with dynamic bundled timeslots is designed. Moreover, an actor-critic reinforcement learning based algorithm is proposed to jointly optimize the beam width and the number of bundled timeslots, which makes the beam tracking adapt to the changing environment. Simulation results demonstrate that, compared with the traditional full scan and Kalman filter based beam tracking algorithms, the proposed algorithm can improve the time-averaged throughput by 11.34% and 24.86%, respectively. With the newly designed frame structure, it also outperforms beam tracking with the conventional frame structure, especially in scenarios with a large range of vehicle speeds.
Keywords: adaptive beam tracking; mobility prediction; dynamic bundled timeslot; variable beam width; reinforcement learning; actor-critic
6. Self-adaptive large neighborhood search algorithm for parallel machine scheduling problems (cited by: 7)
Authors: Pei Wang, Gerhard Reinelt, Yuejin Tan. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2012, No. 2, pp. 208-215 (8 pages).
Abstract: A self-adaptive large neighborhood search method for scheduling n jobs on m non-identical parallel machines with multiple time windows is presented. Another feature of the problem is oversubscription, namely not all jobs can be scheduled within the specified scheduling horizons due to the limited machine capacity. The objective is thus to maximize the overall profit of processed jobs while respecting machine constraints. A first-in-first-out heuristic is applied to find an initial solution, and then a large neighborhood search procedure is employed to relax and re-optimize cumbersome solutions. A machine learning mechanism is also introduced to converge on the most efficient neighborhoods for the problem. Extensive computational results are presented based on data from an application involving the daily observation scheduling of a fleet of earth observing satellites. The method rapidly solves most problem instances to optimality or near optimality and shows robust performance in sensitivity analysis.
Keywords: non-identical parallel machine scheduling problem with multiple time windows (NPMSPMTW); oversubscribed; self-adaptive large neighborhood search (SALNS); machine learning
7. Dynamic Distribution Adaptation Based Transfer Network for Cross Domain Bearing Fault Diagnosis (cited by: 4)
Authors: Yixiao Liao, Ruyi Huang, Jipu Li, Zhuyun Chen, Weihua Li. Chinese Journal of Mechanical Engineering (SCIE, EI, CAS, CSCD), 2021, No. 3, pp. 94-103 (10 pages).
Abstract: In machinery fault diagnosis, labeled data are always difficult or even impossible to obtain. Transfer learning can leverage related fault diagnosis knowledge from a fully labeled source domain to enhance the fault diagnosis performance in a sparsely labeled or unlabeled target domain, and has been widely used for cross domain fault diagnosis. However, existing methods focus on either marginal distribution adaptation (MDA) or conditional distribution adaptation (CDA). In practice, marginal and conditional distribution discrepancies both have significant but different influences on the domain divergence. In this paper, a dynamic distribution adaptation based transfer network (DDATN) is proposed for cross domain bearing fault diagnosis. DDATN utilizes the proposed instance-weighted dynamic maximum mean discrepancy (IDMMD) for dynamic distribution adaptation (DDA), which can dynamically estimate the influences of the marginal and conditional distributions and adapt the target domain to the source domain. The experimental evaluation on cross domain bearing fault diagnosis demonstrates that DDATN can outperform the state-of-the-art cross domain fault diagnosis methods.
Keywords: cross domain fault diagnosis; dynamic distribution adaptation; instance-weighted dynamic MMD; transfer learning
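Illustration for entry 7: the idea of weighting marginal against conditional distribution discrepancy can be sketched with a plain maximum mean discrepancy (MMD) estimate. The block below is a simplified sketch under stated assumptions, not the paper's IDMMD: it omits instance weighting, uses a fixed RBF kernel width `gamma`, takes the balance factor `mu` as an input rather than estimating it, and all function names are hypothetical.

```python
import math


def rbf_kernel_mean(xs, ys, gamma):
    """Mean RBF kernel value over all pairs (x, y), x in xs and y in ys."""
    total = 0.0
    for a in xs:
        for b in ys:
            d2 = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
            total += math.exp(-gamma * d2)
    return total / (len(xs) * len(ys))


def mmd2(xs, ys, gamma=1.0):
    """Biased estimate of the squared MMD between two sample sets."""
    return (rbf_kernel_mean(xs, xs, gamma)
            + rbf_kernel_mean(ys, ys, gamma)
            - 2.0 * rbf_kernel_mean(xs, ys, gamma))


def dynamic_distance(src, src_labels, tgt, tgt_pseudo_labels, mu, gamma=1.0):
    """(1 - mu) * marginal MMD + mu * sum of per-class (conditional) MMDs.

    mu in [0, 1] is an assumed, externally supplied balance factor;
    target labels are pseudo-labels because the target domain is unlabeled.
    """
    marginal = mmd2(src, tgt, gamma)
    conditional = 0.0
    for c in set(src_labels) & set(tgt_pseudo_labels):
        s_c = [x for x, l in zip(src, src_labels) if l == c]
        t_c = [x for x, l in zip(tgt, tgt_pseudo_labels) if l == c]
        conditional += mmd2(s_c, t_c, gamma)
    return (1.0 - mu) * marginal + mu * conditional
```

With `mu = 0` the distance reduces to marginal adaptation only and with `mu = 1` to conditional adaptation only; a dynamic scheme would update `mu` during training from the observed discrepancies.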
8. Dynamic Intelligent Supply-Demand Adaptation Model Towards Intelligent Cloud Manufacturing
Authors: Yanfei Sun, Feng Qiao, Wei Wang, Bin Xu, Jianming Zhu, Romany Fouad Mansour, Jin Qi. Computers, Materials & Continua (SCIE, EI), 2022, No. 8, pp. 2825-2843 (19 pages).
Abstract: As a new mode and means of smart manufacturing, smart cloud manufacturing (SCM) faces great challenges in massive supply and demand, dynamic resource collaboration, and intelligent adaptation. To address the problem, this paper proposes an SCM-oriented dynamic supply-demand (SD) intelligent adaptation model for massive manufacturing services. In this model, a collaborative network model is established based on the properties of both supply and demand and their relationships. In addition, an algorithm based on deep graph clustering (DGC) and aligned sampling (AS) is used to divide and conquer the large adaptation domain, which addresses the slow computation caused by the high complexity of spatiotemporal search in the collaborative network model. At the same time, an intelligent supply-demand adaptation method driven by quality of service (QoS) is established, in which adaptation experience is shared among adaptation subdomains through deep reinforcement learning (DRL) powered by a transfer mechanism to improve the poor adaptation results caused by dynamic uncertainty. The results show that the model and the solution proposed in this paper can perform collaborative and intelligent supply-demand adaptation for the massive and dynamic resources in SCM through autonomous learning and can effectively perform global supply-demand matching and optimal resource allocation.
Keywords: smart cloud manufacturing; supply and demand sides; dynamic adaptation; deep graph clustering; transfer learning; reinforcement learning
9. Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control (cited by: 9)
Authors: Mingming Ha, Ding Wang, Derong Liu. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2022, No. 7, pp. 1262-1272 (11 pages).
Abstract: The core task of tracking control is to make the controlled plant track a desired trajectory. The traditional performance index used in previous studies cannot completely eliminate the tracking error as the number of time steps increases. In this paper, a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem. Unlike the regulator problem, the iterative value function of the tracking control problem cannot be regarded as a Lyapunov function. A novel stability analysis method is developed to guarantee that the tracking error converges to zero. The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated. Finally, the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches.
Keywords: adaptive critic design; adaptive dynamic programming (ADP); approximate dynamic programming; discrete-time nonlinear systems; reinforcement learning; stability analysis; tracking control; value iteration (VI)
10. Advanced Policy Learning Near-Optimal Regulation (cited by: 3)
Authors: Ding Wang, Xiangnan Zhong. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2019, No. 3, pp. 743-749 (7 pages).
Abstract: Designing advanced techniques for feedback stabilization and optimization of complex systems is important to the modern control field. In this paper, a near-optimal regulation method for general nonaffine dynamics is developed with the help of policy learning. To address the nonaffine nonlinearity, a pre-compensator is constructed so that the augmented system can be formulated in an affine-like form. Different cost functions are defined for the original and transformed controlled plants, and their relationship is then analyzed in detail. Additionally, an adaptive critic algorithm with a stability guarantee is employed to solve the augmented optimal control problem. Finally, several case studies are conducted to verify the stability, robustness, and optimality of a torsional pendulum plant with a suitable cost.
Keywords: adaptive critic algorithm; learning control; neural approximation; nonaffine dynamics; optimal regulation
11. Data-Based Feedback Relearning Algorithm for Robust Control of SGCMG Gimbal Servo System with Multi-source Disturbance (cited by: 3)
Authors: ZHANG Yong, MU Chaoxu, LU Ming. Transactions of Nanjing University of Aeronautics and Astronautics (EI, CSCD), 2021, No. 2, pp. 225-236 (12 pages).
Abstract: The single gimbal control moment gyroscope (SGCMG), with high precision and fast response, is an important attitude control system for high-precision docking, rapid maneuvering navigation, and guidance systems in the aerospace field. In this paper, considering the influence of multi-source disturbance, a data-based feedback relearning (FR) algorithm is designed for the robust control of the SGCMG gimbal servo system. Based on adaptive dynamic programming and the least-squares principle, the FR algorithm is used to obtain the servo control strategy by collecting the online operation data of the SGCMG system. This is a model-free learning strategy in which no prior knowledge of the SGCMG model is required. Then, combined with the reinforcement learning mechanism, the servo control strategy interacts with the system dynamics of the SGCMG, realizing adaptive evaluation and improvement of the servo control strategy against the multi-source disturbance. Meanwhile, a data redistribution method based on experience replay is designed to reduce data correlation and thereby improve algorithm stability and data utilization efficiency. Finally, the effectiveness of the proposed servo control strategy is verified by comparison with other methods on the simulation model of the SGCMG.
Keywords: control moment gyroscope; feedback relearning algorithm; servo control; reinforcement learning; multi-source disturbance; adaptive dynamic programming
12. Online Learning Control for Harmonics Reduction Based on Current Controlled Voltage Source Power Inverters (cited by: 2)
Authors: Naresh Malla, Ujjwol Tamrakar, Dipesh Shrestha, Zhen Ni, Reinaldo Tonkoski. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2017, No. 3, pp. 447-457 (11 pages).
Abstract: Nonlinear loads in the power distribution system cause non-sinusoidal currents and voltages with harmonic components. Shunt active filters (SAF) with current controlled voltage source inverters (CCVSI) are usually used to obtain balanced and sinusoidal source currents by injecting compensation currents. However, CCVSI with traditional controllers have limited transient and steady-state performance. In this paper, we propose an adaptive dynamic programming (ADP) controller with online learning capability to improve transient response and harmonics. The proposed controller works alongside existing proportional integral (PI) controllers to efficiently track the reference currents in the d-q domain. It can generate adaptive control actions to compensate the PI controller. The proposed system was simulated under different nonlinear (three-phase full wave rectifier) load conditions. The performance of the proposed approach was compared with the traditional approach. We have also included the simulation results without connecting the traditional PI control based power inverter for reference comparison. The online learning based ADP controller not only reduced average total harmonic distortion by 18.41%, but also outperformed traditional PI controllers during transients.
Keywords: adaptive dynamic programming (ADP); current controlled voltage source power inverter (CCVSI); online learning based controller; neural networks; shunt active filter (SAF); total harmonic distortion (THD)
13. PDP: Parallel Dynamic Programming (cited by: 15)
Authors: Fei-Yue Wang, Jie Zhang, Qinglai Wei, Xinhu Zheng, Li Li. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2017, No. 1, pp. 1-5 (5 pages).
Abstract: Deep reinforcement learning is a focal research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive dynamic programming (ADP) is first presented instead of direct dynamic programming (DP), and the inherent relationship between ADP and deep reinforcement learning is developed. Next, analytics intelligence, as the necessary requirement for real reinforcement learning, is discussed. Finally, the principle of parallel dynamic programming, which integrates dynamic programming and analytics intelligence, is presented as the future of computational intelligence. © 2014 Chinese Association of Automation.
Keywords: artificial intelligence; neural networks; reinforcement learning
14. Consensus control for heterogeneous uncertain multi-agent systems with hybrid nonlinear dynamics via iterative learning algorithm (cited by: 1)
Authors: XIE Jin, CHEN JiaXi, LI JunMin, CHEN WeiSheng, ZHANG Shuai. Science China (Technological Sciences) (SCIE, EI, CAS, CSCD), 2023, No. 10, pp. 2897-2906 (10 pages).
Abstract: In this study, we propose a compensated distributed adaptive learning algorithm for heterogeneous multi-agent systems with repetitive motion, where the leader's dynamics are unknown and the controlled system's parameters are uncertain. The multi-agent systems are treated as hybrid-order nonlinear systems, which relaxes the strict requirement in some existing work that all agents be of the same order. For the theoretical analysis, we design a composite energy function with virtual gain parameters to relax the restriction that the controller gain depends on global information. To ensure the stability of the controller, we introduce a smooth continuous function to improve the piecewise controller and avoid possible chattering. Theoretical analyses prove the convergence of the presented algorithm, and simulation experiments verify its effectiveness.
Keywords: multi-agent systems; adaptive iterative learning control; hybrid nonlinear dynamics; composite energy function; consensus algorithm
15. Dynamic Movement Primitives Based Robot Skills Learning (cited by: 1)
Authors: Ling-Huan Kong, Wei He, Wen-Shi Chen, Hui Zhang, Yao-Nan Wang. Machine Intelligence Research (EI, CSCD), 2023, No. 3, pp. 396-407 (12 pages).
Abstract: In this article, a robot skills learning framework is developed, which considers both motion modeling and execution. To enable the robot to learn skills from demonstrations, a learning method called dynamic movement primitives (DMPs) is introduced to model motion. A staged teaching strategy is integrated into the DMP framework to enhance generality, so that complicated tasks can also be performed by multi-joint manipulators. The DMP connection method is used to make an accurate and smooth transition in position and velocity space when connecting complex motion sequences. In addition, motions are categorized by goal and duration. Furthermore, an adaptive neural network (NN) control method is proposed to achieve highly accurate trajectory tracking and to ensure the performance of action execution, which improves the reliability of the skills learning system. An experiment on the Baxter robot verifies the effectiveness of the proposed method.
Keywords: dynamic movement primitives (DMPs); trajectory tracking control; robot learning from demonstrations; neural networks (NNs); adaptive control
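Illustration for entry 15: a dynamic movement primitive models a motion as a stable spring-damper system pulled toward a goal and shaped by a learned forcing term that fades with a phase variable. The block below is a minimal one-dimensional sketch of the commonly used discrete DMP formulation, integrated with explicit Euler steps. It is not the staged-teaching or DMP-connection method of the article, and the function name, gains, basis width, and weights are all assumptions chosen for illustration.

```python
import math


def rollout_dmp(y0, goal, weights=None, tau=1.0, dt=0.001, duration=3.0,
                alpha=25.0, alpha_x=4.0, width=20.0):
    """Integrate a 1-D discrete DMP and return the position trajectory.

    tau * dy/dt = z
    tau * dz/dt = alpha * (beta * (goal - y) - z) + f(x)
    tau * dx/dt = -alpha_x * x        (phase variable, x(0) = 1)
    """
    beta = alpha / 4.0  # critical damping of the spring-damper part
    weights = list(weights or [])
    n = len(weights)
    # Basis centers spread along the decaying phase variable (assumed layout).
    centers = [math.exp(-alpha_x * i / max(n - 1, 1)) for i in range(n)]
    y, z, x = y0, 0.0, 1.0
    trajectory = [y]
    for _ in range(round(duration / dt)):
        psi = [math.exp(-width * (x - c) ** 2) for c in centers]
        total = sum(psi)
        # Normalized weighted basis, gated by x so the forcing term vanishes.
        f = 0.0
        if total > 0.0:
            f = sum(w * p for w, p in zip(weights, psi)) / total
            f *= x * (goal - y0)
        dz = (alpha * (beta * (goal - y) - z) + f) / tau
        dy = z / tau
        dx = -alpha_x * x / tau
        z += dz * dt
        y += dy * dt
        x += dx * dt
        trajectory.append(y)
    return trajectory
```

In a learning-from-demonstration setting the `weights` would be fitted to a recorded trajectory, for example by regression on the forcing term; with any fixed weights the rollout still settles at `goal` because the phase variable, and with it the forcing term, decays to zero.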
16. State of the Art of Adaptive Dynamic Programming and Reinforcement Learning
Authors: Derong Liu, Mingming Ha, Shan Xue. CAAI Artificial Intelligence Research, 2022, No. 2, pp. 93-110 (18 pages).
Abstract: This article introduces the state-of-the-art development of adaptive dynamic programming and reinforcement learning (ADPRL). First, algorithms in reinforcement learning (RL) are introduced and their roots in dynamic programming are illustrated. Adaptive dynamic programming (ADP) is then introduced following a brief discussion of dynamic programming. Researchers in ADP and RL have enjoyed the fast developments of the past decade, from algorithms to convergence and optimality analyses and to stability results. Several key steps in the recent theoretical developments of ADPRL are mentioned, with some future perspectives. In particular, convergence and optimality results of value iteration and policy iteration are reviewed, followed by an introduction to the most recent results on stability analysis of value iteration algorithms.
Keywords: adaptive dynamic programming; approximate dynamic programming; adaptive critic designs; neuro-dynamic programming; neural dynamic programming; reinforcement learning; intelligent control; learning control; optimal control
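17. Sensor-based activity recognition in multiple scenarios (cited by: 2)
Authors: An Jian (安健), Cheng Yusen (程宇森), Gui Xiaolin (桂小林), Dai Huijun (戴慧珺). Computer Engineering and Design (《计算机工程与设计》; PKU Core), 2024, No. 1, pp. 244-251 (8 pages).
Abstract: To address the problem that sensor-based activity recognition tasks are limited to a single, fixed recognition scenario, a sensor-based activity recognition transfer model for multiple scenarios is proposed. It consists of two parts, a sensor-based dynamic perception algorithm (DPA) and an adaptive scene human recognition (AHR) transfer method, and it addresses both the dependence on sensors in a fixed scenario and the failure of the recognition model when the scenario changes. DPA introduces a two-stage transfer mode that advances the activity recognition stage and the model transfer stage in parallel, so that the model retains its recognition ability after changes to the sensors occur. The AHR scene transfer method is further proposed to give the model activity recognition ability across multiple scenarios. Experiments verify that the model has better adaptability and scalability.
Keywords: sensors; activity recognition; transfer learning; dynamic perception algorithm; adaptive scene; two-stage transfer mode; scene switching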
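18. From pipe segments to pipe networks: research progress in pipeline leak diagnosis technology (cited by: 1)
Authors: Zhang Huaguang (张化光), Wang Tianbiao (王天彪), Hu Xuguang (胡旭光), Ma Dazhong (马大中), Liu Jinhai (刘金海). Control Engineering of China (《控制工程》; CSCD, PKU Core), 2024, No. 6, pp. 961-972 (12 pages).
Abstract: Pipeline leak diagnosis technology plays a vital role in ensuring the safe operation of pipeline systems. First, the structure of a pipeline leak diagnosis system is introduced, and the trend from leak diagnosis of single pipe segments toward complex pipe networks is pointed out. The literature is then reviewed from three aspects: traditional data-driven leak detection methods, pipeline leak signal source localization techniques, and deep-learning-based leak detection methods for complex pipe networks. The advantages, limitations, and scope of application of the different methods are analyzed. Finally, it is pointed out that, as pipe network systems grow more complex, the limitations of traditional methods are becoming apparent, and that deep-learning-based diagnosis of weak leaks in complex pipe networks, multi-source signal fusion, and pipe network intelligence will become future research trends.
Keywords: pipeline leak diagnosis; deep learning; complex pipe networks; adaptive dynamic programming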
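19. A transient stability assessment model update framework based on deep adaptation network transfer
Authors: Li Nan (李楠), Zhang Shuai (张帅), Hu Yuxian (胡禹先), Sui Xiang (隋想). Power System Protection and Control (《电力系统保护与控制》; EI, CSCD, PKU Core), 2024, No. 14, pp. 25-35 (11 pages).
Abstract: This work addresses the adaptability of transient stability assessment models after the operating mode or topology of a power system changes. Conventional feature transfer learning methods focus mainly on reducing the distance between the conditional or marginal distributions of the source-domain and target-domain datasets, but they cannot quantitatively evaluate the contribution of these two distributions across domains, which leads to unsatisfactory model transfer performance. To address this problem, the SENet attention mechanism and a dynamic distribution adaptation algorithm are introduced to build a deep adaptation network framework, based on SEDDAN transfer, for updating transient stability assessment models. Improvements are made at two levels, feature extraction and dynamic adjustment of the distribution weights between domains, which further improves the transfer performance and adaptability of the assessment model. In tests on the IEEE 39-bus and IEEE 140-bus systems, simulation results show that the proposed model has certain advantages in assessment accuracy after updating, adaptability, and transfer performance.
Keywords: power system; assessment; transfer learning; attention mechanism; dynamic distribution adaptation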
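20. Adaptive optimal containment control for nonlinear multi-agent systems with time-varying output constraints
Authors: Zhang Tianping (张天平), Liu Tao (刘涛), Zhang Enze (章恩泽). Control Theory & Applications (《控制理论与应用》; EI, CAS, CSCD, PKU Core), 2024, No. 10, pp. 1899-1912 (14 pages).
Abstract: This paper proposes an optimal containment control method for uncertain strict-feedback nonlinear multi-agent systems with time-varying output constraints and unmodeled dynamics. A novel integral barrier Lyapunov function is used to handle the output constraints, a dynamic signal is used to handle the unmodeled dynamics, and the dynamic surface control method is used to design the feedforward controller. The optimal feedback controller is designed by combining adaptive dynamic programming with integral reinforcement learning, with neural networks approximating the corresponding cost functions online and weight update laws designed accordingly. Theoretical analysis proves that the outputs of all followers converge to the convex hull spanned by the leaders and that the closed-loop system formed by all followers is semi-globally uniformly ultimately bounded. Meanwhile, the outputs of the followers remain within the given constraint set and the cost function is minimized. Simulation results verify the effectiveness of the proposed method.
Keywords: adaptive dynamic programming; integral reinforcement learning; optimal control; dynamic surface control; integral barrier Lyapunov function; multi-agent systems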