摘要
针对非仿射系统的跟踪控制问题,提出一种基于执行依赖启发式动态规划(ADHDP)结构的在线设计方法。首先,考虑一类未知非仿射系统,进行从跟踪控制问题到误差调节问题的转换;然后,设计基于ADHDP结构的跟踪控制器,并采用在线学习的方式,实现系统控制与执行网络和评判网络的训练同步,使得系统状态能够跟踪期望轨迹;最后,通过一个仿真实例验证所提方法的有效性。
To solve the tracking control problem for nonaffine systems,an online design method was developed by using the action dependent heuristic dynamic programming(ADHDP)structure.Firstly,the tracking control problem for the unknown nonaffine system was transformed into the error regulation problem.Then,the ADHDP tracking controller was designed and the online learning method was adopted to synchronize the system control with the training of action networks and critic networks,so that the desired trajectory could be tracked by the system state.Finally,a simulation example was given to verify the effectiveness of the proposed method.
作者
赵慧玲
王鼎
任进
ZHAO Huiling;WANG Ding;REN Jin(Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China;Beijing Key Laboratory of Computational Intelligence and Intelligent System,Beijing 100124,China;Beijing Institute of Artificial Intelligence,Beijing 100124,China;Beijing Laboratory of Smart Environmental Protection,Beijing 100124,China)
出处
《智能科学与技术学报》
2021年第4期449-455,共7页
Chinese Journal of Intelligent Science and Technology
基金
国家自然科学基金资助项目(No.61773373,No.61890930-5,No.62021003)
国家重点研发计划资助项目(No.2021ZD0112300-2,No.2018YFC1900800-5)
北京市自然科学基金资助项目(No.JQ19013)。
关键词
跟踪控制
在线学习
执行依赖设计
tracking control
online learning
action dependent design