期刊文献+

基于梯度算法的跟踪最优控制器设计及仿真

Optimal Tracking Control Based on Gradient Estimation Algorithm
下载PDF
导出
摘要 应用自适应梯度算法和自适应动态规划方法,在线求解非线性系统的最优跟踪控制。首先对所求非线性系统给定性能指标,其次根据系统和性能指标建立哈密尔顿函数,再用神经网络逼近性能指标,然后用另一个神经网络逼近近似最优控制,神经网络权重参数应用自适应梯度算法在线进行估计,最后基于所求结果以及所设计的稳态控制和鲁棒项,求得系统鲁棒最优跟踪控制,对参数收敛性和系统稳定性进行了详细分析。仿真结果表明了本文所提出方法的有效性。 Based on the gradient estimation algorithm and adaptive dynamic programming, this paper solved the optimal control problem online. At first, regarding to the nonlinear system, a performance index was proposed. Then a Hamiltonian(HJB) function was constructed and a neural network(NN) was used to approximate the performance index. Another neural network was proposed to approach the actor, and both critic and actor NN weights are estimated based on gradient estimation online and simultaneously. Furthermore, steady-state control and robust term were designed to obtain the robust optimal control. At last, simulation results proved the effectiveness of the proposed methods.
出处 《计算机与现代化》 2016年第12期34-37,共4页 Computer and Modernization
关键词 自适应动态规划 梯度估计 跟踪控制 最优控制 Adaptive Dynamic Program(ADP) gradient algorithm tracking control optimal control
  • 相关文献

参考文献3

二级参考文献24

  • 1周美娇,李相林.二次型单神经元PSD控制器及仿真(英文)[J].仪器仪表学报,2003,24(z2):553-555. 被引量:2
  • 2陈之启.基于二次型优化空调PID-DDC系统控制器参数[J].控制工程,2005,12(2):112-115. 被引量:7
  • 3陈宗海,文锋,王智灵.基于自适应评价的非线性系统神经网络控制[J].控制与决策,2007,22(7):765-768. 被引量:3
  • 4B. R. E, Dynamic programming, Princeton: Princeton Uni versity Press, 1957.
  • 5SUTTON R S,BARTO A G. Reinforcement learning: an introduction. Cambridge Univ Press, 1998.
  • 6WERBOS P J. Approximate dynamic programming for real-- time control and neural modeling, Handbook of intelligent control: Neural[J]. fuzzy, and adaptive approaches, 1992, 15: 493--525.
  • 7DREYFUS S E,LAW A M. Art and theory of dynamic pro- gramming[M]. New York: Academic Press, 1977,56.
  • 8MURRAY J J,COX C J,LENDARIS G G, et al. Adaptive dynamic programming, Systems, Man, and Cybernetics, Part C= Applications and Reviews[J]. IEEE Transactions on, 2002, 32(2): 140-153.
  • 9WERBOS P J. A menu of designs for reinforcement learning over time[J]. Neural networks for control, 1990 : 67-95.
  • 10ABU-KHALAF M,LEWIS F L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach[J]. Automatiea, 2005, 41(5) : 779-- 791.

共引文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部