期刊文献+

Approximate Dynamic Programming for Self-Learning Control 被引量:14

Approximate Dynamic Programming for Self-Learning Control
下载PDF
导出
摘要 This paper introduces a self-learning control approach based on approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950's for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, the applications of dynamic programming have been limited to simple and small problems. The key step in finding approximate solutions to dynamic programming is to estimate the performance index in dynamic programming. The optimal control signal can then be determined by minimizing (or maximizing) the performance index. Artificial neural networks are very efficient tools in representing the performance index in dynamic programming. This paper assumes the use of neural networks for estimating the performance index in dynamic programming and for generating optimal control signals, thus to achieve optimal control through self-learning. This paper introduces a self-learning control approach based on approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950's for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, the applications of dynamic programming have been limited to simple and small problems. The key step in finding approximate solutions to dynamic programming is to estimate the performance index in dynamic programming. The optimal control signal can then be determined by minimizing (or maximizing) the performance index. Artificial neural networks are very efficient tools in representing the performance index in dynamic programming. This paper assumes the use of neural networks for estimating the performance index in dynamic programming and for generating optimal control signals, thus to achieve optimal control through self-learning.
作者 DerongLiu
出处 《自动化学报》 EI CSCD 北大核心 2005年第1期13-18,共6页 Acta Automatica Sinica
基金 Supported by the National Science Foundation (U.S.A.) under Grant ECS-0355364
关键词 近似动态程序 自学习控制 神经网络 人工智能 Approximate dynamic programming, learning control, neural networks
  • 相关文献

参考文献36

  • 1Bellman R E. Dynamic Programming, Princeton, N J: Princeton University Press- 1957.
  • 2Dreyfus S E, Law A M. The Art and Theory of Dynamic Programming, New York, NY: Academic Press,1977.
  • 3Lewis F L, Syrmos V L. Optimal Control, New York, NY: John Wiley, 1995.
  • 4Balakrishnan S N, Biega V. Adaptive-critic-based neural networks for aircraft optimal control, Journal of Guidance, Control, Dynamics, 1996, 19(7-8): 893--898.
  • 5Prokhorov D V, Wunsch D C. Adaptive critic designs, IEEE Transactions on Neural Networks, 1997, 8(9):997--1007.
  • 6Si J, Wang Y-T. On-line learning control by association and reinforcement, IEEE Transactions on Neural Networks, 2001, 12(3): 264-276.
  • 7Werbos P J. Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research, IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-17, 1987,7-20.
  • 8Werbos P J. A menu of designs for reinforcement learning over time, In: Neural Networks for Control (Chapter3), Edited by W. T. Miller, R. S. Sutton, and P. J. Werbos, Cambridge, MA: The MIT Press, 1990.
  • 9Werbos P J. Approximate dynmic programming for real-time control and neural modeling, In: Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches (Chapter 13), Edited by D. A. White and D.A. Sofge, New York, NY: Van Nostrand Reinhold, 1992.
  • 10Werbos P J. Aduanced forecasting methods for global crisis warning and models of intelligence, General Systems Yearbook, 1977, 22:25-38.

同被引文献90

引证文献14

二级引证文献148

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部