期刊文献+

Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs 被引量:8

Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs
下载PDF
导出
出处 《自动化学报》 EI CSCD 北大核心 2009年第6期682-692,共11页 Acta Automatica Sinica
基金 Supported by National High Technology Research and Development Program of China (863 Program) (2006AA04Z183), National Natural Science Foundation of China (60621001, 60534010, 60572070, 60774048, 60728307), Program for Changjiang Scholars and Innovative Research Groups of China (60728307, 4031002)
关键词 自适应系统 最优控制 离散时间 自动化系统 Adaptive critic designs (ACD), optimal control, zero-sum game, 2-D system, neural networks
  • 相关文献

二级参考文献65

  • 1年晓红.Suboptimal Strategies of Linear Quadratic Closed-loop Differential Games: An BMI Approach[J].自动化学报,2005,31(2):216-222. 被引量:7
  • 2Seong C-Y, Widrow B. Neural dynamic optimization for control systems-Part Ⅲ: Applications, IEEE Transactions on Systems, Man, and Cybernetics Part B: Cybernetics, 2001, 31(8): 502-513.
  • 3Bellman R E. Dynamic Programming, Princeton, N J: Princeton University Press- 1957.
  • 4Dreyfus S E, Law A M. The Art and Theory of Dynamic Programming, New York, NY: Academic Press,1977.
  • 5Lewis F L, Syrmos V L. Optimal Control, New York, NY: John Wiley, 1995.
  • 6Balakrishnan S N, Biega V. Adaptive-critic-based neural networks for aircraft optimal control, Journal of Guidance, Control, Dynamics, 1996, 19(7-8): 893--898.
  • 7Prokhorov D V, Wunsch D C. Adaptive critic designs, IEEE Transactions on Neural Networks, 1997, 8(9):997--1007.
  • 8Si J, Wang Y-T. On-line learning control by association and reinforcement, IEEE Transactions on Neural Networks, 2001, 12(3): 264-276.
  • 9Werbos P J. Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research, IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-17, 1987,7-20.
  • 10Werbos P J. A menu of designs for reinforcement learning over time, In: Neural Networks for Control (Chapter3), Edited by W. T. Miller, R. S. Sutton, and P. J. Werbos, Cambridge, MA: The MIT Press, 1990.

共引文献25

同被引文献23

  • 1张友,井元伟,张嗣瀛.基于观测器的线性中立时滞系统的H_∞控制[J].控制与决策,2004,19(10):1137-1141. 被引量:13
  • 2王宝华,杨成梧,张强.TCSC自适应逆推控制器设计[J].电力自动化设备,2005,25(4):59-61. 被引量:14
  • 3韩京清.一类不确定对象的扩张状态观测器[J].控制与决策,1995,10(1):85-88. 被引量:422
  • 4陈宗海,文锋,王智灵.基于自适应评价的非线性系统神经网络控制[J].控制与决策,2007,22(7):765-768. 被引量:3
  • 5Vamvoudakis K G, Lewis F L. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem[J]. Automatica, 2010, 46(5): 878-888.
  • 6Zhang H G, Luo Y H, Liu D. Neural-network-based near- optimal control for a class of discrete-time affine nonlinear systems with control constraints[J]. IEEE Trans on Neural Networks, 2009, 20(9): 1490-1503.
  • 7Dierks T, Jagannathan S. Optimal Control of Affine Nonlinear Continuous-time Systems Using an Online Hamilton-Jacobi-Isaacs Formulation[C]. Proc of IEEE Conf on Decision and Control. New York: IEEE Press, 2010: 3047-3053.
  • 8Werbos P J. Approximate dynamic programming for real- time control and neural modeling[C]. Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. New York: Van Nostrand Reinhold, 1992.
  • 9Liu D, Javaherian H, Tandale M D, et al. Adaptive critic learning technique for engine torque and air-fuel ratio control[J]. IEEE Trans on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008, 38(4): 988-993.
  • 10Venayagamoorthy G K, Harley R G, Wunsch D C. Dual heuristic programming excitation neurocontrol for generation in a multi-machine power system[J]. IEEE Trans on Industry Applications, 2003, 39(2): 382-394.

引证文献8

二级引证文献120

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部