Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs 被引量：8

Data-based Optimal Control for Discrete-time Zero-sum Games of 2-D Systems Using Adaptive Critic Designs

下载PDF

导出

作者 WEI Qing-Lai ZHANG Hua-Guang CUI Li-Li

机构地区 The Key Laboratory of Complex Systems and Intelligence Science The School of Information Science and Engineering

出处《自动化学报》 EI CSCD 北大核心 2009年第6期682-692,共11页 Acta Automatica Sinica

基金 Supported by National High Technology Research and Development Program of China （863 Program）（2006AA04Z183）, National Natural Science Foundation of China （60621001, 60534010, 60572070, 60774048, 60728307）, Program for Changjiang Scholars and Innovative Research Groups of China （60728307, 4031002）

关键词自适应系统最优控制离散时间自动化系统 Adaptive critic designs （ACD）, optimal control, zero-sum game, 2-D system, neural networks

分类号 TP273.2 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献4

1年晓红.Suboptimal Strategies of Linear Quadratic Closed-loop Differential Games: An BMI Approach[J].自动化学报,2005,31(2):216-222. 被引量：7
2XU Jian-Ming YU Li.H_∞ Control for 2-D Discrete State Delayed Systems in the Second FM Model[J].自动化学报,2008,34(7):809-813. 被引量：3
3年晓红,曹莉.基于微分对策的最优状态观测器和最优状态反馈控制器的设计[J].自动化学报,2006,32(5):807-812. 被引量：5
4DerongLiu.Approximate Dynamic Programming for Self-Learning Control[J].自动化学报,2005,31(1):13-18. 被引量：14

二级参考文献65

1年晓红.Suboptimal Strategies of Linear Quadratic Closed-loop Differential Games: An BMI Approach[J].自动化学报,2005,31(2):216-222. 被引量：7
2Seong C-Y, Widrow B. Neural dynamic optimization for control systems-Part Ⅲ: Applications, IEEE Transactions on Systems, Man, and Cybernetics Part B: Cybernetics, 2001, 31(8): 502-513.
3Bellman R E. Dynamic Programming, Princeton, N J: Princeton University Press- 1957.
4Dreyfus S E, Law A M. The Art and Theory of Dynamic Programming, New York, NY: Academic Press,1977.
5Lewis F L, Syrmos V L. Optimal Control, New York, NY: John Wiley, 1995.
6Balakrishnan S N, Biega V. Adaptive-critic-based neural networks for aircraft optimal control, Journal of Guidance, Control, Dynamics, 1996, 19(7-8): 893--898.
7Prokhorov D V, Wunsch D C. Adaptive critic designs, IEEE Transactions on Neural Networks, 1997, 8(9):997--1007.
8Si J, Wang Y-T. On-line learning control by association and reinforcement, IEEE Transactions on Neural Networks, 2001, 12(3): 264-276.
9Werbos P J. Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research, IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-17, 1987,7-20.
10Werbos P J. A menu of designs for reinforcement learning over time, In: Neural Networks for Control (Chapter3), Edited by W. T. Miller, R. S. Sutton, and P. J. Werbos, Cambridge, MA: The MIT Press, 1990.

共引文献25

1年晓红,曹莉.基于微分对策的最优状态观测器和最优状态反馈控制器的设计[J].自动化学报,2006,32(5):807-812. 被引量：5
2陈宗海,文锋.基于复杂过程简化模型的DHP学习控制[J].控制与决策,2006,21(10):1087-1091. 被引量：2
3NIAN Xiao-Hong CAO Li.BMI Approach to the Interconnected Stability and Cooperative Control of Linear Systems[J].自动化学报,2008,34(4):438-444. 被引量：8
4Yanhong Luo Huaguang Zhang.Approximate optimal control for a class of nonlinear discrete-time systems with saturating actuators[J].Progress in Natural Science:Materials International,2008,18(8):1023-1029. 被引量：2
5赵冬斌,刘德荣,易建强.基于自适应动态规划的城市交通信号优化控制方法综述[J].自动化学报,2009,35(6):676-681. 被引量：39
6肖军,白静.状态反馈最优控制器设计及仿真[J].鞍山师范学院学报,2009,11(4):58-61. 被引量：1
7罗艳红,张化光,曹宁,陈兵.一类控制受约束非线性系统的基于单网络贪婪迭代DHP算法的近似最优镇定[J].自动化学报,2009,35(11):1436-1445. 被引量：1
8WEI Qing-Lai,ZHANG Hua-Guang,LIU De-Rong,ZHAO Yan.An Optimal Control Scheme for a Class of Discrete-time Nonlinear Systems with Time Delays Using Adaptive Dynamic Programming[J].自动化学报,2010,36(1):121-129. 被引量：17
9XIE Xiang-Peng,ZHANG Hua-Guang.Stabilization of Discrete-time 2-D T-S Fuzzy Systems Based on New Relaxed Conditions[J].自动化学报,2010,36(2):267-273. 被引量：2
10康琦,汪镭,安静,吴启迪.基于近似动态规划的微粒群系统参数优化研究[J].自动化学报,2010,36(8):1171-1181. 被引量：4

同被引文献23

1张友,井元伟,张嗣瀛.基于观测器的线性中立时滞系统的H_∞控制[J].控制与决策,2004,19(10):1137-1141. 被引量：13
2王宝华,杨成梧,张强.TCSC自适应逆推控制器设计[J].电力自动化设备,2005,25(4):59-61. 被引量：14
3韩京清.一类不确定对象的扩张状态观测器[J].控制与决策,1995,10(1):85-88. 被引量：422
4陈宗海,文锋,王智灵.基于自适应评价的非线性系统神经网络控制[J].控制与决策,2007,22(7):765-768. 被引量：3
5Vamvoudakis K G, Lewis F L. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem[J]. Automatica, 2010, 46(5): 878-888.
6Zhang H G, Luo Y H, Liu D. Neural-network-based near- optimal control for a class of discrete-time affine nonlinear systems with control constraints[J]. IEEE Trans on Neural Networks, 2009, 20(9): 1490-1503.
7Dierks T, Jagannathan S. Optimal Control of Affine Nonlinear Continuous-time Systems Using an Online Hamilton-Jacobi-Isaacs Formulation[C]. Proc of IEEE Conf on Decision and Control. New York: IEEE Press, 2010: 3047-3053.
8Werbos P J. Approximate dynamic programming for real- time control and neural modeling[C]. Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. New York: Van Nostrand Reinhold, 1992.
9Liu D, Javaherian H, Tandale M D, et al. Adaptive critic learning technique for engine torque and air-fuel ratio control[J]. IEEE Trans on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008, 38(4): 988-993.
10Venayagamoorthy G K, Harley R G, Wunsch D C. Dual heuristic programming excitation neurocontrol for generation in a multi-machine power system[J]. IEEE Trans on Industry Applications, 2003, 39(2): 382-394.

引证文献8

1崔黎黎,张化光,罗艳红.控制方向未知的非线性系统的自适应评价设计[J].浙江大学学报（工学版）,2012,46(5):853-857. 被引量：2
2张吉烈,张化光,罗艳红,梁洪晶.基于广义模糊双曲模型的自适应动态规划最优控制设计[J].自动化学报,2013,39(2):142-149. 被引量：11
3张化光,张欣,罗艳红,杨珺.自适应动态规划综述[J].自动化学报,2013,39(4):303-311. 被引量：80
4乔俊飞,薄迎春,韩广.基于ESN的多指标DHP控制策略在污水处理过程中的应用[J].自动化学报,2013,39(7):1146-1151. 被引量：18
5SONG Rui-Zhuo XIAO Wen-Dong SUN Chang-Yin.Optimal Tracking Control for a Class of Unknown Discrete-time Systems with Actuator Saturation via Data-based ADP Algorithm[J].自动化学报,2013,39(9):1413-1420. 被引量：4
6崔黎黎,刘杰,张勇.基于单网络ADP的一类未知非线性系统的近似最优控制[J].控制与决策,2013,28(9):1423-1426. 被引量：3
7崔小红,王缔,刘素兵,柴宝杰.基于近似动态规划方法的未知系统的最优跟踪控制[J].数学的实践与认识,2015,45(11):266-274. 被引量：2
8Yanhong Luo,Shengnan Zhao,Dongsheng Yang,Huaguang Zhang.A New Robust Adaptive Neural Network Backstepping Control for Single Machine Infinite Power System With TCSC[J].IEEE/CAA Journal of Automatica Sinica,2020,7(1):48-56. 被引量：4

二级引证文献120

1Haishan XU,Fucheng LIAO.Optimal Tracking Control for Discrete-time Systems with Time-delay Based on the Preview Control Method[J].Journal of Systems Science and Information,2019,10(5):452-461.
2刘富,安毅,董博,李元春.基于ADP的可重构机械臂能耗保代价分散最优控制[J].吉林大学学报（工学版）,2020,50(1):342-350. 被引量：4
3蓝雯飞,吴子莹,李强,强小利.动态规划算法的时间效率改进[J].中南民族大学学报（自然科学版）,2016,35(2):135-140. 被引量：6
4会国涛,张化光,汪刚,解相朋,吴振宁.模糊双曲正切模型研究综述[J].自动化学报,2013,39(11):1849-1857. 被引量：3
5刘德荣,李宏亮,王鼎.基于数据的自学习优化控制:研究进展与展望[J].自动化学报,2013,39(11):1858-1870. 被引量：22
6薄迎春,夏伯锴.催化剂窑炉温度的启发式动态规划控制[J].化工学报,2013,64(12):4615-4620. 被引量：1
7谭拂晓,刘德荣,关新平,罗斌.基于微分对策理论的非线性控制回顾与展望[J].自动化学报,2014,40(1):1-15. 被引量：12
8林小峰,曹怒云,宋绍剑.基于ε-ADP的一类离散非线性系统最优跟踪控制[J].广西大学学报（自然科学版）,2014,39(2):372-377.
9刘鑫燕,王玉惠,吴庆宪.方向未知的非仿射非线性系统的模糊滑模控制[J].吉林大学学报（信息科学版）,2014,32(2):145-150. 被引量：2
10张绍杰,吴雪,刘春生.执行器故障不确定非线性系统最优自适应输出跟踪控制[J].自动化学报,2018,44(12):2188-2197. 被引量：9

1计算机系统、计算机网络与网络互连[J].电子科技文摘,2003,0(4):107-110.
2声音[J].科技新时代,2012(4):17-17.
3王衍.问一个技术批评家[J].人物,2017,0(1):146-147.
4LIAO Yong CHEN Xudong XIONG Guangze ZHU Qingxin, SANG Nan LI Yun.Adaptive CPU Resource Allocation for Pervasive Computing Devices Based on Optimal Control[J].Chinese Journal of Electronics,2006,15(3):431-436. 被引量：1
5DUAN ZhiSheng HUANG Lin YANG Ying.The effects of redundant control inputs in optimal control[J].Science in China(Series F),2009,52(11):1973-1981. 被引量：13
6Chunhui LI,Erchuan ZHANG,Lin JIU,Huafei SUN.Optimal control on special Euclidean group via natural gradient algorithm[J].Science China(Information Sciences),2016,59(11):59-68. 被引量：4
7WANG Xue-song CHENG Yu-hu SUN Wei.A Proposal of Adaptive PID Controller Based on Reinforcement Learning[J].Journal of China University of Mining and Technology,2007,17(1):40-44. 被引量：2
8王凤先,田俊峰,刘鑫.分布式冗余服务的自适应配置管理策略[J].计算机工程与应用,2003,39(25):92-94.
9Yanhong Luo Huaguang Zhang.Approximate optimal control for a class of nonlinear discrete-time systems with saturating actuators[J].Progress in Natural Science:Materials International,2008,18(8):1023-1029. 被引量：2
10段海新,吴建平,李星.防火墙规则的动态分配和散列表匹配算法[J].清华大学学报（自然科学版）,2001,41(1):96-98. 被引量：8

自动化学报

2009年第6期

浏览历史

内容加载中请稍等...