期刊文献+

基于增强学习规则的倒立摆模糊神经网络控制器 被引量:1

Fuzzy Neural Network Controller of Inverted Pendulum with Reinforcement Learn ing Rule
下载PDF
导出
摘要 为实现模型未知、初始时有脉冲输入的车上单级倒立摆镇定控制,提出了一种采用增强学习规则训练的模糊神经网络控制器。以神经网络构造基于T-S(Tankagi-Sugeno)规则的模糊控制器;用3层前馈网络组成预测器进入仿真,得到倒立摆状态并计算状态预测值,再将状态和状态预测值组成训练数据对,训练状态预测BP(Backward Propagation)网络;利用增强学习的方法训练模糊控制器,根据神经网络产生的模糊控制量和倒立摆状态预测,做出控制决策。此方法简化了模糊控制部分参数调整,亦可应用于其他无模型控制。实验证明,控制器鲁棒性良好,即使在倒立摆参数变化较大时,控制器仍能维持倒立摆平衡。 A training method for fuzzy neural network controller with reinforcement learning is proposed to maintain the balance of a single inverted pendulum on cart. The inverted pendulum model is unknown and it is disturbed by impulse input at the beginning. Build fuzzy controller based on T-S (Tankagi-Sugeno) rules with neural network; Begin the simulation with a forward network of three layers to obtain states and state-predictions; State-prediction BP (Backward Propagation) network is trained with pairs of state and state prediction; Fuzzy neural network controller is trained using reinforcement learning rule. Control decision is made due to fuzzy control and state prediction. This method simplifies adjustment of fuzzy control parameters, and is suitable to other controls without system model. Experiments showe that the control method has sound robustness. Even if the parameters of inverted pendulum change a lot, the controller could maintain the balance of inverted pendulum however.
出处 《吉林大学学报(信息科学版)》 CAS 2006年第5期561-566,共6页 Journal of Jilin University(Information Science Edition)
关键词 增强学习 模糊神经网络 T-S模型 倒立摆 reinforcement learning fuzzy neural network controller Takagi-Sugeno model inverted pendulum
  • 相关文献

参考文献18

  • 1SMIRNOV E Y.Control of Rotational Motion of a Free Solid by Means of Pendulums[J].Mechanics of Solids,1980,15(3):1-5.
  • 2FURUTA K,KAJIWARA H.Digital Control of a Double Inverted Pendulum on an Inclined Rail[ J ].International Journal of Control,1980,32 (5):907-924.
  • 3WATTS J W.Control of an Inverted Pendulum[C] // Proceedings,92nd Annual Conference-American Society for Engineering Education,Engineering Education.Salt Lake City,USA:ASEE,1984:706-710.
  • 4LEFEBVRE S,RICHTER S.Decentralized Variable Structure Control Design for a Two-Pendulum System[ J ].IEEE Transanction on Automatic Control,1983,28 (13):1112-1114.
  • 5YI Jian-qiang,YUBAZAKI N,HIROTA K.A New Fuzzy Controller for Stabilization of Parallel-Type Double Inverted Pendulum System[J].Fuzzy Sets and Systems,2002,126 (1):105-119.
  • 6MOHANLAL P P,KAIMAL M R.Exact Fuzzy Modeling and Optimal Control of the Inverted Pendulum on Cart[C] // 41st IEEE Conference on Decision and Control.Las Vegas:IEEE,2002 (3):3255-3260.
  • 7ANDERSON C W.Learning to Control an Inverted Pendulum Using Neural Networks[ J ].IEEE Control System Magazine,1989,9 (3):31-37.
  • 8蒋国飞,吴沧浦.基于Q学习算法和BP神经网络的倒立摆控制[J].自动化学报,1998,24(5):662-666. 被引量:55
  • 9WU Q.Neural Inverse Modeling and Control of a Base-Eexcited Inverted Pendulum[ J ].Engineering Applications of Artificial Intelligence,2002,15 (3/4):261-272.
  • 10GAO Yang,ER M J.Online Adaptive Fuzzy Neural Identification and Control of a Class of MIMO Nonlinear Systems[ J].IEEE Transactions on Fuzzy Systems,2003,11 (4):462-477.

二级参考文献3

共引文献137

同被引文献15

  • 1康怀祺,史彩成,何佩琨,李晓琼.Novel Sequential Neural Network Learning Algorithm for Function Approximation[J].Journal of Beijing Institute of Technology,2007,16(2):197-200. 被引量:1
  • 2SUTTON R S, BARTO A G. Reinforcement Learning: An Introductin [ M]. Cambridge, MA: MIT Press, 1998.
  • 3THURN S, MITCHEIL T M. Lifelong Robot Leaning [J]. Robotics and Autonomous System, 1995, 15 (1) : 25-46.
  • 4WATKINS C, DAYAN P. Q-Learning [J]. Machine Learning, 1992, 8 (3/4): 279-292.
  • 5WIDROW B, RUMELHART D E, LEHR M A. The Basic Ideas in Neural Networks [ J]. Communications of the ACM, 1994, 37 (3) : 87-92.
  • 6WANG Xue-song, CHENG Yu-hu, SUN Wei. Q Learning Based on Self-Organizing Fuzzy Radial Basis Function Network [ C] //Thrid International symposium on Neural Networks. Berlin Heidelberg: Springer Verlag, 2006: 607-615.
  • 7PARK J, SANDBERG I W. Universal Approximation Using Radial Basis Functions Networks [ J ]. Neural Computation, 1991, 3 (2): 246-257.
  • 8JUN L. Learning Reactive Behaviors with Constructive Neural Network in Mobile Robotics [ D]. [ S.l. ] : Orebro Studies in Technology, 2006.
  • 9STASTNY J, SKORPIL. Analysis of Algorithms for Radial Basis Function Neural Network [ C ] // IFIP International Federation for Information Processing. [ S. l. ] : Springer, 2007, 245 : 54-62.
  • 10MOODY J, DARKEN C. Learning with Localized Receptive Field [ C ] // In Proc Connection Models Summer School. [ S. l. ] : Morgan Kaufmann, 1988: 133-143.

引证文献1

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部