期刊文献+

Bionic autonomous learning control of a two-wheeled self-balancing flexible robot 被引量:2

Bionic autonomous learning control of a two-wheeled self-balancing flexible robot
原文传递
导出
摘要 This paper presents an OCPA (operant conditioning probabilistic automaton) bionic autonomous learning system based on Skinner's operant conditioning theory for solving the balance control problem of a two-wheeled flexible robot. The OCPA learning system consists of two stages: in the first stage, an operant action is selected stochastically from a set of operant actions and then used as the input of the control system; in the second stage, the learning system gathers the orientation information of the system and uses it for optimization until achieves control target. At the same time, the size of the operant action set can be automatically reduced during the learning process for avoiding little probability event. Theory analysis is made for the designed OCPA learning system in the paper, which theoretically proves the convergence of operant conditioning learning mechanism in OCPA learning system, namely the operant action entropy will converge to minimum with the learning process. And then OCPA learning system is applied to posture balanced control of two-wheeled flexible self-balanced robots. Robot does not have posutre balanced skill in initial state and the selecting probability of each operant in operant sets is equal. With the learning proceeding, the selected probabilities of optimal operant gradually tend to one and the operant action entropy gradually tends to minimum, and so robot gradually learned the posture balanced skill. This paper presents an OCPA (operant conditioning probabilistic automaton) bionic autonomous learning system based on Skinner's operant conditioning theory for solving the balance control problem of a two-wheeled flexible robot. The OCPA learning system consists of two stages: in the first stage, an operant action is selected stochastically from a set of operant actions and then used as the input of the control system; in the second stage, the learning system gathers the orientation information of the system and uses it for optimization until achieves control target. At the same time, the size of the operant action set can be automatically reduced during the learning process for avoiding little probability event. Theory analysis is made for the designed OCPA learning system in the paper, which theoretically proves the convergence of operant conditioning learning mechanism in OCPA learning system, namely the operant action entropy will converge to minimum with the learning process. And then OCPA learning system is applied to posture balanced control of two-wheeled flexible self-balanced robots. Robot does not have posutre balanced skill in initial state and the selecting probability of each operant in operant sets is equal. With the learning proceeding, the selected probabilities of optimal operant gradually tend to one and the operant action entropy gradually tends to minimum, and so robot gradually learned the posture balanced skill.
出处 《控制理论与应用(英文版)》 EI 2011年第4期521-528,共8页
基金 supported by the National Natural Science Foundation of China (No. 60774077) the National High Technology Development Plan(863) of China (No. 2007AA04Z226) the Beijing Municipal Education Commission Key Project (No. KZ200810005002) the Beijing Natural Science Foundation Project (No. 4102011)
关键词 Two-wheeled flexible robot Poster balance control Operant conditioning Probabilistic automaton Bionic autonomous learning Two-wheeled flexible robot Poster balance control Operant conditioning Probabilistic automaton Bionic autonomous learning
  • 相关文献

参考文献17

  • 1A. Salerno, J. Angeles. On the nonlinear controllability of a quasiholonomic mobile robot. Proceedings of the IEEE International Conference on Robotic and Automation. Piscataway: IEEE, 2003: 3379 - 3384.
  • 2F. Grasser, A. D'Arrigo, S. Colombi, et al. JOE: A mobile, inverted pendulum. IEEE Transactions on Industrial Electronics, 2002, 49(1 ): 107 - 114.
  • 3D. H. Kim, J. H. Oh. Tracking control of a two-wheeled mobile robot using input-output linearization. Control Engineering Practice, 1999, 7(3): 369 - 373.
  • 4T. Urakubo, K. Tsuchiya, K. Tsujita. Motion control of a two-wheeled mobile robot. Advanced Robotics, 2001, 15(7): 711 - 728.
  • 5K. Kozlowski, D. Pazdersdi. Stabilization of two-wheeled mobile robot using smooth control law: experiment study. Proceedings of the IEEE International Conference on Robotic and Automation. Florida: IEEE, 2006:3387 - 3392.
  • 6X. Ruan, J. Zhao. The flexible two-wheeled self-balancing robot based on hopfield. Intelligent Robotics and Applications, 2009, 5928: 1196 - 1204.
  • 7D. McFarland, T. Bosser. Intelligent Behavior in Animals and Robots. Cambridge: MIT Press, 1993.
  • 8J. Simons, H. Brussel, J. de Schutter, et al. A self-learning automaton with variable resolution for high precision assembly by industrial robots. 1EEE Transactions on Automatic Control, 1982, 27(5): 1109 -1113.
  • 9B. J. Oommen. Trajectory planning of robot manipulators in noisy work spaces using stochastic automata. The International Journal of Robotics Re,-earch, 1991, 10(2): 135- 148.
  • 10B. F. Skinner. The behavior of organisms. New York: Appleton- Century-Crofts, 1938.

同被引文献9

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部