期刊文献+

模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制 被引量:2

Fuzzy operant conditioning probabilistic automaton bionic autonomous learning system and robot self-balancing control
下载PDF
导出
摘要 为了实现两轮机器人的自平衡控制,利用Skinner操作条件反射机理,以概率自动机为平台,融入模糊推理,构造了模糊操作条件概率自动机(OCPA)仿生自主学习系统.该学习系统是一个从状态集合到操作行为集合的随机映射,采用操作条件反射学习机制,从操作行为集合中随机学习作为控制系统控制信号的最优行为,并利用学习到的操作行为取向值信息,调整操作条件反射学习算法.此外,学习系统还引入行为熵,以验证其自学习和自组织能力.应用于两轮机器人自平衡控制的仿真结果,验证了模糊OCPA学习系统的可行性. A fuzzy operant conditioning probabilistic automaton(OCPA) bionic autonomous learning system is constructed based on Skinner operant conditioning theory and combined with the probabilistic automaton and fuzzy inference for realizing a two-wheeled robot self-balancing control. The learning system is a stochastic mapping from state sets to operant action sets. The optimal action for controlling the system is stochastically learned from the operant action set by adopting operant conditioning learning algorithm; in the same time the orientation value information of the learned operant action is used to adjust the operant conditioning learning algorithm. In addition, the action entropy is added to verify the self-learning and self-organizing ability of the learning system. In the simulation, a two-wheeled robot self-balancing control is realized, demonstrating the feasibility of the fuzzy OCPA learning system.
出处 《控制理论与应用》 EI CAS CSCD 北大核心 2010年第7期960-964,共5页 Control Theory & Applications
基金 国家自然科学基金资助项目(60774077) 国家“863”计划资助项目(2007AA04Z226) 北京市教委重点资助项目(KZ200810005002)
关键词 操作条件反射 概率自动机 模糊推理 仿生自主学习系统 自平衡控制 operant conditioning probabilistic automaton fuzzy inference bionic autonomous learning system entropy self-balancing control
  • 相关文献

参考文献11

  • 1KIM D H,OH J H.Tracking control of a two-wheeled mobile robot using input-output linearization[J].Control Engineering Practice,1999,7(3):369-373.
  • 2URAKUBO T,TSUCHIYA K,TSUJITA K.Motion control of a twowheeled mobile robot[J].Advanced Robotics,2001,15(7):711-728.
  • 3ZHOU J,WU W.The application of disturbance observer in twowheeled mobile robot[C] //2004 IEEE Conference on Robotics,Automation and Mechatronics.Singapore:IEEE,2004,1:171-174.
  • 4KOZLOWSKI K,PAZDERSKI D.Stabilization of two-wheeled mobile robot using smooth control law[C] //The 2006 IEEE International Conference on Robotics and Automation.Orlando:Institute of Electrical and Electronics Engineers Inc,2006:3387-3392.
  • 5MCFARLAND D,BOSSER T.Intelligent Behavior in Animals and Robots[M].Cambridge:MIT Press,1993.
  • 6SKINNER B F.The Behavior of Organisms[M].New York:Appleton-Century-Crofts,1938.
  • 7SAKSIDA L M,TOURETZKY D S.Application of a model of instrumental conditioning to mobile robot control[J].Sensor Fusion and Decentralized Control in Autonomous Robotic Systems,1997,32(9):55-66.
  • 8TOURETZKY D S,SAKSIDA L M.Operant conditioning in Skinnerbots[J].Adaptive Behavior,1997,5(3/4):219-247.
  • 9SAKSIDA L M,RAYMOND S M,TOURETZKY D S.Shaping robot behavior using principles from instrumental conditioning[J].Robotics and Autonomous Systems,1998,22(3/4):231-249.
  • 10TOURETZKY D S,DAW N D,TIRA-THOMPSON E J.Combining configured and TD learning on a robot[C] //The 2nd International Conference on Development and Learning.Pittsburgh:IEEE,2002:47-52.

二级参考文献6

共引文献3

同被引文献20

  • 1MISRA S, OOMMEN B J. Dynamic algorithms for the shortest path routing problem: learning automata-based solutions [J]. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernet- ics, 2005, 35(6): 1179- 1192.
  • 2NAJIM K, PIBOULEAU L, LE LANN M V. Optimization technique based on learning automata [J]. Journal of Optimization Theory and Applications, 1990, 64(2): 331 - 347.
  • 3BANKS J S, SUNDARAM R K. Repeated games, finite automata, and complexity [J]. Games and Economic Behavior, 1990, 2(2): 97 - 117.
  • 4KARHUMAKI J, PLANDOWSKI W, RYTTER W. Pattern- Matching problems for two-dimensional images described by finite automata [J]. Nordic Journal of Computing, 2000, 7(1): 1 - 13.
  • 5ROSIN P L. Image processing using 3-state cellular automata [J]. Computer Vision andlmage Understanding, 2010, 114(7): 790 - 802.
  • 6LAMEGO M M. Automata control systems [J]. lET Control Theory & Applications, 2007, 1(1): 358- 371.
  • 7SKINNER B E Two types of conditioned reflex and a pseudo type [J]. Journal of General Psychology, 1935, 12(1): 66 - 77.
  • 8SKINNER B F. The Behavior of Organisms an Experimental Analy- sis [M]. New York & London: D. Appleton-Century Compan, 1938.
  • 9DAWN D, TOURETZKY D S. Operant behavior suggests attentional gating of dopamine system inputs [J]. Neurocomputing, 2001, 38-40: 1161 - 1167.
  • 10SAKS1DA L M, RAYMOND S M, TOURETZKY D S. Shaping robot behavior using principles from instrumental conditioning [J]. Robotics and Autonomous Systems, 1997, 22(3/4): 231 - 249.

引证文献2

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部