动态FOCPA学习系统设计及在机器人运动平衡控制中的应用

Design of Dynamic FOCPA Learning System and Its Application to Robot Motion Balance Control

下载PDF

导出

摘要针对仿生自主学习系统的自组织和泛化能力问题,基于Skinner操作条件反射原理和模糊聚类算法设计了动态FOCPA(fuzzy operant conditioning probabilistic automaton)仿生自主学习系统.动态FOCPA学习系统不仅具有仿生的自学习和自组织能力,而且提高了学习的精度和速度.其在仅能获得环境微弱反馈信息的前提下,首先采用在线聚类的方法实现对输入空间的灵活划分,以确保映射规则的数目是最经济的;然后以取向值为评价信号,采用OC学习算法,在线自主学习输入状态到输出操作行为的最佳映射,并加入一个高斯噪声项对映射结果进行实时优化.此外,动态FOCPA学习系统还利用信息熵的评价能力,来验证自身的自学习和自组织能力.理论上分析了设计的OC学习算法的收敛性;通过对两轮柔性直立式机器人姿态平衡控制和速度控制的实验分析,验证了动态FOCPA学习系统的有效性. Aiming at the ability of self-organization and generalization of bionic autonomous learning system,this paper constructs a dynamic fuzzy operant conditioning probabilistic automaton（FOCPA） bionic autonomous learning system based on Skinner operant conditioning（OC） theory and fuzzy clustering algorithm.The dynamic FOCPA learning system not only has bionic self-learning and self-organizing ability,but also can improve the learning speed and precision of learning system. Under the learning environment where only weak feedback information can be obtained,the FOCPA learning system firstly adopts online clustering algorithm to flexibly divide the input space to ensure that the number of mapping rules is the most economical.And then the learning system takes orientation value as evaluation signal and adopts the designed OC learning algorithm to autonomously learn the optimal mapping online from input states to output operant action,and a Gaussian noise term is added for optimizing the mapping result in real time.Moreover,by using the evaluating ability of information entropy, the self-learning and self-organizing ability is verified.The convergence of OC learning algorithm is proved from theory,and the further experiments on posture balancing control and velocity control of two-wheeled flexible upright robot prove the validity of dynamic FOCPA learning system.

作者蔡建羡阮晓钢

机构地区北京工业大学电子信息与控制工程学院防灾科技学院

出处《信息与控制》 CSCD 北大核心 2010年第5期662-672,共11页 Information and Control

基金国家自然科学基金资助项目(60774077) 国家863计划资助项目(2007AA04Z226) 北京市教委重点科技项目(KZ200810005002)

关键词操作条件反射模糊聚类仿生自主学习系统信息熵姿态平衡控制速度控制 operant conditioning fuzzy clustering bionic autonomous learning system information entropy posture-balanced control velocity control

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献10

1Yilddirim S. Design of adaptive robot control system using recurrent neural network[J]. Journal of Intelligent & Robotic Systems, 2005, 44(3): 247-261.
2Floreano D, Mondada E Evolutionary neuro-controller for autonomous mobile robots[J]. Neural Networks, 1998, 11(7/8): 1461-1478.
3Skinner B E Two types of conditioned reflex and a pseudo type[J]. Journal of General Psychology, 1935, 1(12): 66-77.
4Skinner B E The behavior of organisms: An experimental analysis[M]. New York, NJ, USA: Appleton-Century-Crofts, 1938.
5Touretzky D S, Saksida L M. Operant conditioning in Skinnerbots[J]. Adaptive Behavior, 1997, 5(3/4): 219-247.
6Touretzky D S, Daw N D, Tim-Thompson E J. Combining configured and TD learning on a robot[C]//Proceedings of the 2nd International Conference on Development and Learning. Piscataway, NJ, USA: IEEE, 2002: 47-52.
7Tira-Thornpson E J, Halelamien N S, Wales J J, et al. Cognitive primitives for mobile robots[R]. Menlo Park, CA,USA: AAAI, 2004.
8Touretzky D S, Tira-Thompson E J. Tekkotsu: A framework for AIBO cognitive robotics[C]//Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference. Menlo Park, CA,USA: AAAI, 2005: 1741-1742.
9蒋宗礼,姜守旭.形式语青与自动机理论[M].北京:清华大学出版社,2007.
10Juang C E Combination of online clustering and Q-value based GA for reinforcement fuzzy system design[J]. IEEE Transactions on Fuzzy System, 2004, 13(3): 289-302.

1蔡建羡,阮晓钢.OCPA仿生自主学习系统及在机器人姿态平衡控制上的应用[J].模式识别与人工智能,2011,24(1):138-146. 被引量：5
2阮晓钢,蔡建羡.模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制[J].控制理论与应用,2010,27(7):960-964. 被引量：2
3蔡建羡,阮晓钢.基于遗传算法的Skinner操作条件反射学习模型[J].系统工程与电子技术,2011,33(6):1370-1376. 被引量：3
4戴丽珍,杨刚,阮晓钢.基于AOCA仿生学习模型的两轮机器人自主平衡学习研究[J].自动化学报,2014,40(9):1951-1957. 被引量：3
5王帅,李光泽,李宾泽.基于操作条件反射的自主学习型智能系统[J].科技创新导报,2014,11(10):223-223.
6阮晓钢,戴丽珍,于乃功,于建均.一种自治操作条件反射自动机[J].控制理论与应用,2012,29(11):1452-1457. 被引量：2
7李静.基于网格的学习系统设计[J].信阳师范学院学报（自然科学版）,2009,22(3):461-464.
8邓敏,陈斌,徐方.基于移动互联网的移动学习系统设计框架研究[J].湖北工程学院学报,2016,36(6):53-56. 被引量：2
9郜园园,阮晓钢,宋洪军.操作条件反射学习自动机及其在机器人平衡控制中的应用[J].控制与决策,2013,28(6):930-934. 被引量：3
10王海洋,张艳,徐静.基于Web挖掘的个性化网络学习系统设计[J].微计算机信息,2010,26(6):114-115. 被引量：6

信息与控制

2010年第5期

浏览历史

内容加载中请稍等...

动态FOCPA学习系统设计及在机器人运动平衡控制中的应用

参考文献10

相关作者

相关机构

相关主题

浏览历史