Bionic autonomous learning control of a two-wheeled self-balancing flexible robot 被引量：2

Bionic autonomous learning control of a two-wheeled self-balancing flexible robot

导出

摘要 This paper presents an OCPA （operant conditioning probabilistic automaton） bionic autonomous learning system based on Skinner＇s operant conditioning theory for solving the balance control problem of a two-wheeled flexible robot. The OCPA learning system consists of two stages： in the first stage, an operant action is selected stochastically from a set of operant actions and then used as the input of the control system; in the second stage, the learning system gathers the orientation information of the system and uses it for optimization until achieves control target. At the same time, the size of the operant action set can be automatically reduced during the learning process for avoiding little probability event. Theory analysis is made for the designed OCPA learning system in the paper, which theoretically proves the convergence of operant conditioning learning mechanism in OCPA learning system, namely the operant action entropy will converge to minimum with the learning process. And then OCPA learning system is applied to posture balanced control of two-wheeled flexible self-balanced robots. Robot does not have posutre balanced skill in initial state and the selecting probability of each operant in operant sets is equal. With the learning proceeding, the selected probabilities of optimal operant gradually tend to one and the operant action entropy gradually tends to minimum, and so robot gradually learned the posture balanced skill. This paper presents an OCPA （operant conditioning probabilistic automaton） bionic autonomous learning system based on Skinner＇s operant conditioning theory for solving the balance control problem of a two-wheeled flexible robot. The OCPA learning system consists of two stages： in the first stage, an operant action is selected stochastically from a set of operant actions and then used as the input of the control system; in the second stage, the learning system gathers the orientation information of the system and uses it for optimization until achieves control target. At the same time, the size of the operant action set can be automatically reduced during the learning process for avoiding little probability event. Theory analysis is made for the designed OCPA learning system in the paper, which theoretically proves the convergence of operant conditioning learning mechanism in OCPA learning system, namely the operant action entropy will converge to minimum with the learning process. And then OCPA learning system is applied to posture balanced control of two-wheeled flexible self-balanced robots. Robot does not have posutre balanced skill in initial state and the selecting probability of each operant in operant sets is equal. With the learning proceeding, the selected probabilities of optimal operant gradually tend to one and the operant action entropy gradually tends to minimum, and so robot gradually learned the posture balanced skill.

作者 Jianxian CAI Xiaogang RUAN

机构地区 School of Electronic Information and Control Engineering Institute of Disaster Prevention

出处《控制理论与应用（英文版）》 EI 2011年第4期521-528,共8页

基金 supported by the National Natural Science Foundation of China (No. 60774077) the National High Technology Development Plan(863) of China (No. 2007AA04Z226) the Beijing Municipal Education Commission Key Project (No. KZ200810005002) the Beijing Natural Science Foundation Project (No. 4102011)

关键词 Two-wheeled flexible robot Poster balance control Operant conditioning Probabilistic automaton Bionic autonomous learning Two-wheeled flexible robot Poster balance control Operant conditioning Probabilistic automaton Bionic autonomous learning

分类号 TP242 [自动化与计算机技术—检测技术与自动化装置] TL631.24 [核科学技术—核技术及应用]

引文网络
相关文献

参考文献17

1A. Salerno, J. Angeles. On the nonlinear controllability of a quasiholonomic mobile robot. Proceedings of the IEEE International Conference on Robotic and Automation. Piscataway: IEEE, 2003: 3379 - 3384.
2F. Grasser, A. D'Arrigo, S. Colombi, et al. JOE: A mobile, inverted pendulum. IEEE Transactions on Industrial Electronics, 2002, 49(1 ): 107 - 114.
3D. H. Kim, J. H. Oh. Tracking control of a two-wheeled mobile robot using input-output linearization. Control Engineering Practice, 1999, 7(3): 369 - 373.
4T. Urakubo, K. Tsuchiya, K. Tsujita. Motion control of a two-wheeled mobile robot. Advanced Robotics, 2001, 15(7): 711 - 728.
5K. Kozlowski, D. Pazdersdi. Stabilization of two-wheeled mobile robot using smooth control law: experiment study. Proceedings of the IEEE International Conference on Robotic and Automation. Florida: IEEE, 2006:3387 - 3392.
6X. Ruan, J. Zhao. The flexible two-wheeled self-balancing robot based on hopfield. Intelligent Robotics and Applications, 2009, 5928: 1196 - 1204.
7D. McFarland, T. Bosser. Intelligent Behavior in Animals and Robots. Cambridge: MIT Press, 1993.
8J. Simons, H. Brussel, J. de Schutter, et al. A self-learning automaton with variable resolution for high precision assembly by industrial robots. 1EEE Transactions on Automatic Control, 1982, 27(5): 1109 -1113.
9B. J. Oommen. Trajectory planning of robot manipulators in noisy work spaces using stochastic automata. The International Journal of Robotics Re,-earch, 1991, 10(2): 135- 148.
10B. F. Skinner. The behavior of organisms. New York: Appleton- Century-Crofts, 1938.

同被引文献9

1薛凡,孙京诰,严怀成.两轮平衡车的建模与控制研究[J].化工自动化及仪表,2012,39(11):1450-1454. 被引量：21
2王晓宇,闫继宏,臧希喆,秦勇,赵杰.两轮自平衡机器人多传感器数据融合方法研究[J].传感技术学报,2007,20(3):668-672. 被引量：11
3阮晓钢,任红格.两轮自平衡机器人动力学建模及其平衡控制[J].计算机应用研究,2009,26(1):99-101. 被引量：34
4丁珠玉,陈建,李云武,吴达科.基于模糊PID的花椒烘房温度自动控制系统[J].农业工程学报,2010,26(S1):32-36. 被引量：9
5郜园园,阮晓钢,宋洪军,陈静.两轮自平衡机器人惯性传感器滤波问题的研究[J].传感技术学报,2010,23(5):696-700. 被引量：30
6刘明,王洪军,李永科.直立行走的智能车设计方案[J].科技信息,2012(20):122-122. 被引量：4
7林文建,钟杭,黎福海,肖祥慧,钱馨然.两轮自平衡机器人控制系统设计与实现[J].电子测量与仪器学报,2013,27(8):750-759. 被引量：33
8唐永川,刘枫,祁虔,李祖枢,高帆.平面倒立摆系统的自校正仿人协调控制[J].西南大学学报（自然科学版）,2013,35(10):94-104. 被引量：4
9徐国华,谭民.移动机器人的发展现状及其趋势[J].机器人技术与应用,2001(3):7-14. 被引量：189

引证文献2

1季浚涛.两轮自平衡避障机器人[J].科技信息,2013,0(34):252-253.
2杨勋涛,樊利,颜新华,郑远,丁珠玉.两轮自平衡小车启动暂态过程的研究[J].西南师范大学学报（自然科学版）,2014,39(12):87-93. 被引量：3

二级引证文献3

1王来志,杨雨浓.一级旋转倒立摆及其控制装置的研究与实现[J].西南师范大学学报（自然科学版）,2016,41(8):145-150. 被引量：6
2范硕,陶翔翔,王志明.基于STM32的旋转倒立摆实验平台的下位机设计与实现[J].电脑知识与技术,2018,14(6):219-221. 被引量：4
3杨程翔.基于直流伺服电机的旋转倒立摆实验装置的设计[J].科技风,2020(29):1-2. 被引量：1

1Wang Changhong Zhang Fuen(School of Astronautics).An Approach to the Design of Learning Controllers[J].哈尔滨工业大学学报,1990,22(3):84-91. 被引量：1
2杨崇耀,戚国正,康家成.关于概率自动机的分解[J].贵州科学,1992,10(1):94-99. 被引量：1
3吕品.计算机专业人才培养中需正视的问题[J].理工高教研究,2005,24(4):111-112. 被引量：2
4周文波,陈健,谢力,何晓波.Web数据挖掘技术在远程教育中的应用[J].防灾科技学院学报,2007,9(4):105-107. 被引量：2
5孙云平,李俊民,王江安,王辉林.Generalized projective synchronization of chaotic systems via adaptive learning control[J].Chinese Physics B,2010,19(2):119-126. 被引量：19
6Farooq M,Wang Daobo,Dar N. U.Improved hybrid position/force controller design of a flexible robot manipulator using a sliding observer[J].Journal of Systems Engineering and Electronics,2009,20(1):146-158. 被引量：4
7魏若岩,阮晓钢,于乃功,黄静,朱晓庆,肖尧.基于Skinner操作条件反射的抽样一致性算法[J].控制与决策,2015,30(2):235-240. 被引量：3
8LUOJun,LUTian－sheng.Kinematic Control of Wheeled Snake-Like Mobile Robot[J].Journal of Shanghai University(English Edition),2001,5(4):312-316.
9冀亚林,艾迪明,王学义,刘滨.Web服务器日志文件广义集成分析模型[J].计算机工程与应用,2006,42(12):159-164.
10蔡建羡,阮晓钢.基于遗传算法的Skinner操作条件反射学习模型[J].系统工程与电子技术,2011,33(6):1370-1376. 被引量：3

控制理论与应用（英文版）

2011年第4期

浏览历史

内容加载中请稍等...

Bionic autonomous learning control of a two-wheeled self-balancing flexible robot 被引量：2

参考文献17

同被引文献9

引证文献2

二级引证文献3

相关作者

相关机构

相关主题

浏览历史