Reinforcement Learning Based Motion Planning of Dynamic Manipulation Task for Manipulator

Abstract: A reinforcement learning (RL) approach is proposed for the motion planning of dynamic manipulation tasks performed by a robot. For such tasks, the choice of input and output variables and the design of the reinforcement function are analyzed, and an adaptive heuristic critic (AHC) algorithm for problems with continuous inputs and outputs is given. With RL, the robot needs only its forward kinematics: it learns the motion through repeated trials, which avoids the complex inverse kinematics computations involved in conventional motion planning methods. Finally, a simulation study is carried out in which a planar 3-link manipulator catches a free-flying ball; the results show that the method is effective and feasible.
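The abstract's point that trial-based learning needs only forward kinematics can be illustrated with a minimal sketch. This is not the authors' implementation: the link lengths, the drag-free ballistic ball model, and the distance-based reinforcement signal below are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math

# Hypothetical link lengths in metres; the abstract does not give the real values.
LINK_LENGTHS = (0.4, 0.3, 0.2)


def forward_kinematics(joint_angles, link_lengths=LINK_LENGTHS):
    """Return the (x, y) end-effector position of a planar serial arm.

    Each joint angle is measured relative to the previous link, so the
    absolute orientation of a link is the running sum of the angles.
    """
    x = y = 0.0
    cumulative = 0.0
    for angle, length in zip(joint_angles, link_lengths):
        cumulative += angle
        x += length * math.cos(cumulative)
        y += length * math.sin(cumulative)
    return x, y


def ball_position(t, p0, v0, g=9.81):
    """Position at time t of a ball in free flight (assumed: gravity only, no drag)."""
    return p0[0] + v0[0] * t, p0[1] + v0[1] * t - 0.5 * g * t * t


def reinforcement(joint_angles, t, p0, v0):
    """Assumed reinforcement signal: negative hand-to-ball distance at time t.

    Only forward kinematics is evaluated; no inverse kinematics is solved.
    """
    hx, hy = forward_kinematics(joint_angles)
    bx, by = ball_position(t, p0, v0)
    return -math.hypot(hx - bx, hy - by)
```

In a learning loop, a policy would propose joint angles, the signal above would score the attempt, and an actor-critic scheme such as AHC would adjust the policy accordingly; that loop is omitted here because the abstract does not specify its details.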

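Authors: Zhang Peiyan, Lü Tiansheng

Affiliation: Engineering Training Center, Shanghai Jiao Tong University

Source: Journal of System Simulation (《系统仿真学报》), 2006, No. 9, pp. 2537-2540 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Keywords: reinforcement learning; motion planning; dynamic manipulation task

Classification: TP242.1 [Automation and Computer Technology: Detection Technology and Automatic Devices]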

