期刊文献+

基于强化学习的机械臂避碰研究 被引量:1

Reinforcement learning based obstacle avoidance for robotic manipulator
下载PDF
导出
摘要 将强化学习引入机械臂的避碰问题研究,建立了平面三自由度机械臂的多Agent避碰系统,系统结合了最近障碍物信息和偏差角信息来产生控制指令。采用基于K-均值聚类的强化学习方法作为基本的控制策略,给出了系统算法的具体实施过程。通过仿真试验,证明了基于聚类划分的强化学习方法在机械臂避碰问题中的可行性和有效性。 This paper reports on the obstacle avoidance problem for robotic manipulators.The reinforcement learning(RL) method was applied to obstacle avoidance problem and a multi-agent system was built.According the real-time demand of manipulator control, the Sarsa(A) algorithm, which was combined with K-means clustering algorithm, has been selected for its on-policy feature and efficiency. The implement process of the algorithm was given and in the end of this paper, a simulation experiment with different environment was done, the result showed the RL method's feasibility and availability.
出处 《机械设计与制造》 北大核心 2007年第8期140-142,共3页 Machinery Design & Manufacture
基金 国家民用航天科研专项计划(科工技[20041530)
关键词 强化学习 避碰 AGENT K-均值聚类 Reinforcement learning Obstacle avoidance Agent K-means clustering
  • 相关文献

参考文献5

  • 1O.Khatib.Real-time obstacle avoidance for anipulators and mobile robots[J].Robotics,1986,5(1):90-98.
  • 2庄慧忠,杜树新,吴铁军.机器人路径规划及相关算法研究[J].科技通报,2004,20(3):210-215. 被引量:36
  • 3K.Altlioefer,L.D.Seneviratnc.Fuzzy Navigation for Robotic Manipulators[J].International Journal of Uncertainty,Fuzziness and Knowledge-based System,1998,6(2):179-188.
  • 4R S Sutton,A G Barto.Reinforcement Learning:An Introduction[M].Massachusetts:The MIT Press,1998.
  • 5高阳,陈世福,陆鑫.强化学习研究综述[J].自动化学报,2004,30(1):86-100. 被引量:269

二级参考文献40

  • 1Barraquand J,Langlois B,Latombe J C.Numerical potential field techniques for robot path planning [J].IEEE Trans on Systems,Man and Cybernetics,1992,22(2):224-241.
  • 2Kondo K.Motion planning with six degrees of freedom by multi-strategic bi-directional heuristic free-space enumeration [J].IEEE Trans on Robotics and Automation,1991,7(3):267-277.
  • 3Gupta K K.Fast collision avoidance for manipulator arm:a sequential search strategy [J].IEEE Trans on Robotics and Automation,1990,6(5):522-532.
  • 4Gupta K,K,Guo Z P.Motion planning for many degrees of freedom:sequential search with backtracking [J].IEEE Trans on Robotics and Automation,1995,11(6):897-906.
  • 5Kavraki L E,Svestka P,Latombe J C,et al.Probabilistic roadmaps for path planning in high-dimensional configuration spaces [J].IEEE Trans on Robotics and Automation,1996,12(4):566-580.
  • 6Lozano-Perez T.Automatic planning of manipulator transfer movement [J].IEEE Trans on Systems,Man and Cybernetics,1981,11(10):681-698.
  • 7Lozano-Perez T.Spatial planning:A configuration space approach [J].IEEE Trans on Computers,1983,32(2):108-120.
  • 8Zhu D,Latombe J C.New heuristic for efficient hierarchical path planning [J].IEEE Trans.on Robotics and Automation,Feb 1991,7:9-20.
  • 9Schwartz J T,Sharir M.On the 'piano movers' problem-I:The case of a two-dimensional rigid polygonal body moving amidst polygonal barriers [J].Comm Pure Applied Math,1983,36:345-398.
  • 10Schwartz J T,Sharir M.On the 'piano movers' problem-II:General techniques for computing topological properties of real algebraic manifolds [J].Advances in Applied Mathematics,1983,4:298-351.

共引文献303

同被引文献1

引证文献1

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部