期刊文献+

基于深度强化学习的蛇形机械臂控制策略研究 被引量:1

Research on Snake-like Arm Control Strategy Based on Deep Reinforcement Learning
下载PDF
导出
摘要 针对蛇形机械臂控制问题,提出了一种基于深度强化学习的控制策略,该控制策略采用深度确定性策略梯度算法(DDPG)。分析了蛇形机械臂的结构和工作范围。基于Python语言,使用gym中的pyglet模块搭建用于产生数据的仿真环境,设置奖励函数、状态变量和动作变量,最终实现了对蛇形机械臂的精确控制。仿真实验表明:DDPG算法在蛇形机械臂的控制过程中能快速收敛,同时该控制策略在2D平面可实现对目标物的快速精确逼近,并具有较好的鲁棒性。 Aiming at the control problem of snake-like arm,a control strategy based on deep reinforcement learning is proposed,which adopts deep deterministic policy gradient(DDPG).This paper analyzes the structure and working range of the snake-like arm.Based on Python language,using pyglet module in gym to build a simulation environment for generating data,setting reward function,state variables and action variables,the precise control of the snake-like arm is finally realized.The simulation results show that the DDPG algorithm can converge quickly in the control process of the snake-like arm,and the control strategy can achieve fast and accurate approximation of the target object in the 2D plane,and has good robustness.
作者 唐超 张帆 王文龙 李徐 Tang Chao;Zhang Fan;Wang Wenlong;Li Xu(School of Mechanical and Automotive Engineering,Shanghai University of Engineering Science,Shanghai 201620,China)
出处 《农业装备与车辆工程》 2022年第8期17-21,共5页 Agricultural Equipment & Vehicle Engineering
基金 上海市科委生物医药领域科技支撑计划资助(17441901200)。
关键词 深度强化学习 蛇形机械臂 2D 控制策略 DDPG deep reinforcement learning snake-like arm 2D control strategy DDPG
  • 相关文献

参考文献7

二级参考文献44

  • 1陈伟海,陈泉柱,张建斌,张颖.线驱动拟人臂机器人逆向运动学分析[J].机械工程学报,2007,43(4):12-20. 被引量:22
  • 2ROBINSON G, DAVIES J B C. Continuum robots-a state of the art[C]//Proceedings of IEEE Intemational Conference on Robotics and Automation, May 10-15, 1999, Detroit, Michigan. IEEE, 1999: 2849-2854.
  • 3HANNAN M W, WALKER I D. Kinematics and the implementation of an elephant's trunk manipulator and other continuum style robots[J]. Journal of Robotic Systems, 2003, 20(2): 45-63.
  • 4WALKER I D, CARRERAS C, MCDONNELL R, et al Extension versus bending for continuum robots[J] International Journal of Advanced Robotic Systems, 2006, 3(2): 171-178.
  • 5GRAVAGNE I A, RAHN C D, WALKER I D. Large deflection dynamics and control for planar continuum robots[J]. IEEE/ASME Transactions on Mechatronics, 2003, 8(2): 299-307.
  • 6WALKER I D, HANNAN M W. A novel 'elephant's trunk' robot[C]//Proceedings of IEEE/ASME Imernational Conference on Advanced Intelligent Mechatronics, Sept. 19-23, 1999, Atlanta, USA. IEEE, 1999: 410-415.
  • 7JONES B A, MCMAHAN W, WALKER I D. Design and analysis of a novel pneumatic manipulator[C]// Proceedings of 3rd IFAC Symposium on Mechatronic Systems, Sept. 6-8, 2004, Sydney, Australia. 2004: 745-750.
  • 8MCMAHAN W, CHITRAKARAN V, CSENCSITS M, et al. Field trials and testing of the OctArm continuum manipulator[C]//Proceedings of IEEE International Conference on Robotics and Automation, May 15-19, 2006, Orlando, Florida. IEEE, 2006: 2336-2341.
  • 9SIMAAN N. Snake-like units using flexible backbones and actuation redundancy for enhanced miniaturization[C]// Proceedings of IEEE International Conference on Robotics and Automation, Apr. 18-22, 2005, Barcelona, Spain. IEEE, 2005. 3012-3017.
  • 10CHEN G, PHAM M T, REDARCE T, et al. Development and kinematic analysis of a silicone-rubber bending tip for colonoscopy[C]//Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, Oct. 9-15, 2006, Beijing, China. IEEE/RSJ, 2006: 168-173.

共引文献97

同被引文献12

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部