期刊文献+

基于MRD-DDPG的机械臂避障路径规划方法

Obstacle Avoidance Path Planning Method of Robotic Arm Based on MRD-DDPG
下载PDF
导出
摘要 提出将MRD-DDPG算法应用在机械臂避障路径规划上,解决了DDPG算法在训练过程中学习效率低、样本利用率低的问题。首先,在DDPG算法的基础上,通过改进经验池机制,提出多经验池延迟采样的深度确定性策略梯度(multi-replay buffer delay sampling-deep deterministic policy gradient,MRD-DDPG)算法,有效的缓解了样本利用率低的问题;其次,针对机械臂交互探索过程中奖励稀疏问题,设计了一种适用于避障路径规划的位置奖励函数,有效的提高了智能体的学习效率。实验结果表明,机械臂避障路径规划的平均成功率达97%左右;MRD-DDPG算法相比于DDPG算法的平均成功率提升了88%;机械臂的平均规划时间为0.638 s。 In this study,MRD-DDPG algorithm is applied to obstacle avoidance path planning of manipulator,which solves the problem of low learning efficiency and low sample utilization of DDPG algorithm in training process.Firstly,on the basis of DDPG algorithm,by improving the experience pool mechanism,a multi-replay buffer delay sampling-deep deterministic policy gradient is proposed,which effectively alleviates the problem of low sample utilization efficiency.Secondly,a position reward function suitable for obstacle avoidance path planning is designed to solve the problem of reward sparseness in the interactive exploration process of manipulators,which effectively improve the learning efficiency of the agent.The experimental results show that the average success rate of obstacle avoidance path planning is about 97%.The average success rate of MRD-DDPG algorithm is 88%higher than that of DDPG algorithm.The average planning time of the manipulator is 0.638 s.
作者 付子强 郑威强 张立萍 何丽 袁亮 邵明明 FU Ziqiang;ZHENG Weiqiang;ZHANG Liping;HE Li;YUAN Liang;SHAO Mingming(School of Mechanical Engineering,Xinjiang University,Urumqi 830047,China;School of Information Science and Technology,Beijing University of Chemical Technology,Beijing 100029,China)
出处 《组合机床与自动化加工技术》 北大核心 2023年第7期41-45,共5页 Modular Machine Tool & Automatic Manufacturing Technique
基金 国家自然科学基金项目(62063033) 新疆维吾尔自治区科技支疆项目计划(2021E02049)。
关键词 深度强化学习 DDPG 奖励函数 机械臂 路径规划 deep reinforcement learning DDPG reward function robotic arm path planning
  • 相关文献

参考文献9

二级参考文献52

共引文献96

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部