摘要
六足机器人拥有多个冗余自由度,适用于复杂的非结构环境。离散环境作为非结构环境的一个苛刻特例,需要六足机器人具备更加高效可靠的运动策略。以平面随机梅花桩为例,设定随机起始点与目标区域,利用深度强化学习算法进行训练,并得到六足机器人在平面梅花桩环境中的运动策略。为了加快训练进程,采用具有优先经验重放机制的深度确定性策略梯度算法。最后在真实环境中进行验证,实验结果表明,所规划的运动策略能让六足机器人在平面梅花桩环境中高效平稳地从起始点运动到目标区域。为六足机器人在真实离散环境中的精确运动规划奠定了基础。
Hexapod robot have multiple redundant degrees of freedom and are suitable for complex unstructured environments.Discrete environments,as a harsh special case of unstructured environments,require hexapod robots to have more efficient and reliable motion strategies.A plane random plum-blossom pile environment was taken as an example.A random starting point and a target area were set,and the deep reinforcement learning algorithm was applied to plan a motion strategy for a hexapod robot in theplane plum-blossompile environment.To speed up the training process,a deep deterministic policy gradient algorithm with a prioritized experience replay mechanism was used.Finally the policy was verified in a real environment.The results show that the planned motion strategy can make the hexapod robot move efficiently and smoothly from a starting point to a target area in aplane plum-blossom pile environment.This work lays the foundation for the precise motion planning of hexapod robots in the real discrete environment.
作者
傅汇乔
唐开强
邓归洲
王鑫鹏
陈春林
FU Huiqiao;TANG Kaiqiang;DENG Guizhou;WANG Xinpeng;CHEN Chunlin(School of Management and Engineering,Nanjing University,Nanjing 210046,China;School of Manufacturing Science and Engineering,Southwest University of Science and Technology,Mianyang 621010,China)
出处
《智能科学与技术学报》
2020年第4期361-371,共11页
Chinese Journal of Intelligent Science and Technology
基金
国家自然科学基金资助项目(No.71732003,No.62073160)
国家重点研发计划资助项目(No.2018AAA0101100)
载人航天第四批预研项目(No.030602)
关键词
六足机器人
运动规划
深度强化学习
hexapod robot
motion planning
deep reinforcement learning