摘要
针对双足机器人在非平整地面行走时容易失去运动稳定性的问题,提出一种基于一种基于价值的深度强化学习算法DQN(Deep Q-Network)的步态控制方法。首先通过机器人步态规划得到针对平整地面环境的离线步态,然后将双足机器人视为一个智能体,建立机器人环境空间、状态空间、动作空间及奖惩机制,该过程与传统控制方法相比无需复杂的动力学建模过程,最后经过多回合训练使双足机器人学会在不平整地面进行姿态调整,保证行走稳定性。在V-Rep仿真环境中进行了算法验证,双足机器人在非平整地面行走过程中,通过DQN步态调整学习算法,姿态角度波动范围在3°以内,结果表明双足机器人行走稳定性得到明显改善,实现了机器人的姿态调整行为学习,证明了该方法的有效性。
Aiming at the problem that biped robots may easily lose their motion stability when walking on uneven ground,a value-based deep reinforcement learning algorithm called Deep Q-Network(DQN)gait control method was proposed,which is an intelligent learning method of posture adjustment.Firstly,an off-line gait for a flat ground environment was obtained through the gait planning of the robot.Secondly,instead of implementing a complex dynamic model compared to traditional control methods,a bipedal robot was regarded as an agent to establish robot environment space,state space,action space and Reward-Punishment(RP)mechanism.Finally,through multiple rounds of training,the biped robot learned to adjust its posture on the uneven ground and ensures the stability of walking.The performance and effectiveness of the proposed algorithm was validated in a V-Rep simulation environment.The results demonstrate that the biped robot s lateral tile angle is less than 3°after implementing the proposed method and the walking stability is improved obviously,which achieves the robot s posture adjustment behavior learning and proves the effectiveness of the method.
作者
赵玉婷
韩宝玲
罗庆生
ZHAO Yuting;HAN Baoling;LUO Qingsheng(School of Mechanical Engineering,Beijing Institute of Technology,Beijing 100081,China;School of Mechatronical Engineering,Beijing Institute of Technology,Beijing 100081,China)
出处
《计算机应用》
CSCD
北大核心
2018年第9期2459-2463,共5页
journal of Computer Applications
基金
国家部委重点预研基金资助项目(3020020221111)~~