Reinforcement learning for a passive dynamic walking robot
Abstract To study humanlike, energy-efficient biped walking, a quasi-passive biped robot was built and a dynamics simulation system for it was implemented. The robot is driven by MACCEPA (mechanically adjustable compliance and controllable equilibrium position actuator) compliant actuators. A walking control method based on reinforcement learning is proposed. The method first uses a gait-model-based Q-learning algorithm to learn a stable walking gait and its control policy in an ideal environment. This gait and policy then serve as the reference gait and reference control policy for a fuzzy advantage learning method, which learns the advantage-value parameters of a fuzzy network online so that the robot can walk on uneven floors. Simulation results show that the robot achieves stable dynamic walking when the learned results are used to control the actions of the compliant actuators at walking-phase transitions.
Source Journal of Tsinghua University (Science and Technology), 2008, No. 1, pp. 92-96 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Keywords robots; biped robots; passive dynamic walking; reinforcement learning
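
The first stage described in the abstract relies on Q-learning. The abstract does not give the paper's state, action, or reward definitions, so the following is only a minimal sketch of the standard tabular Q-learning update on a hypothetical five-state chain. The names `q_learning` and `toy_step`, the toy environment, and all parameter values are assumptions for illustration and are not taken from the paper.

```python
import random


def q_learning(n_states, n_actions, step, episodes=500, alpha=0.1,
               gamma=0.95, epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy.

    `step(state, action)` must return (next_state, reward, done).
    Every episode starts in state 0 (an assumption of this sketch).
    """
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                a = rng.randrange(n_actions)  # explore
            else:
                best = max(q[s])
                # exploit, breaking ties between equal values at random
                a = rng.choice([i for i, v in enumerate(q[s]) if v == best])
            s_next, r, done = step(s, a)
            # Standard Q-learning target: r + gamma * max_a' Q(s', a')
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q


def toy_step(s, a):
    """Hypothetical chain: action 1 moves right, action 0 moves left.

    Reaching state 4 ends the episode with reward 1; all other
    transitions give reward 0. This stands in for a real gait model.
    """
    s_next = min(s + 1, 4) if a == 1 else max(s - 1, 0)
    done = s_next == 4
    return s_next, (1.0 if done else 0.0), done


q_table = q_learning(5, 2, toy_step)
# Greedy action in each non-terminal state after training
greedy = [max(range(2), key=lambda a: q_table[s][a]) for s in range(4)]
print(greedy)
```

In the paper's setting, the learned gait and policy would then act as the reference for the second, fuzzy advantage learning stage. That stage is not sketched here because the abstract does not describe its update rule.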
