Multiple Action Sequence Learning and Automatic Generation for a Humanoid Robot Using RNNPB and Reinforcement Learning

Abstract: This paper proposes a method for learning and generating multiple action sequences of a humanoid robot. First, all basic action sequences, also called primitive behaviors, are learned by a recurrent neural network with parametric bias (RNNPB), and the values of the internal parametric bias (PB) nodes, which determine which primitive behavior the network outputs, are obtained. The RNN is trained with the back propagation through time (BPTT) method. Then, to generate the learned behaviors, or a more complex behavior that combines the primitive behaviors, a reinforcement learning algorithm, Q-learning (QL), is adopted to determine which PB value is suitable for the generation. Finally, the effectiveness of the proposed method was confirmed by experiments with a real humanoid robot.
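The PB-selection step described in the abstract can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' implementation. It assumes that each learned PB value is represented by an integer index, that the state is simply the position in the sequence being composed, and that the reward is 1 when the chosen primitive matches a known target sequence. The names `train_pb_selector` and `greedy_sequence`, the reward design, and all hyperparameters are hypothetical.

```python
import random


def train_pb_selector(target, n_primitives, episodes=2000,
                      alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning over (step, PB index) pairs.

    target: list of primitive indices the composed behavior should follow
            (an assumed toy reward signal, not taken from the paper).
    """
    rng = random.Random(seed)
    n_steps = len(target)
    # q[s][a]: estimated value of choosing PB index a at step s
    q = [[0.0] * n_primitives for _ in range(n_steps)]
    for _ in range(episodes):
        for s in range(n_steps):
            # epsilon-greedy choice of which PB value to apply at this step
            if rng.random() < epsilon:
                a = rng.randrange(n_primitives)
            else:
                a = max(range(n_primitives), key=lambda i: q[s][i])
            # toy reward: 1 if the selected primitive is the desired one
            r = 1.0 if a == target[s] else 0.0
            # the episode always advances to the next step; terminal value is 0
            next_best = max(q[s + 1]) if s + 1 < n_steps else 0.0
            # standard Q-learning update
            q[s][a] += alpha * (r + gamma * next_best - q[s][a])
    return q


def greedy_sequence(q):
    """Read out the PB index with the highest Q value at each step."""
    return [max(range(len(row)), key=lambda i: row[i]) for row in q]
```

In a real system, each selected index would be mapped to the corresponding PB vector and fed to the trained RNNPB, which would then generate the joint trajectory for that primitive behavior.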

Authors: Takashi Kuremoto, Koichi Hashiguchi, Keita Morisaki, Shun Watanabe, Kunikazu Kobayashi, Shingo Mabu, Masanao Obayashi

Affiliations: Graduate School of Science and Engineering; School of Information Science and Technology

Source: Journal of Software Engineering and Applications, 2012, Issue 12, pp. 128-133 (6 pages)

Keywords: RNNPB; humanoid robot; BPTT; reinforcement learning; multiple action sequences

Classification code: R73 [Medicine and Health: Oncology]
