A new adaptive state space construction method for the mobile robot navigation

Abstract: To address the combinatorial explosion that arises in a continuous, high-dimensional state space, a function approximation approach is usually used to represent the state space. The normalized radial basis function (NRBF) was adopted as the local function approximator, and an adaptive state space construction strategy based on the NRBF (ASC-NRBF) was proposed, which enables the system to allocate an appropriate number and size of basis functions automatically. Combined with reinforcement learning, the proposed ASC-NRBF method was applied to the robot navigation problem. Simulation results illustrate the performance of the proposed method.
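The abstract does not give the allocation rule used by ASC-NRBF, so the following is only a minimal Python sketch of the general idea: normalized Gaussian basis functions whose number grows on demand. The class name `NRBFApproximator`, the activation threshold, the fixed width, and the gradient-style weight update are all assumptions made for illustration, not the authors' method. In particular, the sketch adapts only the number of basis functions, whereas the paper also adapts their size.

```python
import numpy as np


class NRBFApproximator:
    """Illustrative normalized Gaussian RBF approximator (hypothetical, not ASC-NRBF itself)."""

    def __init__(self, dim, width=0.5, activation_threshold=0.4, learning_rate=0.1):
        self.centers = np.empty((0, dim))   # one row per basis function
        self.widths = np.empty(0)           # one width per basis function
        self.weights = np.empty(0)          # one output weight per basis function
        self.width = width                  # assumed fixed width for new bases
        self.activation_threshold = activation_threshold
        self.learning_rate = learning_rate

    def _raw(self, x):
        # Unnormalized Gaussian activations of every basis function at x.
        d2 = np.sum((self.centers - x) ** 2, axis=1)
        return np.exp(-d2 / (2.0 * self.widths ** 2))

    def features(self, x):
        # Normalized activations: they sum to 1 whenever any basis responds.
        a = self._raw(np.asarray(x, dtype=float))
        total = a.sum()
        return a / total if total > 0 else a

    def predict(self, x):
        if len(self.weights) == 0:
            return 0.0
        return float(self.features(x) @ self.weights)

    def update(self, x, target):
        x = np.asarray(x, dtype=float)
        # Assumed allocation rule: add a basis centered at x when no
        # existing basis responds strongly enough to this state.
        if len(self.weights) == 0 or self._raw(x).max() < self.activation_threshold:
            self.centers = np.vstack([self.centers, x])
            self.widths = np.append(self.widths, self.width)
            self.weights = np.append(self.weights, 0.0)
        # Move the weights toward the target in proportion to each basis's share.
        error = target - self.predict(x)
        self.weights += self.learning_rate * error * self.features(x)
        return error


approx = NRBFApproximator(dim=2)
for _ in range(200):
    approx.update([0.0, 0.0], 1.0)
approx.update([5.0, 5.0], -1.0)   # far from the first center, so a second basis is allocated
print(len(approx.weights), approx.predict([0.0, 0.0]))
```

In a reinforcement learning setting, `target` would typically be a temporal-difference estimate of the state value rather than a fixed number; that coupling is left out here to keep the sketch short.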

Authors: Huang Bingqiang (黄炳强), Cao Guangyi, Fei Yanqiong, Li Jianhua

Affiliations: Department of Automation; Institute of Robotics Research; Department of Computer Science

Source: High Technology Letters (高技术通讯, English edition), indexed in EI and CAS, 2008, No. 2, pp. 182-186 (5 pages)

Funding: the National Natural Science Foundation of China (No. 50305021)

Keywords: reinforcement learning; normalized radial basis function (NRBF); function approximation; robot navigation; adaptive space structure; mobile robot navigation technology

Classification: TP242 [Automation and Computer Technology: Detection Technology and Automatic Devices]
