期刊文献+

A new adaptive state space construction method for the mobile robot navigation 被引量:1

A new adaptive state space construction method for the mobile robot navigation
下载PDF
导出
摘要 In order to solve the combinative explosion problems in a continuous and high dimensional statespace,a function approximation approach is usually used to represent the state space.The normalized ra-dial basis function(NRBF)was adopted as the local function approximator and a kind of adaptive statespace construction strategy based on the NRBF(ASC-NRBF)was proposed,which enables the system toallocate appropriate number and size of the basis functions automatically.Combined with the reinforce-ment learning method,the proposed ASC-NRBF method was applied to the robot navigation problem.Simulation results illustrate the performance of the proposed method. In order to solve the combinative explosion problems in a continuous and high dimensional state space, a function approximation approach is usually used to represent the state space. The normalized radial basis function (NRBF) was adopted as the local function approximator and a kind of adaptive state space construction strategy based on the NRBF (ASC-NRBF) was proposed, which enables the system to allocate appropriate number and size of the basis functions automatically. Combined with the reinforce- ment learning method, the proposed ASC-NRBF method was applied to the robot navigation problem. Simulation results illustrate the performance of the proposed method.
出处 《High Technology Letters》 EI CAS 2008年第2期182-186,共5页 高技术通讯(英文版)
基金 the National Natural Science Foundation of China(No50305021)
关键词 reinforcement learning normalized radial basis function (NRBF) function approximation robot navigation 正规化辐射基本功能 自适应空间结构 移动机器人 导航技术
  • 相关文献

参考文献10

  • 1Velagic J,Lacevic B,Penmicic B.A 3-level autonomous mobile robot navigation system designed by using reasoning/search approaches[].Robotics and Autonomous Systems.2006
  • 2Awad H A,Al-zorkany M A.Mobile robot navigation using local model networks[].International conference on computational intelhgence.2004
  • 3Barto AG,Sutton RS,Anderson CW.Neuronlike adaptive elements that can solve difficult learning control problems[].IEEE Transactions on Systems Man and Cybernetics.1983
  • 4H.Hagras,V.Callaghn,M.Colley.Learning and adaption of an intelligent mobile robot navigator operating in unstructured environment based on a novel online fuzzy-genetic system[].Fuzzy Sets and Systems.2004
  • 5A.E.Gaweda,M.K.Muezzinoglu,G.R.Aronoff.Individualization of pharmacological anemia management using reinforcement learning[].Neural Networks.2005
  • 6A.J.Ijspeert,M.Murate,N.Wakamiya.Biologically Inspired Approaches to Advanced Information Technology[].BioADIT.
  • 7K.Macek,I.Petrovic,N.Peric.A reinforcement learning approach to obstacle avoidance of mobile robots[].Proceedings of the th IEEE International Workshop on Advanced Motion Control.2002
  • 8W Ilg,K Berns.A learning architecture based on reinforcement learning foradaptive controlof the walking machine LAURON[].Robotics and Autonomous Systems.1995
  • 9J.Murata,M.Suzuki,K.Hirasawa.Networks with input gates for situation-dependent input selection in reinforcement learning[].Proceedings of the international joint conference on neural networks.2002
  • 10J.Morimoto,K.Doya.Acquisition of stand up behavior by real robot using reinforcement learning[].Proceedings of international conference on machine learning.2000

同被引文献10

  • 1Murata Y, Hasegawa M. The Architecture and a Business Model for the Open Heterogeneous Mobile Network [ J]. IEEE Communications Magazine, 2009, 47(5): 95-101.
  • 2Liu Yutao, Xu Guisen. A Novel Spectrum Allocation Mechanism Based on Graph Coloring and Bidding Theory [ C]// International Conference on CINC. Wuhan: IEEE Press, 2009: 155-158.
  • 3Niyato D, Hossain E. Dynamic Spectrum Access in IEEE 802. 22-based Cognitive Wireless Networks: a Game Theoretic Model for Competitive Spectrum Bidding and Pricing [ J]. IEEE Wireless Communications, 2009, 16(2): 16-23.
  • 4Versele C, Deblecker O. Multiobjective Optimal Design of High Frequency Transformers Using Genetic Algorithm [ C]//13th European Conference on Power Electronics and Applications. Barcelona: IEEE Press, 2009: 1-10.
  • 5Tsamis D, Alpcan T. Game Theoretic Rate Control for Mobile Devices [ C]//International Conference on Game Theory for Networks. Istanbul: IEEE Press, 2009: 646-652.
  • 6Williams R J. Simple Statistical Gradient-following Algorithms for Connectionist Reinforcement Learning [ J]. Machine Learning, 1992, 8(3): 229-256.
  • 7Bugmann G. Normalized Gaussian Radial Basis Function Networks [J]. Neurocomputing, 1995, 20( 1): 97-110.
  • 8Sutton R S. Temporal Credit Assignment in Reinforcement Learning [ D]. Amherst: University of Massachusetts, 1984.
  • 9Abhayawardhana V S, Wassell I J. Comparison of Empirical Propagation Path Loss Models for Fixed Wireless Access Systems [ C]//Proc of the 61st IEEE VTC. Stockholm: IEEE Press, 2005: 73-77.
  • 10邱晶,周正.认知无线电网络中的分布式动态频谱共享[J].北京邮电大学学报,2009,32(1):69-72. 被引量:18

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部