期刊文献+

一种自适应强化学习算法在状态空间构建中的应用 被引量:3

Application of Adaptive Reinforcement Learning for State Space Construction
下载PDF
导出
摘要 针对模型未知以及具有连续状态的系统控制问题,提出一种基于强化学习的自适应控制策略。在Actor-Critic框架下,建立归一化径向基网络的自适应调节机制,实现未知系统状态空间的动态创建。有效克服了状态空间分割所带来的维度灾难,而且能够使得系统的结构总保持在最佳状态。通过对倒立摆控制的仿真研究验证了方法的有效性。 In order to solve the control problem for unknown model system with continuous state, an adaptive control strategy based on reinforcement learning was proposed. Under the Actor-Critic architecture, the adaptive adjustment mechanism for normalized radial basis function network was established to realize the state space construction dynamically. This approach could overcome the curse of dimensionality caused by state space division effectively and make the system structure always stay the optimal status. Simulation research for inverted pendulum control demonstrates the validity of the proposed method.
出处 《系统仿真学报》 EI CAS CSCD 北大核心 2006年第1期188-191,共4页 Journal of System Simulation
基金 中国矿业大学青年科研基金(OC4466) 校优秀创新团队"复杂系统与控制"资助
关键词 归一化径向基网络 Actor-Critic学习 状态空间构建 倒立摆 normalized RBF network Actor-Critic learning state space construction inverted pendulum
  • 相关文献

参考文献6

  • 1Barto A G,Sutton R S,Anderson C W.Neurolike adaptive elements that can solve difficult learning control problems[J].IEEE Transactions on System,Man and Cybernetics,1983,13(5):834-846.
  • 2Lin C T,Lee C G.Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems[J].IEEE Transactions on Fuzzy Systems,1994,2(1):41-63.
  • 3Samejima K,Omori T.Adaptive internal state space construction method for reinforcement learning of a real-world agent[J].Neural Network,1999,12(7-8):1143-1155.
  • 4Moody J,Darken C.Fast learning in networks of locally-tuned processing units[J].Neural Computation,1989,(1):281-294.
  • 5Kaebling L P,Littman M L,Moore A W.Reinforcement learning:a survey[J].Journal of Artificial Intelligence Research,1996,4:237-285.
  • 6Platt J C.A resource-allocating network for function interpolation[J].Neural Computation,1991,(1):213-225.

同被引文献33

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部