期刊文献+

一种有限时段Markov决策过程的强化学习算法 被引量:4

An algorithm of reinforcement learning for finite-horizon Markov decision processes
下载PDF
导出
摘要 研究有限时段非平稳的 Markov决策过程的强化学习算法。通过引入一个人工吸收状态 ,把有限时段问题变为无限时段问题 ,从而可利用通常的强化学习方法来求解。在文献 [3]提出的算法思想基础上 ,提出了一种新的有限时段非平稳的 Markov决策过程的强化学习算法 。 In this paper, reinforcement learning algorithms for finite horizon non stationary Markov decision processes are studied By introducing an artificialabsorbingstate, the finite horizon problem transforms to an infinite horizon one, so that a normal method of reinforcement learning algorithm can be used to solve the finite-horizon problem A new reinforcement learning algorithm of the finite horizon non stationary Markov decision process is put forward based on the algorithm thought presented by reference book\ And an experiment in inventory control problem with non complete model has been done
出处 《广西工学院学报》 CAS 2003年第1期1-4,共4页 Journal of Guangxi University of Technology
关键词 强化学习 有限时段 MARKOV决策过程 无完全模型 库存控制 机器学习 非平稳 reinforcementlearning Markov decision process non stationary inventory control
  • 相关文献

参考文献3

  • 1李春贵,林海涛,刘永信.多步截断SARSA强化学习算法[J].广西工学院学报,2002,13(1):1-4. 被引量:5
  • 2Sutton R S,Barto A G.An Introduction to reinforcement learning[M].MIT Press,Cambridge,MA,1998.
  • 3Puterman M L.Markov Decision Problems[M].Wiley,NY.1994.

二级参考文献6

  • 1[1]Watkins C.J. C. H. Learning from delayed rewards [D] . Cambridge Univ. , England. 1989.
  • 2[2]Sutton R.S.Learning to predict by the method of temporal difference [J] .Machine Learning , 1988, (3): 9-44.
  • 3[3]Peng J.& Williams R.Incremental multi-step Q-learning [J] .Machine Learning, 1996, (22): 283-290.
  • 4[4]Rummery G.A & Niranjan M.On-line Q-learning using connectionist systems [R] .CUED/F-INFENG/TR 166,Cambridge University, UK.1994.
  • 5[5]Bertsekas D.P.Dynamic programming: deterministic and stochastic models [M] .Prentice Hall, USA.1987.
  • 6[6]Sutton R.S.& Barto A.G.An introduction to reinforcement learning [M] .The MIT Press, USA.1998.

共引文献4

同被引文献37

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部