期刊文献+

马氏决策向量过程模型初步研究 被引量:4

Preliminary Study on Markov Decision-making Vector Processes
下载PDF
导出
摘要 在传统马氏单元决策过程(MDP)模型中引入多元行动来确定系统的状态转移概率,通过运用传统MDP的基本理论以及结合多元行动集、决策向量、相合度等新定义,提出了马氏向量决策过程模型. This paper studies the multivariate actions to define the state-transition probability during the traditional model of MDP.By applying the Markov decision processes theory and the new definition of multivariate actions set,decision-making vector,consistent degree ETC,the new model of Markov decision-making vector processes is introduced.
出处 《河南师范大学学报(自然科学版)》 CAS CSCD 北大核心 2010年第5期38-40,共3页 Journal of Henan Normal University(Natural Science Edition)
基金 国家自然科学基金(10801056)
关键词 多元行动 决策向量 相合度 马氏决策向量过程 multivariate actions decision-making vector consistent degree Markov decision-making vector processes
  • 相关文献

参考文献4

  • 1Shapley L.Stochastic games[J].Proc Nat Acad Sci,1953(3):1095-1100.
  • 2Howard R.Dynamic programming and Markov decision processes[M].Cambrige:MIT Press,1960:66-103.
  • 3胡奇英.一般化马氏决策规划的现状与展望[J].运筹学杂志,1992,11(2):21-29. 被引量:7
  • 4胡奇英,刘建庸.马氏决策过程引论[M].西安:西安电子科技大学,2000:1-2.

共引文献6

同被引文献12

  • 1Ballman R and Salle J P La. On non -zero sum games and stochastic process [ M ]. RM -212, RAND Corp, Santa Monica, CA, 1949:26 - 46.
  • 2Howard R. Dynamic programming and Markov decision processes [ M ]. Cambrige, MS: MIT Press, 1960:66 - 103.
  • 3Ballman R and SaUe J P La. On non - zero sum games and stochastic process [ M J. RM - 212, RAND Corp, Santa Monica, CA, 1949:26 - 46.
  • 4Howard R. Dynamic programming and Markov decision processes [ M ]. Cambrige, MS:MIT Press, 1960:66 - 103.
  • 5Raftery A E.A model for high- order Markov chains[J]. Journal of the Royal Statistical Society.SeriesB (Methodological), 1985,47(3) : 528- 539.
  • 6Raftery A, Tavare S. Estimation and modelling repeated patterns in high order Markov chains with the mix- ture transition distribution model[J]. Applied Statistics, 1994,43(1)~ 179-199.
  • 7Berchtold A, Raftery A E. The mix- ture transition distribution model for high-order Markov chains and non- Gaussian time series[J]. Statistical Science, 2002,17(3) : 328-356.
  • 8Ching W K, Fung E S,Ng M K. A multivariate Markov chain model for categorical data sequences and its ap- plications in demand predictions[J]. IMA Journal of Management Mathematics, 2002,13(3):187-199.
  • 9Ching W K, Ng M K, Fung E S. Higher-order multivariate Markov chains and their applications[J].Linear Algebra and its Applications,2008,428(2):492-507.
  • 10陈杰,刘再明,邢灵博.基于马氏决策向量过程模型的有限阶段期望总报酬准则及其最优方程[J].数学理论与应用,2011,31(4):7-13. 被引量:2

引证文献4

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部