期刊文献+

Variance minimization for continuous-time Markov decision processes: two approaches 被引量:1

Variance minimization for continuous-time Markov decision processes: two approaches
下载PDF
导出
摘要 This paper studies the limit average variance criterion for continuous-time Markov decision processes in Polish spaces. Based on two approaches, this paper proves not only the existence of solutions to the variance minimization optimality equation and the existence of a variance minimal policy that is canonical, but also the existence of solutions to the two variance minimization optimality inequalities and the existence of a variance minimal policy which may not be canonical. An example is given to illustrate all of our conditions. This paper studies the limit average variance criterion for continuous-time Markov decision processes in Polish spaces. Based on two approaches, this paper proves not only the existence of solutions to the variance minimization optimality equation and the existence of a variance minimal policy that is canonical, but also the existence of solutions to the two variance minimization optimality inequalities and the existence of a variance minimal policy which may not be canonical. An example is given to illustrate all of our conditions.
作者 ZHU Quan-xin
出处 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2010年第4期400-410,共11页 高校应用数学学报(英文版)(B辑)
基金 supported by the National Natural Science Foundation of China(10801056) the Natural Science Foundation of Ningbo(2010A610094)
关键词 Continuous-time Markov decision process Polish space variance minimization optimality equation optimality inequality. Continuous-time Markov decision process, Polish space, variance minimization, optimality equation, optimality inequality.
  • 相关文献

参考文献17

  • 1R N Bhattacharya.On the functional central limit theorem and the law of the iterated logarithm for Markov processes,Z Wahrscheinlichkeit,1982,60:185-201.
  • 2E A Feinberg.Continuous-time jump Markov decision processes:A discrete-event approach,Math Oper Res,2004,29:492-524.
  • 3X P Guo.Continuous-time Markov decision processes with discounted rewards:The case of Polish spaces,Math Oper Res,2007,32:73-87.
  • 4X P Guo,U Rieder.Average optimality for continuous-time Markov decision processes in Polish spaces,Ann Appl Probab,2006,16:730-756.
  • 5OHernández-Lerma,J B Lasserre.Further Topics on Discrete-Time Markov Control Processes,Springer,1999.
  • 6M Kurano.Markov decision processes with a minimum-variance criterion,J Math Anal Appl,1987,123:572-583.
  • 7R L Miller.Finite state continuous-time Markov decision processes with an infinite planning horizon,J Math Anal Appl,1968,22:522-569.
  • 8T Prieto-Rumeau,O Hernández-Lerma.Bias optimality for continuous-time controlled Markov chains,SIAM J Control Optim,2006,45:51-73.
  • 9T Prieto-Rumeau,O Hernández-Lerma.Variance minimization and the overtaking optimality approach to continuous-time controlled Markov chains,Math Methods Oper Res,2009,70:527-540.
  • 10M L Puterman.Markov Decision Process,Wiley,1994.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部