Variance minimization for continuous-time Markov decision processes: two approaches 被引量：1

Variance minimization for continuous-time Markov decision processes: two approaches

下载PDF

导出

摘要 This paper studies the limit average variance criterion for continuous-time Markov decision processes in Polish spaces. Based on two approaches, this paper proves not only the existence of solutions to the variance minimization optimality equation and the existence of a variance minimal policy that is canonical, but also the existence of solutions to the two variance minimization optimality inequalities and the existence of a variance minimal policy which may not be canonical. An example is given to illustrate all of our conditions. This paper studies the limit average variance criterion for continuous-time Markov decision processes in Polish spaces. Based on two approaches, this paper proves not only the existence of solutions to the variance minimization optimality equation and the existence of a variance minimal policy that is canonical, but also the existence of solutions to the two variance minimization optimality inequalities and the existence of a variance minimal policy which may not be canonical. An example is given to illustrate all of our conditions.

作者 ZHU Quan-xin

机构地区 Department of Mathematics

出处《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2010年第4期400-410,共11页 高校应用数学学报（英文版）（B辑）

基金 supported by the National Natural Science Foundation of China(10801056) the Natural Science Foundation of Ningbo(2010A610094)

关键词 Continuous-time Markov decision process Polish space variance minimization optimality equation optimality inequality. Continuous-time Markov decision process, Polish space, variance minimization, optimality equation, optimality inequality.

分类号 O211.62 [理学—概率论与数理统计] TP273.2 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献17

1R N Bhattacharya.On the functional central limit theorem and the law of the iterated logarithm for Markov processes,Z Wahrscheinlichkeit,1982,60:185-201.
2E A Feinberg.Continuous-time jump Markov decision processes:A discrete-event approach,Math Oper Res,2004,29:492-524.
3X P Guo.Continuous-time Markov decision processes with discounted rewards:The case of Polish spaces,Math Oper Res,2007,32:73-87.
4X P Guo,U Rieder.Average optimality for continuous-time Markov decision processes in Polish spaces,Ann Appl Probab,2006,16:730-756.
5OHernández-Lerma,J B Lasserre.Further Topics on Discrete-Time Markov Control Processes,Springer,1999.
6M Kurano.Markov decision processes with a minimum-variance criterion,J Math Anal Appl,1987,123:572-583.
7R L Miller.Finite state continuous-time Markov decision processes with an infinite planning horizon,J Math Anal Appl,1968,22:522-569.
8T Prieto-Rumeau,O Hernández-Lerma.Bias optimality for continuous-time controlled Markov chains,SIAM J Control Optim,2006,45:51-73.
9T Prieto-Rumeau,O Hernández-Lerma.Variance minimization and the overtaking optimality approach to continuous-time controlled Markov chains,Math Methods Oper Res,2009,70:527-540.
10M L Puterman.Markov Decision Process,Wiley,1994.

引证文献1

1Quan-xin ZHU.Average Sample-path Optimality for Continuous-time Markov Decision Processes in Polish Spaces[J].Acta Mathematicae Applicatae Sinica,2011,27(4):613-624.

1袁琴,俞芳婷,王淼坤.第二类完全椭圆积分的平均值不等式[J].湖州师范学院学报,2017,39(2):12-16. 被引量：3
2胡奇英,刘建庸.马氏决策过程平均准则最优不等式综述[J].运筹学杂志,1996,15(2):1-9.
3ZHU Quanxin,GUO Xianping.STRONG N-DISCOUNT AND FINITE-HORIZON OPTIMALITY FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES[J].Journal of Systems Science & Complexity,2014,27(5):1045-1063.
4陶永 Wang Tianmiao Wei Hongxing Chen Diansheng.A navigation method based on POMDP for smart wheelchair in uncertain environments[J].High Technology Letters,2010,16(2):164-170.
5李枚毅,蔡自兴,石跃祥,孙国荣,蒙祖强.进化计算的一种变异概率自适应方法[J].计算机科学,2002,29(z1):144-145.
6Xianping GUO,Lanlan ZHANG.TOTAL REWARD CRITERIA FOR UNCONSTRAINED/CONSTRAINED CONTINUOUS-TIME MARKOV DECISION PROCESSES[J].Journal of Systems Science & Complexity,2011,24(3):491-505.
7吴军,徐昕,王健,贺汉根.面向多机器人系统的增强学习研究进展综述[J].控制与决策,2011,26(11):1601-1610. 被引量：22
8Yong-hui Huang Xian-ping Guo.First Passage Models for Denumerable Semi-Markov Decision Processes with Nonnegative Discounted Costs[J].Acta Mathematicae Applicatae Sinica,2011,27(2):177-190. 被引量：2
9刘培德.UMD空间及其应用[J].应用泛函分析学报,2002,4(3):280-288. 被引量：1
10Quan-xin ZHU.Average Sample-path Optimality for Continuous-time Markov Decision Processes in Polish Spaces[J].Acta Mathematicae Applicatae Sinica,2011,27(4):613-624.

Applied Mathematics(A Journal of Chinese Universities)

2010年第4期

浏览历史

内容加载中请稍等...

Variance minimization for continuous-time Markov decision processes: two approaches 被引量：1

参考文献17

引证文献1

相关作者

相关机构

相关主题

浏览历史