Optimal stopping time on discounted semi-Markov processes

导出

摘要 This paper attempts to study the optimal stopping time for semi- Markov processes (SMPs) under the discount optimization criteria with unbounded cost rates. In our work, we introduce an explicit construction of the equivalent semi-Markov decision processes (SMDPs). The equivalence is embodied in the expected discounted cost functions of SMPs and SMDPs, that is, every stopping time of SMPs can induce a policy of SMDPs such that the value functions are equal, and vice versa. The existence of the optimal stopping time of SMPs is proved by this equivalence relation. Next, we give the optimality equation of the value function and develop an effective iterative algorithm for computing it. Moreover, we show that the optimal and ε-optimal stopping time can be characterized by the hitting time of the special sets. Finally, to illustrate the validity of our results, an example of a maintenance system is presented in the end.

作者 Fang CHEN Xianping GUO Zhong-Wei LIAO

机构地区 School of Mathematics College of Education for the Future

出处《Frontiers of Mathematics in China》 SCIE CSCD 2021年第2期303-324,共22页 中国高等学校学术文摘·数学（英文）

基金 This work was supported in part by the National Natural Science Foundation of China(Grant Nos.11931018,61773411,11701588,11961005) the Guangdong Basic and Applied Basic Research Foundation(Grant No.2020B1515310021).

关键词 Optimal stopping time semi-Markov processes(SMPs) value function semi-Markov decision processes(SMDPs) optimal policy iterative lgorithm

分类号 O17 [理学—基础数学]

引文网络
相关文献

1张文萍,陈桂芬,刘可欣.车载云计算系统中的资源管理优化研究[J].长春理工大学学报（自然科学版）,2020,43(6):102-112. 被引量：1
2Anatoliy A. Pogorui,Ramón M. Rodríguez-Dagnino.Stationary Distribution of Random Motion with Delay in Reflecting Boundaries[J].Applied Mathematics,2010,1(1):24-28.
3Jaime Eduardo Martínez-Sánchez.Asymptotic Evaluations of the Stability Index for a Markov Control Process with the Expected Total Discounted Reward Criterion[J].American Journal of Operations Research,2021,11(1):62-85.
4Yifei WANG,Mohammad SHAHIDEHPOUR,Chuangxin GUO.Applications of survival functions to continuous semi-Markov processes for measuring reliability of power transformers[J].Journal of Modern Power Systems and Clean Energy,2017,5(6):959-969. 被引量：2
5Weicheng Xu,Tian Zhou,Di Peng.Endogenous Explanation for Random Fluctuation of Stock Price and Its Application: Based on the View of Repeated Game with Asymmetric Information[J].Journal of Applied Mathematics and Physics,2021,9(4):694-706.
6REN Yuxue,WEN Chengfeng,ZHEN Shengxian,LEI Na,LUO Feng,GU David Xianfeng.Characteristic Class of Isotopy for Surfaces[J].Journal of Systems Science & Complexity,2020,33(6):2139-2156.
7具身认知视角下研学旅行与学校课程的衔接[J].教育探究,2020,15(6):60-60.
8Dingqian SUN.The Convergence Rate from Discrete to Continuous Optimal Investment Stopping Problem[J].Chinese Annals of Mathematics,Series B,2021,42(2):259-280.
9Xiang Chu,Zhong Wen,Jian Chen.Optimal Grading Policies in the Online Acquisition of Used Products[J].Journal of Systems Science and Systems Engineering,2021,30(1):29-43.
10Wei Yan,Jianhua Huang,Boling Guo.The Cauchy problem for the stochastic generalized Benjamin-Ono equation[J].Science China Mathematics,2021,64(2):331-350.

Frontiers of Mathematics in China

2021年第2期

浏览历史

内容加载中请稍等...

Optimal stopping time on discounted semi-Markov processes

相关作者

相关机构

相关主题

浏览历史