THE BOREL STATE SPACE SEMI-MARKOV DECISION PROCESS WITH EXPECTED TOTAL REWARDS IN A SEMI-MARKOV ENVIRONMENT

Abstract: This paper investigates the Borel state space semi-Markov decision process (SMDP) with the criterion of expected total rewards in a semi-Markov environment. The model describes a system that behaves like an SMDP, except that it is influenced by its environment, which is modeled by a semi-Markov process. Under the condition that the rewards are all positive or all negative, we transform the SMDP in a semi-Markov environment into an equivalent discrete-time Markov decision process, and obtain the optimality equation and some of its properties.

Authors: XU Chen (School of Science, Shenzhen University, Shenzhen 518060, China); HU Qiying (School of Economy and Management, Xidian University, Xi’an 710071, China)

Source: Systems Science and Mathematical Sciences (indexed in SCIE, EI, CSCD), 1999, No. 1, pp. 82-91 (10 pages)

Keywords: semi-Markov decision processes, semi-Markov environment, expected total rewards, Borel state space

Classification: O211 [Science: Probability Theory and Mathematical Statistics]
