期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
CONSTRAINED DENUMERABLE STATE NON-STATIONARY MDPs WITH EXPECTED TOTAL REWARD CRITERION
1
作者 郭先平 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2000年第2期205-212,共8页
In this paper, we consider constrained denumerable state non-stationary Markov decision processes (MDPs, for short) with expected total reward criterion. By the mechanics of intro- ducing Lagrange multiplier and using... In this paper, we consider constrained denumerable state non-stationary Markov decision processes (MDPs, for short) with expected total reward criterion. By the mechanics of intro- ducing Lagrange multiplier and using the methods of probability and analytics, we prove the existence of constrained optimal policies. Moreover, we prove that a constrained optimal policy may be a Markov policy, or be a randomized Markov policy that randomizes between two Markov policies, that differ in only one state. 展开更多
关键词 Non-stationary MDPs expected total reward criterion constrained optimal policies
全文增补中
上一页 1 下一页 到第
使用帮助 返回顶部