期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
CONTINUOUS TIME MARKOV DECISION PROGRAMMING WITH AVERAGE REWARD CRITERION AND UNBOUNDED REWARD RATE
1
作者 郑少慧 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 1991年第1期6-16,共11页
This paper deals with the continuous time Markov decision programming (briefly CTMDP) withunbounded reward rate.The economic criterion is the long-run average reward. To the models withcountable state space,and compac... This paper deals with the continuous time Markov decision programming (briefly CTMDP) withunbounded reward rate.The economic criterion is the long-run average reward. To the models withcountable state space,and compact metric action sets,we present a set of sufficient conditions to ensurethe existence of the stationary optimal policies. 展开更多
关键词 continuous TIME MARKOV decision PROGRAMMING WITH AVERAGE REWARD CRITERION AND UNBOUNDED REWARD RATE CTMDP
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部