摘要
部分可观察马尔可夫决策规划──首达目标模型刘迪芬(湖南师范大学数学系,长沙410081)刘建庸,刘克(中国科学院应用数学研究所,北京100080)PARTIALLYOBSERVABLEMARKOVDECISIONPROGRAMMING:FIRSTPA...
In this paper, we discuss the first passage problem of Markov decision programming with incomplete state information. It was` shown that the problem can be transformed to Markov decision programming with complete state information. We discuss the existence conditions for existence of an optimal stationary policy. Finally, we discuss the algorithm.
出处
《应用数学学报》
CSCD
北大核心
1994年第1期44-58,共15页
Acta Mathematicae Applicatae Sinica
基金
国家青年科学基金