摘要
该文考虑的是可数状态空间有限行动空间非齐次马氏决策过程的期望总报酬准则.与以往不同的是,我们是通过扩大状态空间的方法,将非齐次的马氏决策过程转化成齐次的马氏决策过程,于是非常简洁地得到了按传统的方法所得的主要结果.
In this paper we consider the homogenization of Non-homogneous Markov deci-sion model with expected total reward criterion.denumerable state space and finite actionspaces. We translate the non-homogeneous Markov decision processes into a homegeneousone by the method of extanding state space which is different from the usual one, and thenwe easily obtain the mail results obtained by usual method. Specially, the mail results ob-tained by K. Hinderer'
出处
《数学物理学报(A辑)》
CSCD
北大核心
1997年第4期432-438,共7页
Acta Mathematica Scientia
基金
国家自然科学基金
关键词
马氏决策过程
非齐次
齐次
期望总报酬准则
Markov decision processes, Non-homogeneous, Homogenization, Optimal policies