Abstract
Building on the theory of Markov decision-making vector process models, and using the newly introduced definitions of decision-making vector and consistency degree, this paper proposes a finite-stage expected total reward criterion and the corresponding optimality equation for Markov decision-making vector processes, and proves the existence of a solution to the optimality equation.
Source
Mathematical Theory and Applications (《数学理论与应用》)
2011, No. 4, pp. 7-13 (7 pages)
Funding
Supported by the Qiongzhou University (琼州学院) Youth Fund
Grant No. QYQN201126
Keywords
Markov decision-making vector process model
Expected total reward criterion
Optimality equation
Existence