摘要
本文考虑的是可数状态空间不完全信息的非平稳MDP平均模型,借助于模型的转化,建立了不完全信息的非平稳MDP平均模型的最优方程,并进一步给出了最优方程的解及ε(>,0)——最优策略存在的充分条件。
In this paper, We consider the denumable state space non- stationary MDP average model with incomplete information, By the translation of the model, We build up a optimal equation (OE) for the MDP average Model with incomplete Information, and also give the condition under which the solution of OE and the ε-optimal policies must exist.
关键词
马氏决策规划
不完全信息
平均目标
平均模型
Markov Decision Programming (MDP) Incomplete Information Non -stationary Average criteria.