Abstract
This paper uses the strategy improvement iteration method to give a solution method for DMOMDP, in which the discount factors may differ, over the class of stationary strategies. It also proves that a strategy is optimal if and only if it is an efficient fixed point of the optimal equation.
Source
Journal of Guilin Institute of Electronic Technology (《桂林电子工业学院学报》)
1989, No. 2, pp. 84-89 (6 pages)
Keywords
optimal strategy
optimal equation
fixed point