摘要
本文定义了平均多目标马氏决策规划,讨论了它的平稳策略优势及其局限性,并在平稳策略集上给出了策略改进迭代求解法.
Average Multi-objective Markov Decision Programming is defined in this paper.The dominant positions and the limitations of the stationary strategies are discussed and an iterative algorithm is given to solve the programming on the stationary strategy set.
出处
《系统工程》
CSCD
1989年第1期25-32,共8页
Systems Engineering