摘要
在一种新的准则概率阈值准则下讨论马尔可夫决策的最优解的算法问题.在该准则下,采用基于未来阈值的方法,求解马尔可夫最优策略.
The arithmetic problem of Markov optimum solution under a new principle named probability threshold value principle is discussed.With this principle,the Markov optimum policy is solved based on the future threshold value.
出处
《吉林化工学院学报》
CAS
2004年第2期97-99,共3页
Journal of Jilin Institute of Chemical Technology