
Optimization Algorithms for Semi-Markov Control Processes With Average Criteria
Abstract: This paper studies performance optimization algorithms for a class of semi-Markov control processes (SMCPs) with compact action sets under the infinite-horizon average-cost criterion. Using an equivalent Markov process, formulas for the performance potentials and the average-cost optimality equation of SMCPs are derived. A policy iteration algorithm and a value iteration algorithm are then proposed, each of which yields an optimal or suboptimal stationary policy in a finite number of iterations. Convergence of both algorithms is established without assuming that the corresponding iteration operator is an sp-contraction (a contraction in the span seminorm). A numerical example illustrates the application of the algorithms.
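Source: Journal of University of Science and Technology of China (JUSTC; 《中国科学技术大学学报》), 2005, No. 2, pp. 202-207 (6 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: Supported by the National Natural Science Foundation of China (60274012) and the Natural Science Foundation of Anhui Province (01042308).
Keywords: semi-Markov control processes; compact action set; performance potentials; policy iteration; value iteration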
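
The value iteration idea named in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes finite state and action sets (the paper treats compact action sets), a unichain model, and it uses the standard textbook data transformation of a semi-Markov decision process into a discrete-time MDP rather than the paper's equivalent Markov process and performance potentials. All names (`transform`, `smdp_value_iteration`, `costs`, `taus`, `probs`) and the toy numbers are hypothetical.

```python
def transform(costs, taus, probs, eta):
    """Map SMDP data to a discrete-time MDP with the same average cost.

    costs[i][a]: expected cost accrued during one sojourn in state i under a
    taus[i][a]:  expected sojourn time (must be > 0)
    probs[i][a]: embedded-chain transition probabilities out of state i
    eta:         step size, assumed to satisfy 0 < eta <= min sojourn time
    """
    n = len(costs)
    t_costs, t_probs = [], []
    for i in range(n):
        row_c, row_p = [], []
        for a in range(len(costs[i])):
            scale = eta / taus[i][a]
            p = [scale * probs[i][a][j] for j in range(n)]
            p[i] += 1.0 - scale  # self-loop absorbs the remaining mass
            row_c.append(costs[i][a] / taus[i][a])  # cost rate per unit time
            row_p.append(p)
        t_costs.append(row_c)
        t_probs.append(row_p)
    return t_costs, t_probs


def smdp_value_iteration(costs, taus, probs, tol=1e-9, max_iter=100000):
    """Relative value iteration on the transformed MDP (unichain assumed).

    Returns (estimate of the minimal average cost per unit time,
             greedy stationary policy as a list of action indices).
    """
    n = len(costs)
    # Half the smallest sojourn time keeps every self-loop strictly positive,
    # which makes the transformed chains aperiodic.
    eta = 0.5 * min(t for row in taus for t in row)
    c, p = transform(costs, taus, probs, eta)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v, policy = [], []
        for i in range(n):
            q = [c[i][a] + sum(p[i][a][j] * v[j] for j in range(n))
                 for a in range(len(c[i]))]
            best = min(range(len(q)), key=q.__getitem__)
            policy.append(best)
            new_v.append(q[best])
        diffs = [new_v[i] - v[i] for i in range(n)]
        lo, hi = min(diffs), max(diffs)  # bounds on the optimal average cost
        v = [x - new_v[0] for x in new_v]  # renormalize so values stay bounded
        if hi - lo < tol:
            return 0.5 * (lo + hi), policy
    raise RuntimeError("value iteration did not converge")


# Toy model with two states and two actions per state (made-up numbers).
costs = [[2.0, 1.0], [4.0, 3.0]]
taus = [[1.0, 2.0], [1.0, 0.5]]
probs = [[[0.5, 0.5], [0.2, 0.8]], [[0.6, 0.4], [0.9, 0.1]]]
g, policy = smdp_value_iteration(costs, taus, probs)
print(round(g, 6), policy)
```

For this toy model, enumerating the four stationary policies by hand gives a minimal average cost of 1.5 per unit time, attained by choosing action 1 in both states, which is what the sketch should reproduce. The stopping rule uses the span of successive differences, so it relies on the transformed chain being aperiodic; the choice of `eta` above is one simple way to ensure that.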

