期刊文献+

连续时间MCP在紧致行动集上的最优策略(英文) 被引量:12

Optimal Policies for a Continuous Time MCP with Compact Action Set
下载PDF
导出
摘要 文中研究了一类连续时间Markov控制过程 (CTMCP)无穷水平平均代价性能的最优控制决策问题 .文章采用无穷小生成元和性能势的基本性质 ,直接导出了平均代价模型在紧致行动集上的最优性方程及其解的存在性定理 ,提出了求解ε 最优平稳控制策略的数值迭代算法 ,并给出了这种算法的收敛性证明 .最后通过分析一个数值例子来说明这种方法的应用 . We study optimal policies for a class of continuous-time Markov control processes (CTMCPs) with infinite horizon average-cost criteria. Using the basic properties of infinitesimal generators and performance potentials, we give directly the optimality equation and establish the existence of solutions to this equation for the average-cost model on a compact action set. A fast value iteration algorithm, which leads to an Ε-optimal stationary policy, is proposed and the convergence of this algorithm is studied. Finally, we provide one numerical example to show applications of the proposed method.
出处 《自动化学报》 EI CSCD 北大核心 2003年第2期206-211,共6页 Acta Automatica Sinica
基金 NationalNaturalScienceFoundationofP .R .China (6 9974 0 37) NationalHighPerformanceComput ingFoundationofP .R .China(0 0 2 0 8)
关键词 MCP 紧致行动集 最优策略 性能势 平均代价准则 数值迭代算法 ε-最优平衡控制策略 Algorithms Iterative methods Mathematical models Optimization Performance Theorem proving
  • 相关文献

参考文献2

二级参考文献11

  • 1Cao X R 秦化淑.中国控制会议论文集[M].北京:中国科学技术出版社,1995.22-39.
  • 2Cao X R,IEEE Trans Automat Control,1997年,42卷,10期,1382页
  • 3Cao X R,中国控制会议论文集,1995年,22页
  • 4Cao X R,Realization Probabilities:the Dynamics of Queueing Systems,1994年
  • 5邓永录,随机模型及其应用,1994年
  • 6Cao Xiren,IEEE Trans Automat Control,1997年,42卷,10期,1382页
  • 7Cao Xiren,IEEE Trans Automat Control,1994年,39卷,7期,1460页
  • 8Chong E P,IEEE Trans Automat Control,1994年,37卷,7期,1440页
  • 9孙德敏,工程最优化.方法及应用,1991年,133页
  • 10Yao D D,IEEE Trans Automat Control,1989年,34卷,2期,236页

共引文献18

同被引文献58

  • 1代桂平,殷保群,李衍杰,周亚平,奚宏生.半Markov控制过程在平均准则下的优化算法[J].中国科学技术大学学报,2005,35(2):202-207. 被引量:1
  • 2TANGHao YUANJi-Bin LUYang CHENGWen-Juan.Performance Potential-based Neuro-dynamic Programming for SMDPs[J].自动化学报,2005,31(4):642-645. 被引量:10
  • 3ARAPOSTATHIS A, BORKAR V S,FERNANDEZ-GAUCHER, et al. Discrete-time controlled Markov processes with average cost criterion: a survey [J]. SIAM J of Control Optimization, 1993,31 (2): 282-344.
  • 4RAUL Montes-de-Oca. The average cost optimality equation for Markov control processes on Borel spaces [ J]. System and Control Letters, 1994,22(5): 351 - 357.
  • 5SENNOT L I. Another set of conditions for average optimality in Markov control processes [ J]. Systems and Control Letters, 1995,23(2):147- 151.
  • 6CAO X R, CHEN H F. Perturbation realization, potentials and sensitivity analysis of Markov processes [ J ]. IEEE Trans on Automatic Control, 1997,42(10): 1382 - 1393.
  • 7CAO X R. The relations among potentials, perturbation analysis, and Markov decision processes [ J ]. Discrete Event Dynamic Systems:Theory and Applications, 1998,8( 1 ): 71 - 78.
  • 8CAO X R. A unified approach to Markov decision problems and performance sensitivity analysis [ J ]. Automatica, 2000, 36 (5): 771 -774.
  • 9YIN Baoqun,ZHOU Yaping,XI Hongsheng,et al. Sensitivity formulas of performance in two-server cyclic queuing networks with phasetype distributed service times [ J]. Int Trans on Operation Research,1999,6(6) :649 - 663.
  • 10CINLAR E. Introduction to Stochastic Processes [M].Englewood Cliffs, NJ: Prentice-hall, 1975.

引证文献12

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部