期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
基于马尔可夫决策过程的动态火力目标匹配 被引量:1
1
作者 王宏磊 《电脑知识与技术(过刊)》 2011年第4X期2655-2656,共2页
该文在对战场实际情况合理假设的基础上,建立了基于马尔可夫决策理论的动态火力目标匹配模型,井通过具体例子进行计算算和分析,证明了该力一法的利学性和可行性。对指挥系统的目标处理子系统及战场目标综合处理的辅助决策系统开发提供... 该文在对战场实际情况合理假设的基础上,建立了基于马尔可夫决策理论的动态火力目标匹配模型,井通过具体例子进行计算算和分析,证明了该力一法的利学性和可行性。对指挥系统的目标处理子系统及战场目标综合处理的辅助决策系统开发提供了一定的理论支撑。 展开更多
关键词 马尔可夫 动态火力 目标匹配
下载PDF
CTMDP AND ITS RELATIONSHIP WITH DTMDP
2
作者 胡奇英 《Chinese Science Bulletin》 SCIE EI CAS 1990年第8期687-691,共5页
Continuous time Markov decision programming (shortly, CTMDP) with discount return criterion investigated in this note is {S,[(A(i), (i)), i∈S], q, r, α}. In this model the state set S is countable; the action set A(... Continuous time Markov decision programming (shortly, CTMDP) with discount return criterion investigated in this note is {S,[(A(i), (i)), i∈S], q, r, α}. In this model the state set S is countable; the action set A(i)is non-empty, (i)is a σ-algebra on A(i) which contains all single point sets of A(i); the family of the transition rate q(j|i, a) 展开更多
关键词 CTMDP dtmdp OPTIMALITY equation.
原文传递
First passage Markov decision processes with constraints and varying discount factors 被引量:2
3
作者 Xiao WU Xiaolong ZOU Xianping GUO 《Frontiers of Mathematics in China》 SCIE CSCD 2015年第4期1005-1023,共19页
This paper focuses on the constrained optimality problem (COP) of first passage discrete-time Markov decision processes (DTMDPs) in denumerable state and compact Borel action spaces with multi-constraints, state-d... This paper focuses on the constrained optimality problem (COP) of first passage discrete-time Markov decision processes (DTMDPs) in denumerable state and compact Borel action spaces with multi-constraints, state-dependent discount factors, and possibly unbounded costs. By means of the properties of a so-called occupation measure of a policy, we show that the constrained optimality problem is equivalent to an (infinite-dimensional) linear programming on the set of occupation measures with some constraints, and thus prove the existence of an optimal policy under suitable conditions. Furthermore, using the equivalence between the constrained optimality problem and the linear programming, we obtain an exact form of an optimal policy for the case of finite states and actions. Finally, as an example, a controlled queueing system is given to illustrate our results. 展开更多
关键词 Discrete-time Markov decision process (dtmdp) constrainedoptimality varying discount factor unbounded cost
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部