期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
Asymptotic Evaluations of the Stability Index for a Markov Control Process with the Expected Total Discounted Reward Criterion
1
作者 Jaime Eduardo Martínez-Sánchez 《American Journal of Operations Research》 2021年第1期62-85,共24页
In this work, for a control consumption-investment process with the discounted reward optimization criteria, a numerical estimate of the stability index is made. Using explicit formulas for the optimal stationary poli... In this work, for a control consumption-investment process with the discounted reward optimization criteria, a numerical estimate of the stability index is made. Using explicit formulas for the optimal stationary policies and for the value functions, the stability index is explicitly calculated and through statistical techniques its asymptotic behavior is investigated (using numerical experiments) when the discount coefficient approaches 1. The results obtained define the conditions under which an approximate optimal stationary policy can be used to control the original process. 展开更多
关键词 Control Consumption-Investment Process Discrete-Time Markov Control Process expected Total Discounted Reward Probabilistic Metrics Stability Index Estimation
下载PDF
Evolution of Damage on Historical Heritage Buildings in Presence of Catastrophic Events and Aggressive Natural Phenomena
2
作者 Elsa Garavaglia Chiara Molina 《Journal of Civil Engineering and Architecture》 2011年第7期585-595,共11页
The feasibility analysis of projects for the preservation of the historical heritage buildings is an important problem concerning the evaluation of "the total cost of intervention", which includes all the future dam... The feasibility analysis of projects for the preservation of the historical heritage buildings is an important problem concerning the evaluation of "the total cost of intervention", which includes all the future damage costs. The total cost of intervention represents a suitable measure of the expected deterioration risk and its evolution obviously depends on the damage process which buildings are subjected to. That damage phenomena affecting masonry buildings pleased into an aggressive environment are suitably modelled by renewal processes: this happens both in the case of catastrophic events, or in the case of the so-called "natural aging", in which damage comes off gradually in time. In the hypothesis ofa Markovian renewal process (Mrp) describing the damage process, the total cost of all the future damage is evaluated taking into account both the damage aspects: damages due to catastrophic aspects and damages due to aggressive environment, supposing different maintenance and/or rehabilitation scenarios. A semi-Markov process (s-Mp) is defined to model the damage rehabilitation history of buildings in presence of seismic events, natural ageing and rehabilitation strategies. The expected rewards connected to the process are defined; they represent a significant measure of the risk. 展开更多
关键词 Historical heritage buildings damaging phenomena rehabilitation strategies semi-Markov and renewal processes expected rewards.
下载PDF
CONSTRAINED DENUMERABLE STATE NON-STATIONARY MDPs WITH EXPECTED TOTAL REWARD CRITERION
3
作者 郭先平 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2000年第2期205-212,共8页
In this paper, we consider constrained denumerable state non-stationary Markov decision processes (MDPs, for short) with expected total reward criterion. By the mechanics of intro- ducing Lagrange multiplier and using... In this paper, we consider constrained denumerable state non-stationary Markov decision processes (MDPs, for short) with expected total reward criterion. By the mechanics of intro- ducing Lagrange multiplier and using the methods of probability and analytics, we prove the existence of constrained optimal policies. Moreover, we prove that a constrained optimal policy may be a Markov policy, or be a randomized Markov policy that randomizes between two Markov policies, that differ in only one state. 展开更多
关键词 Non-stationary MDPs expected total reward criterion constrained optimal policies
全文增补中
STRONG N-DISCOUNT AND FINITE-HORIZON OPTIMALITY FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES 被引量:1
4
作者 ZHU Quanxin GUO Xianping 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2014年第5期1045-1063,共19页
This paper studies the strong n(n =—1,0)-discount and finite horizon criteria for continuoustime Markov decision processes in Polish spaces.The corresponding transition rates are allowed to be unbounded,and the rewar... This paper studies the strong n(n =—1,0)-discount and finite horizon criteria for continuoustime Markov decision processes in Polish spaces.The corresponding transition rates are allowed to be unbounded,and the reward rates may have neither upper nor lower bounds.Under mild conditions,the authors prove the existence of strong n(n =—1,0)-discount optimal stationary policies by developing two equivalence relations:One is between the standard expected average reward and strong—1-discount optimality,and the other is between the bias and strong 0-discount optimality.The authors also prove the existence of an optimal policy for a finite horizon control problem by developing an interesting characterization of a canonical triplet. 展开更多
关键词 Continuous-time Markov decision process expected average reward criterion finite-horizon optimality Polish space strong n-discount optimality
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部