This paper concerns the construction and regularity of a transition (probability) function of a nonhomogeneous continuous-time Maxkov process with given transition rates and a general state space. Motivating from a ...This paper concerns the construction and regularity of a transition (probability) function of a nonhomogeneous continuous-time Maxkov process with given transition rates and a general state space. Motivating from a lot of restriction in applications of a transition function with continuous (in t ≥0) and consewative transition rates q(t, x, A), we consider the case that q(t, x, A) axe only required to satisfy a mild measurability (in t ≥ O) condition, which is a generalization of the continuity condition. Under the measurability condition we construct a transition function with the given transition rates, provide a necessary and sufficient condition for it to be regular, and further obtain some interesting additional results.展开更多
This paper is an attempt to study the minimization problem of the risk probability of piecewise deterministic Markov decision processes(PDMDPs)with unbounded transition rates and Borel spaces.Different from the expect...This paper is an attempt to study the minimization problem of the risk probability of piecewise deterministic Markov decision processes(PDMDPs)with unbounded transition rates and Borel spaces.Different from the expected discounted and average criteria in the existing literature,we consider the risk probability that the total rewards produced by a system do not exceed a prescribed goal during a first passage time to some target set,and aim to find a policy that minimizes the risk probability over the class of all history-dependent policies.Under suitable conditions,we derive the optimality equation(OE)for the probability criterion,prove that the value function of the minimization problem is the unique solution to the OE,and establish the existence ofε(≥0)-optimal policies.Finally,we provide two examples to illustrate our results.展开更多
This work develops near-optimal controls for systems given by differential equations with wideband noise and random switching.The random switching is modeled by a continuous-time,time-inhomogeneous Markov chain.Under ...This work develops near-optimal controls for systems given by differential equations with wideband noise and random switching.The random switching is modeled by a continuous-time,time-inhomogeneous Markov chain.Under broad conditions,it is shown that there is an associated limit problem,which is a switching jump diffusion.Using near-optimal controls of the limit system,we then build controls for the original systems.It is shown that such constructed controls are nearly optimal.展开更多
基金Supported by the National Natural Science Foundation of China (No.10925107)Guangdong Province Universities and Colleges Pearl River Scholar Funded Schemethe Fundamental Research Funds for the Central Universities (No.11612314)
文摘This paper concerns the construction and regularity of a transition (probability) function of a nonhomogeneous continuous-time Maxkov process with given transition rates and a general state space. Motivating from a lot of restriction in applications of a transition function with continuous (in t ≥0) and consewative transition rates q(t, x, A), we consider the case that q(t, x, A) axe only required to satisfy a mild measurability (in t ≥ O) condition, which is a generalization of the continuity condition. Under the measurability condition we construct a transition function with the given transition rates, provide a necessary and sufficient condition for it to be regular, and further obtain some interesting additional results.
基金supported by the National Natural Science Foundation of China(Nos.11931018,11961005)Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University(No.2020B1212060032)the Natural Science Foundation of Guangxi Province(No.2020GXNSFAA297196)。
文摘This paper is an attempt to study the minimization problem of the risk probability of piecewise deterministic Markov decision processes(PDMDPs)with unbounded transition rates and Borel spaces.Different from the expected discounted and average criteria in the existing literature,we consider the risk probability that the total rewards produced by a system do not exceed a prescribed goal during a first passage time to some target set,and aim to find a policy that minimizes the risk probability over the class of all history-dependent policies.Under suitable conditions,we derive the optimality equation(OE)for the probability criterion,prove that the value function of the minimization problem is the unique solution to the OE,and establish the existence ofε(≥0)-optimal policies.Finally,we provide two examples to illustrate our results.
基金supported in part by the National Science Foundation under DMS-1207667supported in part by NSFC and RFDP
文摘This work develops near-optimal controls for systems given by differential equations with wideband noise and random switching.The random switching is modeled by a continuous-time,time-inhomogeneous Markov chain.Under broad conditions,it is shown that there is an associated limit problem,which is a switching jump diffusion.Using near-optimal controls of the limit system,we then build controls for the original systems.It is shown that such constructed controls are nearly optimal.