期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
A RISK-SENSITIVE STOCHASTIC MAXIMUM PRINCIPLE FOR OPTIMAL CONTROL OF JUMP DIFFUSIONS AND ITS APPLICATIONS 被引量:1
1
作者 史敬涛 吴臻 《Acta Mathematica Scientia》 SCIE CSCD 2011年第2期419-433,共15页
A stochastic maximum principle for the risk-sensitive optimal control prob- lem of jump diffusion processes with an exponential-of-integral cost functional is derived assuming that the value function is smooth, where ... A stochastic maximum principle for the risk-sensitive optimal control prob- lem of jump diffusion processes with an exponential-of-integral cost functional is derived assuming that the value function is smooth, where the diffusion and jump term may both depend on the control. The form of the maximum principle is similar to its risk-neutral counterpart. But the adjoint equations and the maximum condition heavily depend on the risk-sensitive parameter. As applications, a linear-quadratic risk-sensitive control problem is solved by using the maximum principle derived and explicit optimal control is obtained. 展开更多
关键词 risk-sensitive control jump diffusions maximum principle adioint equation
下载PDF
Risk-sensitive reinforcement learning algorithms with generalized average criterion
2
作者 殷苌茗 王汉兴 赵飞 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2007年第3期405-416,共12页
A new algorithm is proposed, which immolates the optimality of control policies potentially to obtain the robnsticity of solutions. The robnsticity of solutions maybe becomes a very important property for a learning s... A new algorithm is proposed, which immolates the optimality of control policies potentially to obtain the robnsticity of solutions. The robnsticity of solutions maybe becomes a very important property for a learning system when there exists non-matching between theory models and practical physical system, or the practical system is not static, or the availability of a control action changes along with the variety of time. The main contribution is that a set of approximation algorithms and their convergence results are given. A generalized average operator instead of the general optimal operator max (or rain) is applied to study a class of important learning algorithms, dynamic prOgramming algorithms, and discuss their convergences from theoretic point of view. The purpose for this research is to improve the robnsticity of reinforcement learning algorithms theoretically. 展开更多
关键词 reinforcement learning risk-sensitive generalized average algorithm convergence
下载PDF
Optimal Risk-Sensitive Filtering for System Stochastic of Second and Third Degree
3
作者 Ma Aracelia Alcorta-Garcia Sonia Gpe Anguiano Rostro Mauricio Torres Torres 《Intelligent Control and Automation》 2011年第1期47-56,共10页
The risk-sensitive filtering design problem with respect to the exponential mean-square cost criterion is con-sidered for stochastic Gaussian systems with polynomial of second and third degree drift terms and intensit... The risk-sensitive filtering design problem with respect to the exponential mean-square cost criterion is con-sidered for stochastic Gaussian systems with polynomial of second and third degree drift terms and intensity parameters multiplying diffusion terms in the state and observations equations. The closed-form optimal fil-tering equations are obtained using quadratic value functions as solutions to the corresponding Focker- Plank-Kolmogorov equation. The performance of the obtained risk-sensitive filtering equations for stochastic polynomial systems of second and third degree is verified in a numerical example against the optimal po-lynomial filtering equations (and extended Kalman-Bucy for system polynomial of second degree), through comparing the exponential mean-square cost criterion values. The simulation results reveal strong advan-tages in favor of the designed risk-sensitive equations for some values of the intensity parameters. 展开更多
关键词 OPTIMAL Nonlinear FILTERING risk-sensitive FILTERING Extended Kalman-Bucy FILTERING
下载PDF
Partially Observed Risk-Sensitive Stochastic Control Problems with Non-Convexity Restriction
4
作者 MA Heping LI Ruijing 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2023年第2期672-685,共14页
The paper considers partially observed optimal control problems for risk-sensitive stochastic systems,where the control domain is non-convex and the diffusion term contains the control v.Utilizing Girsanov’s theorem,... The paper considers partially observed optimal control problems for risk-sensitive stochastic systems,where the control domain is non-convex and the diffusion term contains the control v.Utilizing Girsanov’s theorem,spike variational technique as well as duality method,the authors obtain four adjoint equations and establish a maximum principle under partial information.As an application,an example is presented to demonstrate the result. 展开更多
关键词 Girsanov's theorem maximum principle partial information risk-sensitive optimal control
原文传递
G-stochastic maximum principle for risk-sensitive control problem and its applications
5
作者 Meriyam Dassa Adel Chala 《Probability, Uncertainty and Quantitative Risk》 2023年第4期463-484,共22页
This study advances the G-stochastic maximum principle(G-SMP)from a risk-neutral framework to a risk-sensitive one.A salient feature of this advancement is its applicability to systems governed by stochastic different... This study advances the G-stochastic maximum principle(G-SMP)from a risk-neutral framework to a risk-sensitive one.A salient feature of this advancement is its applicability to systems governed by stochastic differential equations under G-Brownian motion(G-SDEs),where the control variable may influence all terms.We aim to generalize our findings from a risk-neutral context to a risk-sensitive performance cost.Initially,we introduced an auxiliary process to address risk-sensitive performance costs within the G-expectation framework.Subsequently,we established and validated the correlation between the G-expected exponential utility and the G-quadratic backward stochastic differential equation.Furthermore,we simplified the G-adjoint process from a dual-component structure to a singular component.Moreover,we explained the necessary optimality conditions for this model by considering a convex set of admissible controls.To describe the main findings,we present two examples:the first addresses the linear-quadratic problem and the second examines a Merton-type problem characterized by power utility. 展开更多
关键词 Stochastic optimal control G-EXPECTATION G-Brownian motion G-Stochastic differential equation G-stochastic maximum principle risk-sensitive control Logarithmic transformation
原文传递
Design of satisfaction output feedback controls for stochastic nonlinear systems under quadratic tracking risk-sensitive index 被引量:8
6
作者 刘允刚 张纪峰 潘子刚 《Science in China(Series F)》 2003年第2期126-144,共19页
In this paper, the design problem of satisfaction output feedback controls for stochastic nonlinear systems in strict feedback form under long-term tracking risk-sensitive index is investigated. The index function ado... In this paper, the design problem of satisfaction output feedback controls for stochastic nonlinear systems in strict feedback form under long-term tracking risk-sensitive index is investigated. The index function adopted here is of quadratic form usually encountered in practice, rather than of quartic one used to beg the essential difficulty on controller design and performance analysis of the closed-loop systems. For any given risk-sensitive parameter and desired index value, by using the integrator backstepping method, an output feedback control is constructively designed so that the closed-loop system is bounded in probability and the risk-sensitive index is upper bounded by the desired value. 展开更多
关键词 integrator backstepping nonlinear system stochastic disturbance risk-sensitive index output feedback.
原文传递
Quasi-equality constrained risk-sensitive filtering for nonlinear discrete-time systems 被引量:1
7
作者 Linfeng SHEN, Yan LIN School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China 《控制理论与应用(英文版)》 EI 2012年第2期229-235,共7页
More and more data fusion models contain state constraints with valuable information in the filtering process. In this study, an optimal filter of risk sensitive with quasi-equality constraints is formulated using the... More and more data fusion models contain state constraints with valuable information in the filtering process. In this study, an optimal filter of risk sensitive with quasi-equality constraints is formulated using the reference probability method. Through recursion processes of probability density acquired from the probability measure change, the derived algorithm is optimal in the sense of the risk sensitive parameter. The system and constraint models are Consistent in statistics. Simulation results show that it is more robust and efficient than projection filters for the worst-case of noises and model uncertainty. 展开更多
关键词 Optimal estimation risk-sensitive filtering CONSTRAINTS
原文传递
Stressed portfolio optimization with semiparametric method
8
作者 Chuan-Hsiang Han Kun Wang 《Financial Innovation》 2022年第1期821-854,共34页
Tail risk is a classic topic in stressed portfolio optimization to treat unprecedented risks,while the traditional mean–variance approach may fail to perform well.This study proposes an innovative semiparametric meth... Tail risk is a classic topic in stressed portfolio optimization to treat unprecedented risks,while the traditional mean–variance approach may fail to perform well.This study proposes an innovative semiparametric method consisting of two modeling components:the nonparametric estimation and copula method for each marginal distribution of the portfolio and their joint distribution,respectively.We then focus on the optimal weights of the stressed portfolio and its optimal scale beyond the Gaussian restriction.Empirical studies include statistical estimation for the semiparametric method,risk measure minimization for optimal weights,and value measure maximization for the optimal scale to enlarge the investment.From the outputs of short-term and long-term data analysis,optimal stressed portfolios demonstrate the advantages of model flexibility to account for tail risk over the traditional mean–variance method. 展开更多
关键词 Portfolio optimization Tail risk Semiparametric method Kernel method Copula method Risk measure risk-sensitive value measure Scaling effect
下载PDF
Robust Designs Through Risk Sensitivity:An Overview
9
作者 BASAR Tamer 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第5期1634-1665,共32页
This is an overview paper on the relationship between risk-averse designs based on exponential loss functions with or without an additional unknown(adversarial)term and some classes of stochastic games.In particular,t... This is an overview paper on the relationship between risk-averse designs based on exponential loss functions with or without an additional unknown(adversarial)term and some classes of stochastic games.In particular,the paper discusses the equivalences between risk-averse controller and filter designs and saddle-point solutions of some corresponding risk-neutral stochastic differential games with different information structures for the players.One of the by-products of these analyses is that risk-averse controllers and filters(or estimators)for control and signal-measurement models are robust,through stochastic dissipation inequalities,to unmodeled perturbations in controlled system dynamics as well as signal and the measurement processes.The paper also discusses equivalences between risk-sensitive stochastic zero-sum differential games and some corresponding risk-neutral three-player stochastic zero-sum differential games,as well as robustness issues in stochastic nonzero-sum differential games with finite and infinite populations of players,with the latter belonging to the domain of mean-field games. 展开更多
关键词 Mean-field games risk-sensitive control risk-sensitive filtering risk-sensitive games risk sensitivity ROBUSTNESS
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部