期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
A review on Markov Decision Processes 被引量:4
1
作者 J. A. Filar and LIU Ke Centre for Industrial and Applicable Mathematics , University of South Australia , Australia Institute of Applied Mathematics, Chinese Academy of Sciences , Beijing 100080, China 《Chinese Science Bulletin》 SCIE EI CAS 1999年第7期672-672,共1页
MARKOV decision processes (MDPs) have been studied by mathematicians, probabilists, operation researchers and engineers since the late 1950s. In an MDPs a stochastic, dynamic system is controlled by a 'policy'... MARKOV decision processes (MDPs) have been studied by mathematicians, probabilists, operation researchers and engineers since the late 1950s. In an MDPs a stochastic, dynamic system is controlled by a 'policy' selected by a decision-maker/controller, with the goal of maximizing an overall reward function that is an appropriately defined aggregate of immediate rewards, over either finite or infinite time horizon.As such MDPs are a useful paradigm for modeling many processes occurring naturally in the management and engineering contexts.. 展开更多
关键词 A review on Markov Decision Processes
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部