Topological Order Value Iteration Algorithm for Solving Probabilistic Planning

Topological Order Value Iteration Algorithm for Solving Probabilistic Planning

下载PDF

导出

摘要 AI researchers typically formulated probabilistic planning under uncertainty problems using Markov Decision Processes (MDPs).Value Iteration is an inef?cient algorithm for MDPs, because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases. In order to overcome this problem, many approaches have been proposed. Among them, LAO*, LRTDP and HDP are state-of-the-art ones. All of these use reach ability analysis and heuristics to avoid some unnecessary backups. However, none of these approaches fully exploit the graphical features of the MDPs or use these features to yield the best backup sequence of the state space. We introduce an improved algorithm named Topological Order Value Iteration (TOVI) that can circumvent the problem of unnecessary backups by detecting the structure of MDPs and backing up states based on topological sequences. The experimental results demonstrate the effectiveness and excellent performance of our algorithm. AI researchers typically formulated probabilistic planning under uncertainty problems using Markov Decision Processes (MDPs).Value Iteration is an inef?cient algorithm for MDPs, because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases. In order to overcome this problem, many approaches have been proposed. Among them, LAO*, LRTDP and HDP are state-of-the-art ones. All of these use reach ability analysis and heuristics to avoid some unnecessary backups. However, none of these approaches fully exploit the graphical features of the MDPs or use these features to yield the best backup sequence of the state space. We introduce an improved algorithm named Topological Order Value Iteration (TOVI) that can circumvent the problem of unnecessary backups by detecting the structure of MDPs and backing up states based on topological sequences. The experimental results demonstrate the effectiveness and excellent performance of our algorithm.

作者 Xiaofei Liu Mingjie Li Qingxin Nie

机构地区 Department of Computer Science Polytechnic School School of Foundation Courses

出处《Communications and Network》 2013年第1期86-89,共4页 通讯与网络（英文）

关键词 PROBABILISTIC Planning MARKOV DECISION Processes Dynamic PROGRAMMING Value ITERATION Probabilistic Planning Markov Decision Processes Dynamic Programming Value Iteration

分类号 O1 [理学—基础数学]

引文网络
相关文献

1Experimental probing topological order and its breakdown through modular matrices[J].Science Foundation in China,2017,25(4).
2Xiao-Gang Wen.Discovery of Fractionalized Neutral Spin-1/2 Excitation of Topological Order[J].Chinese Physics Letters,2017,34(9):1-2. 被引量：1
3饶东宁,郭海峰,蒋志华.基于并行概率规划的股票指数模拟[J].计算机学报,2019,42(6):1334-1350. 被引量：5
4Edilson F. Arruda,Fabrício Ourique.Adaptive Strategies for Accelerating the Convergence of Average Cost Markov Decision Processes Using a Moving Average Digital Filter[J].American Journal of Operations Research,2013,3(6):514-520.
5Abdul-Sattar J. Al-Saif,Assma J. Harfash.A Comparison between the Reduced Differential Transform Method and Perturbation-Iteration Algorithm for Solving Two-Dimensional Unsteady Incompressible Navier-Stokes Equations[J].Journal of Applied Mathematics and Physics,2018,6(12):2518-2543. 被引量：1
6Adrián ángel Inchauspe.Torsadogenic Index: Its Chinese Medical Origin[J].Pharmacology & Pharmacy,2013,4(7):1-3.
7A. S. Rasulov,M. T. Bakoev,D. R. Akabirhodjaeva.Probabilistic Approach to the Asynchronous Iteration[J].Journal of Applied Mathematics and Physics,2014,2(1):32-40.
8胡愈挺,万义顿,吴咏时.Boundary Hamiltonian Theory for Gapped Topological Orders[J].Chinese Physics Letters,2017,34(7):207-211. 被引量：2
9Yi Li.Fixed Point of a Countable Family of Uniformly Totally Quasi- <i>&Oslash</i>-Asymptotically Nonexpansive Multi-Valued Mappings in Reflexive Banach Spaces with Applications[J].Applied Mathematics,2013,4(9):6-12. 被引量：1
10Ruopeng Wang,Hong Shi,Kai Ruan,Xiangyu Gao.Fixed-Point Iteration Method for Solving the Convex Quadratic Programming with Mixed Constraints[J].Applied Mathematics,2014,5(2):256-262. 被引量：1

Communications and Network

2013年第1期

浏览历史

内容加载中请稍等...

Topological Order Value Iteration Algorithm for Solving Probabilistic Planning

相关作者

相关机构

相关主题

浏览历史