Revisiting the ODE Method for Recursive Algorithms:Fast Convergence Using Quasi Stochastic Approximation

导出

摘要 Several decades ago,Profs.Sean Meyn and Lei Guo were postdoctoral fellows at ANU,where they shared interest in recursive algorithms.It seems fitting to celebrate Lei Guo’s 60 th birthday with a review of the ODE Method and its recent evolution,with focus on the following themes:The method has been regarded as a technique for algorithm analysis.It is argued that this viewpoint is backwards:The original stochastic approximation method was surely motivated by an ODE,and tools for analysis came much later(based on establishing robustness of Euler approximations).The paper presents a brief survey of recent research in machine learning that shows the power of algorithm design in continuous time,following by careful approximation to obtain a practical recursive algorithm.While these methods are usually presented in a stochastic setting,this is not a prerequisite.In fact,recent theory shows that rates of convergence can be dramatically accelerated by applying techniques inspired by quasi Monte-Carlo.Subject to conditions,the optimal rate of convergence can be obtained by applying the averaging technique of Polyak and Ruppert.The conditions are not universal,but theory suggests alternatives to achieve acceleration.The theory is illustrated with applications to gradient-free optimization,and policy gradient algorithms for reinforcement learning.

作者 CHEN Shuhang DEVRAJ Adithya BERSTEIN Andrey MEYN Sean

机构地区 Department of Mathematics Stanford University NREL University of Florida

出处《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021年第5期1681-1702,共22页 系统科学与复杂性学报（英文版）

基金 ARO W911NF1810334 NSF under EPCN 1935389 the National Renewable Energy Laboratory(NREL)。

关键词 Learning and adaptive systems in artificial intelligence reinforcement learning stochastic approximation

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1Chandra Sen.Role of Examples and Interpretation of Results in Developing Multi-Objective Optimization Techniques[J].American Journal of Operations Research,2020,10(4):138-145.
2Jinghuai GAO,Weimin HAN,Yanbin HE,Haixia ZHAO,Hui LI,Yijie ZHANG,Zongben XU.Seismic wave equations in tight oil/gas sandstone media[J].Science China Earth Sciences,2021,64(3):377-387. 被引量：1
3TANG Shidong.Study on Temporal and Spatial Psychological Transformation Mechanism of Landscape Perception[J].Journal of Landscape Research,2021,13(5):68-70.
4Paola Gervasio,Alfio Quarteroni.The INTERNODES Method for Non-conforming Discretizations of PDEs[J].Communications on Applied Mathematics and Computation,2019,1(3):361-401.
5康凌宇(编译).咖啡品质受气候变化影响[J].中国食品工业,2021(21):86-86.
6XIE Liang-Liang,ZHANG Ji-Feng.Preface[J].Journal of Systems Science & Complexity,2021,34(5).
7Javaria Ahmad Khan,Atif Akbar.Density Estimation Using Gumbel Kernel Estimator[J].Open Journal of Statistics,2021,11(2):319-328.
8Shukai CHEN,Zenghu LI.CONTINUOUS TIME MIXED STATE BRANCHING PROCESSES AND STOCHASTIC EQUATIONS[J].Acta Mathematica Scientia,2021,41(5):1445-1473. 被引量：1
9Ya-Hui Sun,Yong-Ge Yang,Wei Xu.Stochastic P-bifurcations of a noisy nonlinear system with fractional derivative element[J].Acta Mechanica Sinica,2021,37(3):507-515. 被引量：4
10Chun-Hu Wang,Meng-Jie Shan,Hao Liu,Yan Hao,Ke-Xin Song,Huan-Wen Wu,Tian Meng,Cheng Feng,Zheng Qi,Zhi Wang,You-Bin Wang.Hyperbaric oxygen treatment on keloid tumor immune gene expression[J].Chinese Medical Journal,2021(18):2205-2213. 被引量：5

Journal of Systems Science & Complexity

2021年第5期

浏览历史

内容加载中请稍等...

Revisiting the ODE Method for Recursive Algorithms:Fast Convergence Using Quasi Stochastic Approximation

相关作者

相关机构

相关主题

浏览历史