Nearly optimal stochastic approximation for online principal subspace estimation

导出

摘要 Principal component analysis(PCA) has been widely used in analyzing high-dimensional data. It converts a set of observed data points of possibly correlated variables into a set of linearly uncorrelated variables via an orthogonal transformation. To handle streaming data and reduce the complexities of PCA,(subspace)online PCA iterations were proposed to iteratively update the orthogonal transformation by taking one observed data point at a time. Existing works on the convergence of(subspace) online PCA iterations mostly focus on the case where the samples are almost surely uniformly bounded. In this paper, we analyze the convergence of a subspace online PCA iteration under more practical assumptions and obtain a nearly optimal finite-sample error bound. Our convergence rate almost matches the minimax information lower bound. We prove that the convergence is nearly global in the sense that the subspace online PCA iteration is convergent with high probability for random initial guesses. This work also leads to a simpler proof of the recent work on analyzing online PCA for the first principal component only.

作者 Xin Liang Zhen-Chen Guo Li Wang Ren-Cang Li Wen-Wei Lin

机构地区 Yau Mathematical Sciences Center Yanqi Lake Beijing Institute of Mathematical Sciences and Applications Department of Mathematics Department of Mathematics Department of Mathematics Nanjing Center for Applied Mathematics Department of Applied Mathematics

出处《Science China Mathematics》 SCIE CSCD 2023年第5期1087-1122,共36页 中国科学：数学（英文版）

基金 supported by National Natural Science Foundation of China(Grant No.11901340) National Science Foundation of USA(Grant Nos.DMS-1719620 and DMS-2009689) the ST Yau Centre at the Yang Ming Chiao Tung University.

关键词 principal component analysis principal component subspace stochastic approximation high-dimensional data online algorithm nite-sample analysis

分类号 O212.1 [理学—概率论与数理统计]

引文网络
相关文献

1黄海,刘红雨,邢琳,那宁,李春宝.DOT快速算法及其通用架构设计[J].哈尔滨理工大学学报,2021,26(2):9-16.
2Xue CHEN,Cheng WANG,Qing YANG,Teng HU,Changjun JIANG.Locally differentially private high-dimensional data synthesis[J].Science China(Information Sciences),2023,66(1):21-38.
3Jia HU,Qimin HU.A Symmetric Linearized Alternating Direction Method of Multipliers for a Class of Stochastic Optimization Problems[J].Journal of Systems Science and Information,2023,11(1):58-77.
4Tiansu Chen,Shi bin Zhang,Qirun Wang,Yan Chang.Quantum Fuzzy Regression Model for Uncertain Environment[J].Computers, Materials & Continua,2023(5):2759-2773.
5WANG Jimin,TAN Jianwei,ZHANG Ji-Feng.Differentially Private Distributed Parameter Estimation[J].Journal of Systems Science & Complexity,2023,36(1):187-204. 被引量：2
6Nur Laila Ab Ghani,Izzatdin Abdul Aziz,Said Jadid AbdulKadir.Subspace Clustering in High-Dimensional Data Streams:A Systematic Literature Review[J].Computers, Materials & Continua,2023(5):4649-4668.
7Ruipeng Yang,Aimin Yu,Lijun Cai,Dan Meng.Subspace clustering via graph auto-encoder network for unknown encrypted traffc recognition[J].Cybersecurity,2023,6(2):14-28.
8Ruduan Plug,Yan Liang,Aliya Aktau,Mariam Basajja,Francisca Oladipo,Mirjam van Reisen.Terminology for a FAIR Framework for the Virus Outbreak Data Network-Africa[J].Data Intelligence,2022,4(4):698-723.
9申培萍,王亚飞,吴殿晓.求解一类Minimax分式优化问题的几何规划方法[J].河南师范大学学报（自然科学版）,2023,51(2):56-62.
10Andrew Walcott Beckwith.Does a Fine Tuning of a Quartic Potential Allow for an Invariant Cosmological Constant? How This Supposition Could Lead to a Macro Model of Pressure in the Start of Inflation?[J].Journal of High Energy Physics, Gravitation and Cosmology,2023,9(2):552-560.

Science China Mathematics

2023年第5期

浏览历史

内容加载中请稍等...

Nearly optimal stochastic approximation for online principal subspace estimation

相关作者

相关机构

相关主题

浏览历史