Abstract
This paper studies the l_1-regularized optimization problem over the unit sphere, whose objective function is the sum of a general smooth function and a nonsmooth l_1 regularization term; the stochastic gradient of the smooth term is assumed to be available through a stochastic first-order oracle. Optimization problems of this type arise widely in machine learning, image and signal processing, and statistics. Combining the manifold proximal gradient method with stochastic gradient estimation techniques, we propose a spherical stochastic proximal gradient algorithm. Based on a global implicit function theorem for a certain nonsmooth function, we analyze the Lipschitz continuity of the subproblem solutions with respect to the parameters, and then prove the global convergence of the algorithm under suitable assumptions. Numerical experiments on the l_1-regularized quadratic programming problem, the finite-sum sparse PCA problem, and the l_1-regularized logistic regression problem over the sphere, with both synthetic and real data sets, show that the proposed algorithm is competitive with the manifold proximal gradient method and the Riemannian stochastic proximal gradient method in terms of CPU time.
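The algorithmic idea summarized in the abstract can be sketched as follows. This is a minimal, hypothetical illustration, not the authors' exact method: it replaces the tangent-space proximal subproblem analyzed in the paper with plain Euclidean soft-thresholding followed by normalization back to the sphere, and the function name, step size, and iteration count are assumptions made for the example.

```python
import numpy as np

def sphere_stochastic_prox_grad(grad_batch, lam, x0, step=0.1, iters=500, seed=0):
    """Sketch of a stochastic proximal gradient iteration on the unit sphere.

    Minimizes f(x) + lam * ||x||_1 subject to ||x||_2 = 1, where grad_batch(x, rng)
    returns a noisy estimate of the gradient of the smooth term f.
    """
    rng = np.random.default_rng(seed)
    x = x0 / np.linalg.norm(x0)
    for _ in range(iters):
        g = grad_batch(x, rng)            # stochastic gradient of the smooth term
        rg = g - (g @ x) * x              # project onto the tangent space at x
        v = x - step * rg                 # gradient step along the tangent direction
        # Prox of the l_1 term via soft-thresholding (a Euclidean simplification
        # of the paper's tangent-space subproblem):
        v = np.sign(v) * np.maximum(np.abs(v) - step * lam, 0.0)
        n = np.linalg.norm(v)
        x = v / n if n > 0 else x         # retraction: normalize back to the sphere
    return x
```

On a toy instance with smooth term f(x) = -a^T x for a vector a dominated by one coordinate, the iterates concentrate on that coordinate while the l_1 term zeroes out the small ones.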
Authors
Mi Ling, Xue Wenjuan, Shen Chungen (College of Science, University of Shanghai for Science and Technology, Shanghai 200093, China; School of Mathematics and Physics, Shanghai University of Electric Power, Shanghai 200090, China)
Source
《计算数学》 (Mathematica Numerica Sinica)
CSCD; Peking University Core Journal (北大核心)
2022, No. 1, pp. 34-62 (29 pages)
Funding
Supported by the National Natural Science Foundation of China (11601318).
Keywords
spherical constraint; l_1 regularization; stochastic gradient estimation; global implicit function theorem; global convergence