
Second-Order Nonconvex Sparse Optimization Method Based on Stochastic L-BFGS
(基于随机L-BFGS的二阶非凸稀疏优化算法) · Cited by: 1
Abstract: First-order optimization methods are widely used for learning sparse models; their common strategy is to combine the Iterative Hard Thresholding (IHT) algorithm with a conventional optimization method. By contrast, second-order methods are rarely applied to sparse optimization problems, because computing the Hessian matrix and its inverse requires substantial computational resources. To exploit second-order information both effectively and efficiently, this paper proposes a novel Stochastic L-BFGS Hard Thresholding method for nonconvex sparse learning problems. The core idea is to incorporate the IHT projection into the stochastic L-BFGS algorithm, which significantly accelerates convergence while preserving model performance. Experimental results on linear regression and logistic regression demonstrate the superiority of the proposed method.
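To make the core idea concrete, the sketch below applies the general stochastic-L-BFGS-plus-IHT recipe to sparse least squares: each iteration computes a minibatch gradient, builds a search direction with the standard L-BFGS two-loop recursion, takes a step, and then projects the iterate back onto the sparsity constraint by hard thresholding. This is a minimal illustration under stated assumptions, not the authors' reference implementation; the function names, step size, memory size, batch size, and the curvature-pair update rule are choices made here for readability.

```python
# Illustrative sketch of a stochastic L-BFGS hard-thresholding loop for
# sparse least squares. NOT the paper's reference implementation: the
# hyperparameters and the curvature-pair schedule are assumptions.
import numpy as np

def hard_threshold(x, k):
    """IHT projection: keep the k largest-magnitude entries, zero out the rest."""
    out = np.zeros_like(x)
    keep = np.argsort(np.abs(x))[-k:]
    out[keep] = x[keep]
    return out

def two_loop_direction(grad, s_list, y_list):
    """Standard L-BFGS two-loop recursion: approximate H^{-1} @ grad."""
    q = grad.copy()
    alphas = []
    for s, y in zip(reversed(s_list), reversed(y_list)):
        rho = 1.0 / (y @ s)
        a = rho * (s @ q)
        alphas.append(a)
        q -= a * y
    # Initial Hessian scaling gamma = s'y / y'y (identity if no pairs stored yet).
    gamma = (s_list[-1] @ y_list[-1]) / (y_list[-1] @ y_list[-1]) if s_list else 1.0
    r = gamma * q
    for (s, y), a in zip(zip(s_list, y_list), reversed(alphas)):
        rho = 1.0 / (y @ s)
        beta = rho * (y @ r)
        r += (a - beta) * s
    return r

def stochastic_lbfgs_ht(A, b, k, n_iters=300, batch=64, lr=0.2, memory=10, seed=0):
    """Sparse least squares: min 0.5/batch * ||Ax - b||^2  s.t.  ||x||_0 <= k."""
    rng = np.random.default_rng(seed)
    m, dim = A.shape
    x = np.zeros(dim)
    s_list, y_list = [], []
    prev_x = prev_g = None
    for _ in range(n_iters):
        idx = rng.choice(m, size=batch, replace=False)
        Ab, bb = A[idx], b[idx]
        g = Ab.T @ (Ab @ x - bb) / batch            # stochastic (minibatch) gradient
        if prev_x is not None:
            s, y = x - prev_x, g - prev_g
            # Relative safeguard: discard noisy curvature pairs with tiny s'y.
            if s @ y > 1e-4 * np.linalg.norm(s) * np.linalg.norm(y):
                s_list.append(s); y_list.append(y)
                if len(s_list) > memory:
                    s_list.pop(0); y_list.pop(0)
        direction = two_loop_direction(g, s_list, y_list)
        prev_x, prev_g = x.copy(), g.copy()
        x = hard_threshold(x - lr * direction, k)   # quasi-Newton step, then IHT projection
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.standard_normal((500, 100))
    x_true = np.zeros(100)
    x_true[:5] = np.array([3.0, -2.0, 1.5, -1.0, 2.5])
    b = A @ x_true + 0.01 * rng.standard_normal(500)
    x_hat = stochastic_lbfgs_ht(A, b, k=5)
    print("recovered support:", np.sort(np.nonzero(x_hat)[0]))
```

The same projection step applies unchanged if the least-squares loss is swapped for the logistic loss as in the paper's second experiment; the relative safeguard on s·y is a common practical filter against noisy minibatch curvature pairs rather than part of the published algorithm.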
Authors: LIU Guang-yu, ZHANG Ling-wei, HANG Ren-long (Jiangsu Key Laboratory of Big Data Analysis Technology, Nanjing University of Information Science and Technology, Nanjing, Jiangsu 210044, China)
Published in: Computer Simulation (《计算机仿真》, Peking University Core Journal list), 2022, Issue 10, pp. 359-363 (5 pages)
Funding: Jiangsu Province Youth Fund Project (BK20180786); National Natural Science Foundation of China (61906096)
Keywords: first-order optimization methods; second-order information; iterative hard thresholding (IHT); sparse learning
