
Sparse Stochastic Optimization Algorithm with Optimal Individual Convergence Rate Based on Random Step-Size
Abstract: Almost all sparse stochastic algorithms are derived from the online setting, so only a convergence rate for the averaged output can be obtained, and the optimal individual convergence rate for strongly convex optimization problems cannot be attained. This paper avoids the online-to-batch conversion and studies stochastic optimization algorithms directly. Firstly, an L2 regularizer is added to the L1-regularized sparse optimization problem so that the objective becomes strongly convex. Then, the random step-size strategy from black-box optimization methods is introduced into the widely used composite optimization algorithm COMID (composite objective mirror descent), yielding a sparse stochastic optimization algorithm based on random step-size hybrid regularized mirror descent (RS-HMD). Finally, by analyzing how the soft-thresholding method solves the L1-regularized problem, the optimal individual convergence rate of the algorithm is proved. Experimental results show that the sparsity of RS-HMD is better than that of COMID.
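To make the construction described in the abstract concrete, the sketch below shows a COMID-style composite update with a hybrid L1 + L2 regularizer, where the L1 part is handled by soft thresholding and the step size is drawn at random around a 1/(lambda2*t) schedule. This is a minimal illustration only, not the paper's RS-HMD implementation: the squared loss, the uniform step-size perturbation, the function names (rs_hmd_sketch, soft_threshold), and all parameter values are assumptions made for this example.

```python
# Illustrative sketch (not the authors' code): a composite-mirror-descent-style
# update with a hybrid L1 + L2 regularizer and a randomized step size, applied
# to a least-squares loss. Step-size schedule and parameters are assumptions.
import numpy as np

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1 (element-wise soft thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def rs_hmd_sketch(X, y, lam1=0.1, lam2=0.1, T=1000, seed=0):
    """Stochastic composite updates on 0.5*(x^T w - y)^2 + lam1*||w||_1,
    with an extra (lam2/2)*||w||_2^2 term to make the objective strongly
    convex and a step size drawn uniformly around 1/(lam2*t) (assumption)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for t in range(1, T + 1):
        i = rng.integers(n)                       # sample one training example
        grad = (X[i] @ w - y[i]) * X[i]           # stochastic gradient of the smooth loss
        eta = rng.uniform(0.5, 1.5) / (lam2 * t)  # randomized step size (illustrative)
        # composite update: gradient step on the smooth part,
        # exact proximal step for the L1 + L2 regularizer
        v = w - eta * grad
        w = soft_threshold(v, eta * lam1) / (1.0 + eta * lam2)
    return w

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    w_true = np.zeros(50)
    w_true[:5] = rng.normal(size=5)               # sparse ground-truth weights
    X = rng.normal(size=(500, 50))
    y = X @ w_true + 0.01 * rng.normal(size=500)
    w_hat = rs_hmd_sketch(X, y)
    print("nonzeros:", np.count_nonzero(np.round(w_hat, 3)))
```

The closed-form proximal step soft_threshold(v, eta*lam1) / (1 + eta*lam2) is the standard prox of the combined L1 + L2 term; the soft-thresholding component forces exact zeros in every iterate, which is the sparsity property the abstract refers to.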
Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), 2015, No. 10: 876-885 (10 pages). Indexed in EI, CSCD, and the Peking University Core Journal list.
Funding: Supported by the National Natural Science Foundation of China (No. 61273296) and the Youth Program of the Anhui Provincial Natural Science Foundation (No. 1508085QF114, No. 1308085QF121).
Keywords: Machine Learning, Stochastic Optimization, Optimal Individual Convergence Rate, Sparsity

References (18)

  • 1 Wang J, Tao Q. Machine Learning: The State of the Art. IEEE Intelligent Systems, 2008, 23(6): 49-55.
  • 2 Tao Q, Gao Q K, Jiang J Y, Chu D J. A Survey of Solving Optimization Problems in Sparse Learning. Journal of Software, 2013, 24(11): 2498-2507. (in Chinese)
  • 3 Shalev-Shwartz S, Singer Y, Srebro N, et al. Pegasos: Primal Estimated Sub-gradient Solver for SVM. Mathematical Programming, 2011, 127(1): 3-30.
  • 4 Duchi J, Shalev-Shwartz S, Singer Y, et al. Composite Objective Mirror Descent // Proc of the 23rd Annual Workshop on Computational Learning Theory. New York, USA, 2010: 116-128.
  • 5 Xiao L. Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization. Journal of Machine Learning Research, 2010, 11: 2543-2596.
  • 6 Zinkevich M. Online Convex Programming and Generalized Infinitesimal Gradient Ascent // Proc of the 20th International Conference on Machine Learning. New York, USA, 2003: 928-936.
  • 7 Rakhlin A, Shamir O, Sridharan K. Making Gradient Descent Optimal for Strongly Convex Stochastic Optimization // Proc of the 29th International Conference on Machine Learning. Edinburgh, UK, 2012: 449-456.
  • 8 Shamir O, Zhang T. Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes // Proc of the 30th International Conference on Machine Learning. New York, USA, 2013: 71-79.
  • 9 Lacoste-Julien S, Schmidt M, Bach F. A Simpler Approach to Obtaining an O(1/t) Convergence Rate for Projected Stochastic Sub-gradient Descent [EB/OL]. [2014-12-20]. http://arxiv.org/pdf/1212.2002v2.pdf.
  • 10 Hazan E, Kale S. Beyond the Regret Minimization Barrier: Optimal Algorithms for Stochastic Strongly-Convex Optimization. Journal of Machine Learning Research, 2014, 15(1): 2489-2512.
