Stochastic Optimization Based Fast Learning Method on Large-Scale Noisy Datasets
Abstract: To address large-scale machine learning problems containing noisy and outlier data, the non-convex ramp loss function is adopted to suppress the influence of noise and outliers, and a fast learning method based on stochastic optimization is proposed for training non-convex linear support vector machines, effectively improving both training speed and prediction accuracy. Experimental results show that the proposed method greatly reduces learning time; on the MNIST dataset, training time is reduced by four orders of magnitude compared with the traditional learning method. The method also improves prediction speed to some extent and noticeably enhances the generalization performance of the classifiers on noisy datasets.
Author: 王家宝
Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》; EI, CSCD, Peking University Core Journal), 2013, No. 4, pp. 366-373 (8 pages)
Keywords: Large-Scale Machine Learning, Support Vector Machine, Ramp Loss, Stochastic Gradient Descent
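The abstract's central idea is the combination of a bounded, non-convex ramp loss with stochastic gradient descent for a linear SVM. The following is a minimal illustrative sketch of that combination, assuming a Pegasos-style decaying step size and the common ramp parameter s = -1; the function names and default values are my own choices, not the paper's exact algorithm.

```python
import numpy as np

def ramp_loss(z, s=-1.0):
    """Ramp loss R_s(z) = min(1 - s, max(0, 1 - z)).

    Equals the hinge loss for margins z > s, but is capped at 1 - s,
    so far-outlying (noisy) points contribute only a bounded loss.
    """
    return np.minimum(1.0 - s, np.maximum(0.0, 1.0 - z))

def sgd_ramp_svm(X, y, lam=1e-3, s=-1.0, epochs=5, seed=0):
    """Stochastic subgradient descent for a linear SVM with ramp loss.

    The loss is flat for margins below s, so heavily misclassified
    (likely noisy) points yield a zero loss subgradient and stop
    pulling on the weight vector.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            t += 1
            eta = 1.0 / (lam * t)      # Pegasos-style decaying step size
            margin = y[i] * (X[i] @ w)
            g = lam * w                # gradient of the L2 regularizer
            if s < margin < 1.0:       # only the sloped part of the ramp
                g = g - y[i] * X[i]
            w = w - eta * g
    return w
```

Because the ramp is flat below s, an outlier with a very negative margin contributes no gradient at all, which is the mechanism the abstract credits for robustness to noisy data.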
