期刊文献+

基于MapReduce的层叠分组并行SVM算法研究 被引量:10

RESEARCH ON CASCADE-GROUPING PARALLEL SVM ALGORITHM BASED ON MAPREDUCE
下载PDF
导出
摘要 随着训练集规模的不断增大,支持向量机学习成为了密集型计算的过程。针对计算过程中存在占用内存大、寻优速度慢等问题,通过大量实验对分组训练和层叠训练两种并行SVM算法进行性能分析,给出层叠分组SVM并行算法,并利用MapReduce并行框架实现,解决了层叠训练模型效率低的问题。实验结果表明,采用这种学习策略,在保持精度损失较小的情况下,一定程度上减少了训练时间,提高了分类速度。 With the constant growing of training set scale,support vector machine learning becomes intensive computing process. In view of the problems in calculation process including large memory and slow optimisation,we analyse the performances of two parallel SVM algorithms of grouping training and cascade training through a great deal of experiments,and present the cascade-grouping SVM parallel algorithm,and implement it using MapReduce parallel framework,this solves the problem of low efficiency of cascade training model.Experimental results show that by using this learning strategy,the training time is reduced and the classification speed is improved both to a certain extent without big precision loss.
出处 《计算机应用与软件》 CSCD 2015年第3期172-176,共5页 Computer Applications and Software
基金 国家自然科学基金项目(61363052) 内蒙古自然科学基金项目(2010MS0913) 内蒙古自治区气象信息中心合作项目
关键词 并行分类算法 支持向量机 MAPREDUCE 大规模数据集处理 Parallel classification algorithm Support vector machine MapReduce Large-scale dataset processing
  • 相关文献

参考文献5

二级参考文献69

  • 1高全学,潘泉,梁彦,张洪才,程咏梅.基于描述特征的人脸识别研究[J].自动化学报,2006,32(3):386-392. 被引量:13
  • 2胡正平,张晔.结构风险最小化近邻分析解决大规模训练集支持向量机学习问题[J].信号处理,2007,23(2):161-164. 被引量:3
  • 3Frieβ,T.T.,Cristianini,N.,Campbell,C.,1998.The Kernel-Adatron Algorithm:A Fast and Simple Learning Procedure for Support Vector Machines.Proc.15th Int.Conf.on Machine Learning,p.188-196.
  • 4Graf,HOP.,Cadambi,S.,Durdanovic,I.,Jakkula,V.,Sankaradass,M.,Cosatto,E.,Chakradhar,S.T.,2008.A Massively Parallel Digital Learning Processor.22nd Annual Conf.on Neural Information Processing Systems,p.529-536.
  • 5Keerthi,S.S.,Shevade,S.K.,Bhattacharyya,C.,Murthy,K.R.K.,2001.Improvements to Platt's SMO algorithm for SVM classifier design.Neut.Comput.,13(3):637-649.[doi:10.1162/089976601300014493].
  • 6Manikandan,J.,Venkataramani,B.,Avanthi,V.,2009.FPGA Implementation of Support Vector Machine Based Isolated Digit Recognition System.Proc.22nd Int.Conf.on VLSI Design,p.347-352.[doi:10.1109/VLSI.Design.2009.23].
  • 7Platt,J.C.,1999.Fast Training of Support Vector Machines Using Sequential Minimal Optimization.In:Scholkopf,B.,Burges,C.,Smola,A.(Eds.),Advances in Kernel Methods:Support Vector Learning.MIT Press,Cambridge,MA,p.185-208.
  • 8Sch(o)lkopf,B.,Burges,C.J.C.,Smola,A.J.,1999.Advances in Kernel Methods:Support Vector Learning.MIT Press,Cambridge,MA,p.1-16.
  • 9Sebald,D.J.,Bucklew,J.A.,2000.Support vector machine techniques for nonlinear equalization.IEEE Trans.Signal Process.,48(11):3217-3226.[doi:10.1109/78.875477].
  • 10Sun,Z.,Zhang,L.,Tang,E.,2005.An incremental learning method based on SVM for online sketchy shape recognition.LNCS,3610:655-659.[doi:10.1007/11539087_82].

共引文献20

同被引文献53

  • 1Vapnik V N.The Nature of Statistical Learning Theory[M].Springer New York,1995:988-999.
  • 2Chang C C,Lin C J.LIBSVM:a Library for Support Vector Machines[J].ACM Transactions on Intelligent Systems & Technology,2006,2(3):389-396.
  • 3Dong J X,Krzyzak A,Suen C Y.Fast SVM training algorithm with decomposition on very large data sets[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2005,27(4):603-618.
  • 4Lin C Y,Tsai C H,Lee C P,et al.Large-scale logistic regression and linear support vector machines using spark[C]∥2014 IEEE International Conference on Big Data.IEEE,2014:519-528.
  • 5Sun Zhan-quan,Fox G.Study on Parallel SVM Based on MapReduce[C]∥The 2012 International Conference on Parallel and Distributed Processing Techniques and Applications.Las Vegas NV USA,2012.
  • 6Dean J,Ghemawat S.MapReduce:Simplified Data Processing on Large Clusters[J].Proceedings of Operating Systems Design and Implementation(OSDI),2004,51(1):107-113.
  • 7Zaharia M,Chowdhury M,Franklin M J,et al.Spark:clustercomputing with working sets[C]∥Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing USENIX Association.2010:10.
  • 8http://spark.apache.org.
  • 9Zaharia M,Chowdhury M,Das T,et al.Resilient distributeddatasets:A fault-tolerant abstraction for in-memory cluster computing[C]∥Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation.2012:141-146.
  • 10Guo Xin-xin.SVM optimization algorithm based on Distributed Computing[D].Xi’an:Xi’an Electronic and Science University,2014(in Chinese).

引证文献10

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部