期刊文献+

基于距离排序的快速支持向量机分类算法 被引量:10

FAST SUPPORT VECTOR MACHINE CLASSIFICATION ALGORITHM BASED ON DISTANCE SORTING
下载PDF
导出
摘要 传统支持向量机算法由于时空复杂度较高,因此很难有效地处理大规模数据。为了降低支持向量机算法的时空复杂度,提出一种基于距离排序的快速支持向量机分类算法。该算法首先计算两类样本点的样本中心,然后对每一个样本计算它与另一类样本中心之间的距离,最后根据距离排序选择一定比例的小距离样本作为边界样本。由于边界样本集合很好地包含了支持向量,而且数目较原始样本集合少得多,因此算法可以在保证支持向量机学习精度的前提下,有效地缩短训练时间和节约存储空间。在UCI标准数据集和20-Newsgroups文本分类数据集上的实验说明算法较以往支持向量预选取算法而言可以更为快速准确地进行支持向量预选取。 As the traditional SVM algorithms have high time and space complexities,it is difficult to deal with the large-scale data.In order to reduce the spatiotemporal complexity of SVM algorithm,in this paper we propose a distance sorting-based fast SVM classification algorithm.The algorithm first calculates the sample centres of the two types of sample points,then for each sample the algorithm calculates the distance between its centre and the centre of another type sample,and at last sorts according to the distances,and selects a certain percentage of samples with small distances as the boundary samples.Since the boundary sample set well contain the support vectors,and their number is much less compared with the original sample set,so this algorithm can effectively shorten the training time and save the storage space in premise of guaranteeing the SVM in good learning accuracy.The experiments on UCI standard data sets and 20-Newsgroups text classification data set demonstrate that our algorithm can pre-select the support vectors faster and more accurately compared with previous support vector selection algorithms.
出处 《计算机应用与软件》 CSCD 北大核心 2013年第4期85-87,100,共4页 Computer Applications and Software
基金 国家自然科学基金项目(1072166) 山西省自然科学基金项目(2009011018-4)
关键词 支持向量机 时空复杂度 大规模数据 距离排序 Support vector machine(SVM) Time and space complexity Large-scale data Distance sorting
  • 相关文献

参考文献12

  • 1Cortes C, Vapnik V. Support vector networks[ J]. Machine Learning, 1995,20:273 - 297.
  • 2Yang J, Yu X, Xie Z Q. A novel virtual sample generation method based on Gaussian distribution [ J ]. Knowledge-Based Systems, 2011, 24 ( 8 ) :740 - 748.
  • 3Platt J C. Fast training of support vector machines using sequential minimal optimization [ M ]//B Scholkopf, C Burges, A Smola. Ad- vances in Kernel Methods-Support Vector Learning, MIT Press,1999: 185 - 208.
  • 4李青,焦李成,周伟达.基于向量投影的支撑向量预选取[J].计算机学报,2005,28(2):145-152. 被引量:37
  • 5阎平凡.对多层前向神经网络研究的进一步看法[J].电子学报,1999,27(5):82-85. 被引量:25
  • 6Blake C, Keogh E, Merz CJ. UCI repository of machine learning data- bases[ EB/OL]. Department of Information and Computer Science, U- niversity of California, Irvine, CA, 1998. http://www, ics. uci. edu/ mlearn/MLRepository, html.
  • 7Lang K. Newsweeder: Learning to filter netnews [ C ]//Proceedings of the Twelfth International Conference on Machine Learning, 1995:331 - 339.
  • 8Boser B E, Guyon I M, Vapnik V N. A training algorithm for optimal margin classifiers[ C]//D Haussler. Proceedings Fifth Annual Work- shop on Computational Learning Theory:144 - 152.
  • 9Osuna E, Freund R, Girosi F. An improved training algorithm for sup- port vector machines [ C ]//IEEE Workshop on Neural Networks for Signal Processing :276 - 285.
  • 10Shin H, Cho S. Pattern selection for support vector classifiers [ C ]// Proc. of the 3rd International Conference on Intelligent Data Engineer- ing and Automated Learning:469- 474.

二级参考文献16

  • 1张承福.对当前神经网络研究的几点看法[J].力学进展,1994,24(2):181-186. 被引量:10
  • 2阎平凡.人工神经网络的容量、学习与计算复杂性[J].电子学报,1995,23(5):63-67. 被引量:82
  • 3J.Sklansky 阎平凡等(译).模式分类器和可训练机器(第1章)[M].科学出版社,1987..
  • 4B.N.瓦西里耶夫 阎平凡等(译).机器识别方法与系统[M].科学出版社,1991,71.158.
  • 5边肇祺.模式识别(第7章)[M].清华大学出版社,1987..
  • 6阎平凡.对多层前向神经网络研究的几点看法[J].自动化学报,1997,23(1):129-135. 被引量:34
  • 7Osuna Edgar, Freund Robert, Girosi Federico. An improved training algorithm for support vector machines. In: Proceedings of IEEE NNSP'97, Amelia Island.,FL., 1997, 24~26.
  • 8Smola A. Regression estimation with support vector learning machines[M.S. dissertation]. Technology University of Mumchen, 1996.
  • 9Burges C.J.C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, 2(2): 1~47.
  • 10Vapnik V.N. An overview of statistical learning theory. IEEE Transactions on Neural Network, 1999, 10(5): 988~999.

共引文献59

同被引文献97

引证文献10

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部