期刊文献+

一种新的支持向量机大规模训练样本集缩减策略 被引量:11

New reduction strategy of large-scale training sample set for SVM
下载PDF
导出
摘要 支持向量机(SVM)在许多实际应用中由于训练样本集规模较大且具有类内混杂孤立点数据,引发了学习速度慢、存储需求量大、泛化能力降低等问题,成为直接使用该技术的瓶颈。针对这些问题,通过在点集理论的基础上分析训练样本集的结构,提出了一种新的支持向量机大规模训练样本集缩减策略。该策略运用模糊聚类方法快速的提取出潜在支持向量并去除类内非边界孤立点,在减小训练样本集规模的同时,能够有效地避免孤立点数据所造成的过学习现象,提高了SVM的泛化性能,在保证不降低分类精度的前提下提高训练速度。 It has become a bottleneck to use Support Vector Machine (SVM) due to such problems as slow learning speed, large buffer memory requirement, low generalization performance and so on, which are caused by large-scale training sample set and outlier data immixed in the other class. Concerning these problems, this paper proposed a new reduction strategy for large-scale training sample set according to the analysis on the structure of the training sample set based on the point set theory. This new strategy gets the potential support vectors and removes the non-boundary outlier data immixed in the other class by using fuzzy clustering. That can greatly reduce the scale of the training sample set and improve the generalization performance by effectively avoiding over-learning caused by outlier data, and finally speed up learning rate without reducing the classification accuracy.
出处 《计算机应用》 CSCD 北大核心 2009年第10期2736-2740,共5页 journal of Computer Applications
基金 天津市自然科学基金资助项目(07JCZDJC10800)
关键词 支持向量机 点集 模糊C-均值 潜在支持向量 孤立点 Support Vector Machine (SVM) point set Fuzzy C-Means (FCM) potential support vector outlier
  • 相关文献

参考文献7

二级参考文献27

  • 1文益民,吕宝粮.最小最大模块化支持向量机改进研究[J].计算机工程与应用,2005,41(19):185-188. 被引量:5
  • 2胡荣,罗庆云.kNN算法在文本分类中的改进[J].南华大学学报(自然科学版),2005,19(3):78-80. 被引量:4
  • 3苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:381
  • 4Hearst M A, Dumais S T, Osman E, Platt J, Scholkopf B.Support Vector Machines. IEEE Intelligent Systems, 1998, 13(4) : 18-28.
  • 5Ke Hai-Xin,Zhang Xue-Gong. Editing support vector machines.In: Proceedings of International Joint Conference on Neural Networks, Washington, USA, 2001, 2:1464-1467.
  • 6Vapnik V N. An overview of statistical learning theory. IEEE Transactions on Neural Networks, 1999, 10 (5): 988-999.
  • 7Vapnik V N. Statistical Learning Theory. 2nd ed. New York:Springer-Verlag : 1999.
  • 8Klaus-Robert Mailer, Sebastian Mika, Gunnar Raetsch, Koji Tsuda, and Bernhard Schoelkopf. An introduction to kernel-based learning algorithms. IEEE Transactions on Neural Networks, 2001, 12 (2): 181-201.
  • 9Burges C J C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, 2(2): 121-167.
  • 10V N Vapnik 著.许建华,张学工译.统计学习理论[M].北京:电子工业出版社,2004

共引文献81

同被引文献101

引证文献11

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部