期刊文献+

一种文本分类的在线SVM学习算法 被引量:13

An On-line Learning Algorithm for Text Categorization with Support Vector Machines
下载PDF
导出
摘要 本文提出了一种用于文本分类的RBF支持向量机在线学习算法。利用RBF核函数的局部性,该算法仅对新训练样本的某一大小邻域内且位于“可能带”中的训练样本集进行重新训练,以实现对现有SVM的更新。为高效的实现该邻域大小的自适应确定,使用ξa泛化错误估计在所有现有训练样本集上对当前SVM的泛化错误进行定性估计。同时引入泛化能力进化因子,使得结果SVM在分类效果上具有自动调整能力,并防止分类能力的退化。在TREC-5真实语料上的对比测试结果表明,该算法显著地加速了增量学习的过程而同时保证结果SVM的分类效果。 This paper suggests an on-line incremental learning algorithm based on RBF SVMs for text categorization problem. By exploiting the locality of RBF kennels, our algorithm updates current SVM using a subset of possible support candidates both in certain neighborhood of the new coming document and in a possible band. The size of subset is decided adaptively and efficiently by using of ζα generation error estimator on all the available training samples to qualitatively estimate the generation error rate. We also use an evolutionary factor of generation ability to make resulting SVMs adaptive on classifying precision and guarantee the generation ability of them. Comparative experiments on real-life TREC - 5 corpus show thai our algorithm can remarkably accelerate the process of incremental learning while retains the classifying precision.
出处 《中文信息学报》 CSCD 北大核心 2005年第5期11-15,23,共6页 Journal of Chinese Information Processing
基金 国家自然科学基金支持项目(60272088)
关键词 计算机应用 中文信息处理 文本分类 在线学习 增量学习 支持向量机 SMO computer application Chinese information processing text categorization on-line learning incremental learning SVM SMO
  • 相关文献

参考文献13

  • 1T Joachims. Text Categorization with Support Vector Machines: Iearning with Many Relevant Features [A]. Proceedings of ECML-98, 10th European Conference on Machine Learning[C]. 1997.
  • 2Ji He, Ah-Hwee Tan and Chew-Lira Tan. A Comparative Study on Chinese Text Categorization Methods [A]. In: the PRICAI'2000 International Workshop on Text and Web Mining[C]. August 2000. Melbourne, Australia. 24- 35.
  • 3代六玲,黄河燕,陈肇雄.中文文本分类中特征抽取方法的比较研究[J].中文信息学报,2004,18(1):26-32. 被引量:228
  • 4V N VapniK. The Nature of Statistical Learning Theory [M]. Springer, New York, 1998.
  • 5J Kivinen, A.J. SmolaandR. C. Williamson. Online learningwith kernels [A]. In: Proc. Advances in Neural Information Processing Systems[C], Cambridge, MA, 2002.
  • 6L Ralaivola, F. d'Alche-Buc. Incremental Support Vector Machine Learning: a Local Approach [A]. In: Proceedings of ICANN'01[C], Vienna, Austria, 2001.
  • 7N Syed, H Liu, and K Sung. Incremental learning with support vector machines [A]. In: Proc. of the Int. Joint Conf. on Artificial Intelligence (IJCAI)[C], 1999.
  • 8T. Joachims. Estimating the generalization performance ofasvm efficiently [A]. In: Proc. ofthe17th Int. Conf. on Machine Learning[C].Morgan Kaufmann, 2000.
  • 9S S Keerthi. Improvements to Platt's SMO algorithm for SVM classifier design [J]. Neural Computation, Vol. 13,March 2001, 637 - 649.
  • 10L Bottou, V Vapnik, Local learning algorithms [J]. Neural computation, 4(6), 888- 900, 1992.

二级参考文献4

共引文献227

同被引文献187

引证文献13

二级引证文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部