期刊文献+

支持向量机在文本自动分类中的应用研究 被引量:7

Support Vector Machine and Its Application in Automatic Document Classification
下载PDF
导出
摘要 针对数据挖掘中的文本自动分类问题,提出了一种基于支持向量机的分类方法。构造了可用于多个模式类识别的多层级连式SVM模型,该模型可完成对多个模式的分类识别。根据训练样本的分类体系完成对模型的构造之后,即可应用于实际文档的自动分类。文中给出了该模型的构造及应用的方法,用两种核函数作为内积回旋方案,以中国期刊网全文数据库部分文档数据为例验证了该方法的有效性。 Aiming to documents classification in data mining, a classification method based on the support vector machine is presented in this paper. The multi-layer linked SVM model that can classify the samples set into multi-categories is constructed. After it is constructed according to training samples set, the model can be applied to practical samples automatic classification. The method is shown about how the model is constructed and applied to classification. Two sorts of kernel function are applied to SVM, they are multinominal function and radial basis function. The model and algorithm are applied to some documents of China periodical document database and classification results illustrate its practicability.
机构地区 哈尔滨工程大学
出处 《情报学报》 CSSCI 北大核心 2006年第2期208-214,共7页 Journal of the China Society for Scientific and Technical Information
关键词 支持向量机 文本分类 机器学习 模式识别 support vector machine, document classification, machine learning, pattern recognition.
  • 相关文献

参考文献5

二级参考文献21

  • 1李辉.支撑向量机及其在文本分类中的应用,北京大学博士论文[M].,2001..
  • 2Peter Cord等 邵维忠等(译).Object-Oriented Analysis.Yourdon Press[M].北京:北京大学出版社,1992.65-77.
  • 3[1]Vapnik V. The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995.
  • 4[2]Stitson MO, Weston JAE, Gammerman A, Vovk V, Vapnik V. Theory of support vector machines. Technical Report, CSD-TR-96-17, Computational Intelligence Group, Royal Holloway: University of London, 1996.
  • 5[3]Cortes C, Vapnik V. Support vector networks. Machine Learning, 1995,20:273~297.
  • 6[4]Vapnik V. Statistical Learning Theory. John Wiley and Sons, 1998.
  • 7[5]Gammerman A, Vapnik V, Vowk V. Learning by transduction. In: Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence. Wisconsin, 1998. 148~156.
  • 8[6]Joachims T. Transductive inference for text classification using support vector machines. In: Proceedings of the 16th International Conference on Machine Learning (ICML). San Francisco: Morgan Kaufmann Publishers, 1999. 200~209.
  • 9[7]Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. In: Haussler D, ed. Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory. Pittsburgh, PA: ACM Press, 1992. 144~152.
  • 10[8]Burges CJC. Simplified support vector decision rules. In: Saitta L, ed. Proceedings of the 13th International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann Publishers, 1996. 71~77.

共引文献2466

同被引文献50

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部