期刊文献+

SVM-KNN分类器在网页分类中的应用 被引量:4

Application of SVM-KNN Classifier into Web Page Classification
下载PDF
导出
摘要 为了提高中文网页自动分类的分类精度,将SVM-KNN方法用于中文网页分类。提出了一种中文网页的表示方法,在将下载的网页全部表示为向量空间的向量后,用SVM构造了一个多类分类器。在分类时通过在特征空间计算网页所表示的向量和分界面的距离决定采用SVM方法还是KNN方法对其分类。实验证明该方法是一种有效的方法,对网页分类的各类,使用该方法均比使用SVM方法具有更高的分类精度,同时能缓解SVM训练时对核参数的选择困难问题。 The SVM-KNN method is applied to Chinese web page classification in order to improve the classify accuracy of Chinese Web page. After analyzing the characteristics of web pages, a new vector representation of web pages was proposed and applies to web page classification. A multi-class classifier is constructed using SVM after the downloaded web pages was presented as a vector of vector space. In the classify phase, the distance is computed from vector to hyper-plane, based on which the SVM and KNN method are chosen. Experiment result shows the SVM-KNN method is valid method, which has a higher classify racy than that of the SVM. Besides, the problem of choosing kernel function parameters SVM is solved using this method.
作者 李蓉 孙媛
出处 《科学技术与工程》 2009年第16期4653-4656,共4页 Science Technology and Engineering
基金 北京市属高等学校人才强教计划项目(PHR200906210) 北京教育委员会科研基地建设项目 北京市教育委员会科技计划项目(KM200810037001) 国家自然科学基金重点项目(10673017)资助
关键词 支持向量 核函数 代表点 分类超平面 support vector kernel function representative point hyperplane
  • 相关文献

参考文献10

  • 1Craven M, Dipasquo A, Freitag A, et al. Learning to extract symbolic knowledge from the World Wide Web. In: Proc of the Fifteenth National Conf on Artificial Intelligence ( AAAI - 98 ), Wisconsin, 1998:509--516.
  • 2Lewis D D, Knowles K A. Threading electronic mail: a preliminary study. Information Processing and Management, 1997; 33 ( 2 ):209--217.
  • 3Lang Ken. News weeder: learning to filter net news. In: Int Conf on Machine Learning (ICML) , California, 1995:331--339.
  • 4Yang Y. Expert network: effective and efficient learning from human decisions in text categorization and retrieval. In: Proc Seventeenth International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, 1994 : 13--22.
  • 5Cohen W W , Singer Y. Context sensitive learning methods for text categorization. In: Proc Nineteenth International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, 1996:307--315.
  • 6Weber R, Schek H J, Blotut S. A quantifive analysis and performance study for similarity-search methods in high-dimensional spaces. A Proceedings of the 24rd International Conference on Very Large Data Bases, 194--205.
  • 7Vapnik V N. The nature of statistical learning theory. NY : Springer Verlag, 1995.
  • 8Burges C J C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discvery, 1998 ; 2 (2) :955 --974.
  • 9李蓉,叶世伟,史忠植.SVM-KNN分类器——一种提高SVM分类精度的新方法[J].电子学报,2002,30(5):745-748. 被引量:133
  • 10Salton. Introduction to modem information retrieval. New York: McGraw 2H ill Book Company,1983.

共引文献132

同被引文献46

引证文献4

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部