Introduction to the KNN-SVM Web Page Classifier (original title: KNN-SVM网页分类器介绍)


Abstract: Among web page classification algorithms, one drawback of KNN is its low classification efficiency, and its performance depends heavily on the choice of the similarity function and the parameter K. SVM-based web page classifiers, meanwhile, are limited by the requirement that the vectors they process be numeric, whereas the feature vector of a web page is usually a term (word) feature vector. The paper uses KNN to generate training samples and thereby converts the term feature vectors into numeric vectors, then uses an SVM classifier to classify the test pages. The result is a new classifier, the KNN-SVM classifier.
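The abstract does not say how KNN turns term vectors into numbers, so the following is only a minimal sketch of one possible reading, not the authors' method. It assumes Jaccard similarity over term sets, a numeric vector holding the summed similarity of the K nearest labeled pages per class, and a small linear hinge-loss SVM trained by subgradient descent (the paper's reference list points to LIBSVM instead). The toy pages, class names, function names, and hyperparameters are all hypothetical.

```python
def jaccard(a, b):
    """Similarity of two term sets (an assumed choice of similarity function)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def knn_numeric_vector(terms, labeled_pages, classes, k):
    """Map a term set to one number per class: the summed similarity of
    those among its k nearest labeled pages that belong to that class."""
    nearest = sorted(((jaccard(terms, t), c) for t, c in labeled_pages),
                     key=lambda pair: pair[0], reverse=True)[:k]
    return [sum(s for s, c in nearest if c == cls) for cls in classes]

def train_linear_svm(X, y, epochs=200, eta=0.1, lam=0.01):
    """Binary linear SVM (labels +1/-1): fixed-step subgradient descent
    on the L2-regularized hinge loss. Hyperparameters are illustrative."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            w = [wj * (1.0 - eta * lam) for wj in w]  # regularization shrink
            if margin < 1.0:                          # hinge loss is active
                w = [wj + eta * yi * xj for wj, xj in zip(w, xi)]
                b += eta * yi
    return w, b

def svm_predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Hypothetical toy corpus: each page is a set of terms plus a class label.
CLASSES = ["sports", "tech"]
LABELED = [
    ({"football", "goal", "team"}, "sports"),
    ({"team", "match", "score"}, "sports"),
    ({"goal", "score", "league"}, "sports"),
    ({"python", "code", "software"}, "tech"),
    ({"software", "computer", "code"}, "tech"),
    ({"computer", "python", "network"}, "tech"),
]
K = 2

# Step 1: KNN turns every labeled page into a numeric training sample.
# Each page is left out of its own neighbor pool so it cannot match itself.
X = [knn_numeric_vector(t, LABELED[:i] + LABELED[i + 1:], CLASSES, K)
     for i, (t, _) in enumerate(LABELED)]
y = [1 if c == "sports" else -1 for _, c in LABELED]

# Step 2: the SVM is trained on the numeric vectors, not on the terms.
w, b = train_linear_svm(X, y)

def classify(terms):
    """Numericize a test page with KNN, then let the SVM decide."""
    x = knn_numeric_vector(terms, LABELED, CLASSES, K)
    return "sports" if svm_predict(w, b, x) == 1 else "tech"

print(classify({"goal", "team", "league"}))
print(classify({"python", "network", "code"}))
```

In this sketch the SVM never sees words, only the per-class similarity scores, which is the sense in which KNN "numericizes" the term feature vector. A multi-class version would need several binary SVMs (for example one-against-one), which is left out here.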

Authors: Wei Mengjuan (魏梦娟), Luo Wenlong (罗文龙)

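Affiliations: Computing Center, Guangzhou Municipal Bureau of Statistics; School of Mathematics and Computational Science, Sun Yat-sen University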

Source: Modern Computer (《现代计算机》), 2008, No. 7, pp. 92-94 (3 pages)

Keywords: K-Nearest Neighbor (KNN); Support Vector Machine (SVM); conversion of term (word) feature vectors to numeric vectors

Classification number: TP391.1 [Automation and Computer Technology: Computer Application Technology]



