期刊文献+

一种提高K近邻分类的新方法 被引量:3

A New Method to Scale Up Effect of K-Nearest-Neighbor
下载PDF
导出
摘要 KNN算法是数据挖掘技术中比较常用的分类算法。但是,当样本容量较大以及特征属性较多时,KNN算法分类精度和效率将大大降低。该文将主分量分析(PCA)与粗糙集理论(RS)应用于样本特征提取中,首先采用PCA对输入向量进行甄别,应用粗糙集理论约简与分类无关或关系不大的向量。然后利用模拟退火算法实现随机属性子集选择,组合K近邻分类器,最后利用简单投票方法,对多重K近邻分类器进行组合输出,有效地改进了K近邻法的分类精度和效率。 The k-Nearest-Neighbor (KNN) algorithm has been widely used in data mining areas. But, When the samples become more and more large and characteristic attributes become more and more numerous, then KNN algorithm becomes much lower. A improved KNN algorithm PRMKNN is proposed in the paper ,which first applies Principle Component Analysis(PCA)and rough set theory(RS) to realize feature extraction, We use PCA on selecting the input vector,and use RS on reducing the inessential factors for classification ,then simulation annealing algorithm is used to generate random subset of attributes, and with the simple voting method, the outputs of the multiple KNN classifiers are combined. The method can improve the classification precision and efficiency effectively.
作者 茹强喜 刘永
出处 《电脑知识与技术(过刊)》 2010年第3X期1989-1991,共3页 Computer Knowledge and Technology
关键词 主分量分析 粗糙集 模拟退火 K近邻 组合模型 principle component analysis rough set simulated annealing k-Nearest-Neighbor combination model
  • 相关文献

参考文献11

二级参考文献33

  • 1田澎,杨自厚,张嗣瀛.一类非线性规划的模拟退火求解[J].控制与决策,1994,9(3):173-177. 被引量:11
  • 2张德富,顾卫刚,沈平.一种解旅行商问题的并行模拟退火算法[J].计算机研究与发展,1995,32(2):1-4. 被引量:11
  • 3黄昌宁 等.对自动分词的反思[A]..语言计算与基于内容的文本处理[C].北京:清华大学出版社,2003,7.26-38.
  • 4[1]LIU H,MOTODA H.Feature Selection for Knowledge.Discovery and Data Mining.[M]Boston,Kluwer Aca demic Publishers,1998.
  • 5[2]P.J.M.VAN LAARHOVEN,E.H.L.AATRS,Simu lated Annealing.Theory and applications[M],D.Redidel,1987.
  • 6[3]DASH M,LIUH.Feature selection for Classificat-ion.[J]Intelligent Data Analysis,1997,1 (3):131-156.
  • 7[4]SIEDKLECKI W,SKLANSKY J..On automatic feature selection.[J] Intemational Journal of Pattern Recognition and Artifical Intelligence,1988,9(2):19-22.
  • 8PAWLAK Z.Rough Sets[J].International Journal of Computer Information Science,1982,11 (5):341-356.
  • 9PAWLAK Z,GRZYMALA-BUSSE J,SLOWINSKI R,et al.Rough Sets[J].Communications of the ACM,1995,38(11).
  • 10邢文循 谢金星.现代优化计算方法[M].北京:清华大学出版社,1999.90-129.

共引文献286

同被引文献14

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部