期刊文献+

基于属性值相关距离的KNN算法的改进研究 被引量:28

Improved the KNN Algorithm Based on Related to the Distance of Attribute Value
下载PDF
导出
摘要 样本距离机制的定义直接影响到KNN算法的准确性和效率。针对传统KNN算法在距离的定义及类别决定上的不足,提出了利用属性值对类别的重要性进行改进的KNN算法(FCD-KNN)。首先定义两个样本间的距离为属性值的相关距离,此距离有效度量了样本间的相似度。再根据此距离选取与待测试样本距离最小的K个近邻,最后根据各类近邻样本点的平均距离及个数判断待测试样本的类别。理论分析及仿真实验结果表明,FCD-KNN算法较传统KNN及距离加权-KNN的分类准确性要高。 Definition of the samples will directly impact on the accuracy and the efficiency of KNN. In view of disadvantages to the traditional KNN algorithm on the distance the definition and categories of decision, proposed the use of attribute importance to category to improve KNN algorithm (FCD-KNN). At first, a distance of the two samples is defined as the correlation distance of the same attribute values. The distance can effectively measure the similarity degree of the two sample. Secondly, According to this distance selects the k nearest neighbors. Finally, the category of the test sample is decided by the average distance and the numbers on the respective category. The theoretical analysis and the simulation experiment show that compared with KNN and-KNN, raised the rate of accuracy enormously in classification.
机构地区 河池学院
出处 《计算机科学》 CSCD 北大核心 2013年第11A期157-159,187,共4页 Computer Science
基金 广西教育厅科研基金项目(201106LX577 201106LX604) 国家自然科学基金项目(40971234) 河池学院青年科研项目(2012B-N005 2012B-N007)资助
关键词 KNN算法 相关距离 属性值 样本距离机制 KNN algorithm, Correlation distances, Attribute, Sample distance mechanism
  • 相关文献

参考文献8

二级参考文献59

  • 1陈振洲,李磊,姚正安.基于SVM的特征加权KNN算法[J].中山大学学报(自然科学版),2005,44(1):17-20. 被引量:51
  • 2王煜,王正欧,白石.用于文本分类的改进KNN算法[J].中文信息学报,2007,21(3):76-82. 被引量:15
  • 3王煜,张明,王正欧,白石.用于文本分类的改进KNN算法[J].计算机工程与应用,2007,43(13):159-162. 被引量:6
  • 4魏孝章,豆增发.一种基于信息增益的K-NN改进算法[J].计算机工程与应用,2007,43(19):188-191. 被引量:9
  • 5Dasarathv B V.Nearest neighbor(NN) norms NN pattern classification techniques[M].Las Alarnitos,California:IEEE Computer Society Press, 1991.
  • 6Joachins T.Text categorization with support vector machines learning with many relevant features[C]//Proceedings of ECML-98 10th European Conference on Machine Learning.Berline:Springer-Verlag, 1998: 137-142.
  • 7Cover T M,Hart P E.Nearest neighbor pattern classification[J]. IEEE Transactions on Information Theory, 1968, IT- 13 : 21-27.
  • 8D'Amato C,Malerba D,Esposito F,et al.Extending the K-Nearest Neighbour classification algorithm to symbolic objects [EB/OL]. ( 2006 ).http://www.di.uniba.it-malerba/.
  • 9Pawlak Z.Rough sets[J].International Journal of Computer Information Science, 1982, 11 (5) : 341-356.
  • 10Martinez A M,Kak A C.PCA versus LDA[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2001,23(2):228-233.

共引文献107

同被引文献232

引证文献28

二级引证文献131

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部