期刊文献+

基于多属性分类的KNN改进算法

An Improved KNN Algorithm Based on Multi-attribute Classification
下载PDF
导出
摘要 提出了一种基于多属性分类的KNN改进算法,可有效提高传统的欧几里德KNN算法和基于信息熵的KNN改进算法的分类准确度.首先,按照单个属性不同属性值的个数占整个属性包含样本的比例进行属性的分类,分为基于信息熵的KNN算法处理的离散属性和基于传统欧几里德KNN相似度处理的连续属性两类,然后分别对不同属性进行区别处理;其次,将两类不同处理后得到的结果按比例求和作为样本之间的距离;最后,选取与待测样本的距离最小的k个样本判断测试样本的决策属性类别. To improve the classification accuracy of the conventional Euclidean KNN algorithm and the im-proved KNN algorithm based on information entropy,this paper proposes an improved KNN algorithm based on multi-attribute classification. The procedures of the new algorithm comprise:i) classify the attributes according to the percentage of their attribute values in an entire attribute of sample set into those discrete attributes suit-able for entropy-based KNN algorithm and those continuous attributes suitable for conventional Euclidean KNN similarity-based algorithm;ii) process the two types of attributes separately and then sum up the two series of results with weighing and put the sum as the distance between samples;iii) select k samples those are closest to the test sample to determine the decision attribute type of the test sample.
出处 《鞍山师范学院学报》 2013年第6期38-41,59,共5页 Journal of Anshan Normal University
关键词 离散属性 连续属性 KNN算法 多属性分类 Discrete attribute Continuous attribute KNN algorithm Multi-attribute classification
  • 相关文献

参考文献6

二级参考文献27

  • 1魏孝章,豆增发.一种基于信息增益的K-NN改进算法[J].计算机工程与应用,2007,43(19):188-191. 被引量:9
  • 2黄金才 陈文伟.遗传算法和模糊神经网络相结合在数据开采中的应用[J].清华大学学报,1998,.
  • 3Wu Xindong,Kumar V,Quinlan J R,et al.Top 10 algorithms in data mining[J].Knowledge and Information Systems,2008,14(1 ): 1-37.
  • 4Zou Wen,Genetic Programming Conference,1997年
  • 5赵振宇,模糊理论与神经网络的基础和应用,1996年
  • 6黄金才,清华大学学报,1998年
  • 7康塔尼克 闪四清译.数据挖掘:概念、模型、方法和算法[M].北京:清华大学出版社,2003..
  • 8Hayashi K. Multi-criteria Analysis for Agricultural Resource Management: A Critical Survey and Future Perspectives [J]. European Joumal of Operational Research, 2000, 122:486-500.
  • 9Takahara Y, Chen Xiaohong. Contribution of Mathematical General Systems Theory to Organization Theory Integration of Organizational Behaviors on Macro and Micro Levels [A]. Cybernetics and Systems 2000 Fifteeth European Meeting on Cybernetics and Systems Research [C], University of Vienna, 2000, 4(25-28):31-36.
  • 10Shafer J C, Agrawal R, Mehta M. SPRINT: A Scalable Parallel Classifier for Data Mining [J]. Proc.of the 22nd Int. Conf.on Very Large Databases. Mumbai(Bombay), India, 1996.

共引文献52

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部