期刊文献+

基于最重要特征的裁剪k-近邻分类算法设计 被引量:5

Design of cutting k-nearest neighbor classification algorithm based on the most important feature search
下载PDF
导出
摘要 k-近邻分类算法是机器学习分类算中一个重要的算法。其精度高具有广泛应用。但时间和空间复杂度高。本文着眼于此,根据香农熵理论,提出了一种通过计算信息增益寻找对分类结果影响最大的特征,并根据该特征进行原始训练集划分并进行样本裁剪构造训练子集,在该子集上应用传统k-近邻的方法,从而降低传统k-近邻分类算法的时间复杂度。实验结果表明,该方法保持了传统k-近邻分类算法的精度,引入了最重要特征对分类结果的影响,有效降低了传统k-近邻分类算法的时间复杂度。 K- nearest neighbor classification algorithm is an important algorithm in machine learning classification. High precision makes it widely used. But it has shortcoming of high time complexity and storage space. This paper focuses on this,according to the Shannon entropy theory,proposes a method which calculates the information gain of the most important feature Influencing classification results,and according to it,divides the original training set into several subsets. Base on the subsets,this paper constructs a new training subset which has less sample than the original .After that,this paper apply the traditional k- nearest neighbor algorithm in the new training subset,in order to reduce the traditional knearest neighbor classification algorithm’s time complexity. Experimental results show that the method of this paper keeps the accuracy of the traditional k- nearest neighbor classification algorithm,and introduces the most important characteristic of the classification result,and effectively reduces the time complexity of the traditional k- nearest neighbor classification algorithm.
作者 赵琳 行致源 ZHAO Lin;XING Zhi-yuan(School of Computing,Xi’an Aeronautical University,Xi’an 710100,China;College of Aerospace Engineering,Xi’an 710077,China)
出处 《电子设计工程》 2019年第14期135-138,共4页 Electronic Design Engineering
关键词 K-近邻 香农熵 分类 划分 k-Nearest Neighbor Shannon entropy classification divide
  • 相关文献

参考文献12

二级参考文献210

共引文献146

同被引文献31

引证文献5

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部