期刊文献+

基于信息熵的精确属性赋权K-means聚类算法 被引量:36

Accurate property weighted K-means clustering algorithm based on information entropy
下载PDF
导出
摘要 为了进一步提高聚类的精确度,针对传统K-means算法的初始聚类中心产生方式和数据相似性判断依据,提出一种基于信息熵的精确属性赋权K-means聚类算法。首先利用熵值法对数据对象的属性赋权来修正对象间的欧氏距离,然后通过比较初聚类的赋权类别目标价值函数,选择高质量的初始聚类中心来进行更高精度和更加稳定的聚类,最后通过Matlab编程实现。实验证明该算法的聚类精确度和稳定性要明显高于传统K-means算法。 Concerning the initial clustering center generation and the data similarity judgment basis of the traditional K-means algorithm,the paper proposed an accurate property weighted K-means clustering algorithm based on information entropy to further improve the clustering accuracy.First,property weights were determined by using entroy method to correct the Euclidean distance.And then,high-quality initial clustering center was chosen by comparing the empowering target cost function of the initial clusters for more accurate and more stable clustering.Finally,the algorithm was implemented in Matlab.The experimental results show that the algorithm accuracy and stability are significantly higher than the traditional K-means algorithm.
出处 《计算机应用》 CSCD 北大核心 2011年第6期1675-1677,共3页 journal of Computer Applications
关键词 K-MEANS 精确度 信息熵 属性赋权 初始聚类中心 K-means accuracy information entropy property weight initial clustering center
  • 相关文献

参考文献11

  • 1MUATA K, BRYSO O. Towards supporting expert evaluation of clustering results using a data mining process model[ J]. Information Sciences, 2010, 180(3) : 414 -431.
  • 2CAO F Y, LIANG J Y, JIANG G. An initialization method for the K- means algorithm using neighborhood model [ J]. Computers and Mathematics with Applications, 2009, 58(3) :474 -483.
  • 3ALIK K R. An efficient K-means clustering algorithm[ J]. Pattern Recognition Letters, 2008, 29(9) : 1385 - 1391.
  • 4REDMOND S J, HENEGHAN C. A method for initialising the K-means clustering algorithm using KD-trees[ J]. Pattern Recognition Letters, 2007, 28(8) : 965 -973.
  • 5LAI J Z C, HUANG T J, LIAW Y C. A fast K-means clustering algorithm using cluster center displacement[ J]. Pattern Recognition, 2009,42(11): 2551 -2556.
  • 6HAN J W, KAMBER M. Data mining concepts and techniques [ M]. 2nd ed. San Francisco: Morgan Kaufmann Publishers, 2006: 383 -461.
  • 7孟洋,赵方.基于信息熵理论的动态规划特征选取算法[J].计算机工程与设计,2010,31(17):3879-3881. 被引量:6
  • 8陈雷,王延章.熵权法对融合网络服务质量效率保障研究[J].计算机工程与应用,2005,41(23):1-3. 被引量:3
  • 9高孝伟.熵权法在教学评优中的应用研究[J].中国地质教育,2008,17(4):100-104. 被引量:12
  • 10UCI Machine Learning Repository [ DB/OL]. [ 2010 - 12 - 20]. http://archive, ics. uci. edu/ml/.

二级参考文献16

共引文献16

同被引文献356

引证文献36

二级引证文献194

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部