期刊文献+

混合型数据聚类方法的比较 被引量:2

Comparison of Clustering Methods for Mixed Data
下载PDF
导出
摘要 为了科学使用真实世界数据,探索适用于日益常见的混合型数据的聚类方法,文章分析和比较了两种典型的混合型数据聚类方法K-prototypes与ClustMD,改进了聚类方法关键参数选择方法,并提出聚类稳定性指标。结果表明,两种聚类方法均具有很高的有效性和稳定性,各有优缺点。当数据相关性强、数据缺失严重或非连续变量较多时,建议使用K-prototypes。 In order to scientifically use real world data,this paper explores the clustering methods applicable to the increasingly common mixed medical data. The paper analyzes and compares the two typical clustering methods:K-prototypes and ClustMD,improves the key parameter selection method,and also proposes the clustering stability index. Cases analysis results indicate that the two methods are highly effective and stable,each with advantages and disadvantages. When data correlation is strong,data missing is serious or there are relatively more non-continuous variables,K-prototypes is recommended for hybrid data.
作者 刘超 姚清华 乐然 Liu Chao;Yao Qinghua;Le Ran(Mathematics and Systems Science Institute,Beijing University of Aeronautics and Astronautics,Beijing 100083,China;LMIB of the Ministry of Education,Beijing University of Aeronautics and Astronautics,Beijing 100083,China;Academy for Advanced Interdisciplinary Studies,Peking University,Beijing 100871,China)
出处 《统计与决策》 CSSCI 北大核心 2019年第11期64-67,共4页 Statistics & Decision
关键词 混合型数据 聚类有效性 聚类稳定性 mixed data clustering validity clustering stability
  • 相关文献

参考文献4

二级参考文献35

  • 1林作铨,牟克典,韩庆.基于未知扰动的冲突证据合成方法[J].软件学报,2004,15(8):1150-1156. 被引量:27
  • 2王宇,杨莉.基于凝聚函数的混合属性数据聚类算法[J].大连理工大学学报,2006,46(3):446-448. 被引量:2
  • 3赵宇,李兵,李秀,刘文煌,任守榘.混合属性数据聚类融合算法[J].清华大学学报(自然科学版),2006,46(10):1673-1676. 被引量:9
  • 4杨春宇,周杰.一种混合属性数据流聚类算法[J].计算机学报,2007,30(8):1364-1371. 被引量:22
  • 5GAN G,YANG Z,WU J.A genetic fuzzy K-modes algorithm for clustering categorical data[J].Expert Systems with Applications:An International Journal,2009,32(2):1615-1620.
  • 6HUANG Z.Extensions to the K-means algorithm for clustering large data sets with categorical values[J].Data Mining and Knowledge Discovery II,1998(2):283-304.
  • 7HUANG Z,MA N G.Fuzzy K-modes algorithm for clustering categorical data[J].IEEE Transacitons on Fuzzy Systems,1999,7(4):446 -452.
  • 8Dunn J.Well separated clusters and optimal fuzzy partitions[J].J Cybern, 1974,4( 1 ) :95-104.
  • 9Calinski T,Harabasz J.A dendrite method for cluster analysis[J]. Comm in Statistics, 1974,3 ( 1 ) : 1-27.
  • 10Maulik U, Bandyopadhyay S.Performance evaluation of some clustering algorithms and validity indices[J].IEEE PAMI, 2002, 24:1650-1654.

共引文献45

同被引文献14

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部