期刊文献+

一种基于聚类集成技术的混合型数据聚类算法 被引量:6

Clustering Algorithm for Mixed Data Based on Clustering Ensemble Technique
下载PDF
导出
摘要 提出了一种基于集成技术和谱聚类技术的混合数据聚类算法CBEST。它利用聚类集成技术产生混合数据间的相似性,这种相似性度量没有对数据特征值分布模型做任何的假设。基于此相似性度量得到的待聚类数据的相似性矩阵,应用谱聚类算法得到混合数据聚类结果。大量真实和人工数据上的实验结果验证了CBEST的有效性和它对噪声的鲁棒性。与其它混合数据聚类算法的比较研究也证明了CBEST的优越性能。CBEST还能有效融合先验知识,通过参数的调节来设置不同属性在聚类中的权重。 A clustering algorithm based on ensemble and spectral technique named CBEST that works well for data with mixed numeric and categorical features was presented.A similarity measure based on clustering ensemble was adopted to define the similarity between pairs of objects,which makes no assumptions of the underlying distributions of the feature values.A spectral clustering algorithm was employed on the similarity matrix to extract a partition of the data.The performance of CBEST was studied on artificial and real data sets.Results demonstrate the effectiveness of this algorithm in clustering mixed data tasks and its robustness to noise.Comparisons with other related clustering schemes illustrate the superior performance of this approach.Moreover,CBEST can infuse prior knowledge effectively to set the weights of different features in clustering.
作者 罗会兰 危辉
出处 《计算机科学》 CSCD 北大核心 2010年第11期234-238,274,共6页 Computer Science
基金 国家973项目(No2010CB327900) 国家自然科学基金(No60303007) 上海科技发展基金(No08511501703) 上海市智能信息处理重点实验室开放课题(NoIIPL-09-009)资助
关键词 聚类集成 混合型数据 相似性度量 Clustering ensemble Mixed data Similarity measure
  • 相关文献

参考文献10

  • 1Mckusick K B,Thompson K.COBWEB/3:A portable imple-mentation. FIA-90-6-182 . 1990
  • 2Reich Y,,Fenves S.The formation and use of abstract concepts in design Concept Formation:Knowledge and Experience in Un-supervised Learning[]..1991
  • 3Li C,Biswas G.Unsupervised Learning with Mixed Numeric and Nominal Data[].IEEE TransKnowlData Eng.2002
  • 4He Z,Xu X,Deng S.Clustering Mixed Numeric and Categorical Data:A Cluster Ensemble Approach[]..
  • 5Fred A L N.Finding Consistent Clusters in Data Partitions[].Multiple Classifier SystemsSecond International WorkshopMCS.2001
  • 6Fred A L N,Jain A K.Data Clustering using Evidence Accumulation[].Procof theth IntlConference on Pattern Recognition ICPR.2002
  • 7He Z,Xu X,Deng S.A cluster ensemble method for clustering cate-gorical data[].Information Fusion.2005
  • 8Zelnik-Manor L,Perona P.Self-Tuning Spectral Clustering[].Eighteenth Annual Conference on Neural Information Proces-sing Systems(NIPS).2004
  • 9http://www.ics.uci.edu/-mlearn/databases/ .
  • 10Huang,Z.Clustering Large Data Sets with MixedNumeric and Categorical Values[].Proceedings of The First Pacific-Asia Conference on Knowledge Discoveryand Data Mining.1997

同被引文献70

  • 1乔珠峰,田凤占,黄厚宽,陈景年.缺失数据处理方法的比较研究[J].计算机研究与发展,2006,43(z1):171-175. 被引量:13
  • 2李士进,朱跃龙,刘净.一种基于k-prototype的多层次聚类改进算法[J].河海大学学报(自然科学版),2007,35(3):342-347. 被引量:1
  • 3Xu Rui,Wunsch Donald. Survey of Clustering Algorithm[J]. IEEE Transactions on Neural Net works, 2005,16(3) :645- 678.
  • 4Johannes Grabmeier, Andreas Rudolph. Techniques of Cluster Algorithms in Data Mining[J]. Data Mining and Knowl- edge Discovery ,2002,6(4) :303-360.
  • 5Ralambondrainy H. A Conceptual Version of the k-means Algorithm[J]. Pattern Recognition Letters, 1995,16(11) :1147- 1157.
  • 6Wang Hui,Dubitzky Werner. A Flexible and Robust Similarity Measure Based on Contextual Probability [C]//Proceed- ings of the International Joint Conference on Artificial Intelligence,2005.
  • 7Huang Z X. Clustering Large Data Sets with Mixed Numeric and Categorical Values [C]//Proceedings of the East Pacific- Asia Conference on Knowledge Discovery and Data Mining, 1997.
  • 8Li Cen, Biswas Gautan. Unsupervised Learning with Mixed Numeric and Nominal Data[J] IEEE Transactions on Knowl- edfze Data Engineering, 2009,14(4) : 673-682.
  • 9Goodall D W. A New Similarity Index Based On Probability[J]. Biometrics, 1996,22: 882-892.
  • 10He Z Y,Xu X. Clustering Mixed Numeric and Categorical Data:A Cluster Ensemble Approach [J]. Computer Science Artificial Intelligence, 2005,5(4) : 255-268.

引证文献6

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部