期刊文献+

一种自底向上的高维聚类算法

Bottom-up Clustering Algorithm of High Dimension
下载PDF
导出
摘要 基于小波变换的聚类算法是高效的,能够探测到任意形状的聚类,可成功去除孤立点,并对输入数据的顺序不敏感.但当聚类维数增长时,算法的有效性会降低,计算复杂度也相当可观.采用自底向上的思想对小波聚类算法进行改进,使之适合高维聚类,并将改进算法并行化以增强可伸缩性.实验表明改进算法并未影响聚类质量,而且可有效地进行高维聚类,并降低了计算复杂度. Clustering algorithm based on wavelet transform is efficient, and which can detect clusters of arbitrary shape. It is insensitive to the outliers and the order of input data. However, efficiency of the algorithm would be degraded, and computation complexity of the algorithm would be considerable with increase of clustering dimensions. A bottom-u Pmethod is put forward to make the original algorithm fit to clustering in high dimension, and the scalability of the improved algorithm is enhanced by parallelization. The experiment demonstrates that the improved algorithm has no impact on quality of clustering and has a good efficient in high dimension clustering and in decrease of comnutation comnlexity.
出处 《重庆大学学报(自然科学版)》 EI CAS CSCD 北大核心 2006年第9期106-110,共5页 Journal of Chongqing University
基金 教育部博士点基金资助项目(2004061102) 重庆大学研究生院创新项目(200506Y1A0230130)
关键词 聚类 小波变换 自底向上 并行 clustering wavelet-transform bottom-up parallel
  • 相关文献

参考文献10

  • 1KAUFMAN L,ROUSSEEUW PJ.Finding Groups in Data:An Introduction to Cluster Analysis[M].New York:John Wiley & Sons,1990.
  • 2NGR T,HAN J.Efficient and Effective Clustering Methods for Spatial Data Mining[A].In Proc 1994 Int Conf Very Large Data Bases(VLDB' 94)[C].Santiago:Chil,1994.144-155.
  • 3GUHA U,RASTOGI R,SHIM K.CURE:An Efficient Clustering Algorithm for Large Databases[J].Information System,2001,1(26):35-58.
  • 4ZHANG T,RAMARKRISHNAN R,LIVNYM.BIRCH:An Efficient Data Clustering Method for Very Large Database[A].ACM SIGMOD Conference 1996[C].Canada:Montreal,1996.
  • 5ESTER M,KRIEGEL H P,SANDER J.A Density-based Algorithm for Discovering Clusters in Large Spatial Database with Noise[A].In Proc 2ndInt' l Conf Knowledge Discovering in Database and Data Mining(KDD-96)[C].USA:Portland,Oregon,1996.
  • 6ANKERST M,BREUNIG M,KRIEGEL H P.OPTICS:Ordering Points to Identity the Clustering Structure[A].In Proc 1999ACM-SIGMOD Int Conf Management of Data(SIGMOD'99)[C].PA:Philadelphia,1999.49-60.
  • 7AGRAWAL,GEHRKE J,GUNOPULOS D.Automatic Subspace Clustering of High Dimensional Data Mining Applications[A].In Proc ACM SIGMOD International Conf Management of Data[C].Washington:Seattle,1998.73-84.
  • 8SHEIKHOLESLAMI G,CHATTERJEE S,ZHANG A.Wave-Cluster:A Multiresolution Clustering Approach for Very Large Spatial Database[A].In Proc 1998 Int Conf Very Large Data Base[C].New York:VLDB,1998.428-439.
  • 9WANG W,YANG J,MUNTZ R.STING:A Statistical Information Grid Approach to Spatial Data Mining[A].In Proc 1997 Int Conf Very Large Data Bases(VLDB' 97)[C].Greece:Athens,1997.186-195.
  • 10SUNDERAM V S,GEISTt G A.The PVM Concurrent System:Evolution,Experiences,and Trends[M].USA:Department of Energy,1993.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部