摘要
近年来在数据分析中最广泛研究的问题之一就是在多维数据集中识别聚类或密集区域。为了解决大型数据集和最小化I/O成本的问题。由此提出一种基于层次结构的数据聚类方法——平衡迭代和聚类方法 BIRCH。论文中对BIRCH聚类算法性能从时间/空间效率、对算法参数改变下的Calinski-Harabasz指数和聚类质量等方面进行了评估,并和经典的CLARANS算法进行了性能比较。
In recent years,one of the most widely studied problems in data analysis is the identification of clusters or dense regions in multidimensional datasets.To address the issues of large datasets and minimizing I/O costs,a hierarchical data clustering method called Balanced Iterative Reducing and Clustering using Hierarchies(BIRCH)has been proposed.In this article,the performance of the BIRCH clustering algorithm is evaluated in terms of time/space efficiency,Calinski-Harabasz index under varying algorithm parameters,and clustering quality.A performance comparison is also conducted with the classic CLARANS algorithm.
作者
杨茜
吕杨
周俊山
张芮
YANG Xi;LYU Yang;ZHOU Junshan;ZHANG Rui(Naval Petty Officer Academy,Bengbu 233010)
出处
《舰船电子工程》
2024年第4期115-118,共4页
Ship Electronic Engineering