期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Improved k-means clustering algorithm 被引量:16
1
作者 夏士雄 李文超 +2 位作者 周勇 张磊 牛强 《Journal of Southeast University(English Edition)》 EI CAS 2007年第3期435-438,共4页
In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering a... In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal clustering number Kopt of a data set with unknown class information is confirmed by calculating the silhouette coefficient of objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial clustering-centers are confirmed. Finally, the clustering is completed by the traditional k-means clustering. By the theoretical analysis, it is proved that the improved k-means clustering algorithm has proper computational complexity. The experimental results of IRIS testing data set show that the algorithm can distinguish different clusters reasonably and recognize the outliers efficiently, and the entropy generated by the algorithm is lower. 展开更多
关键词 CLUSTERING k-means algorithm silhouette coefficient
下载PDF
Application of k-means clustering to environmental risk zoning of the chemical industrial area 被引量:4
2
作者 Weifang SHI Weihua ZENG 《Frontiers of Environmental Science & Engineering》 SCIE EI CAS CSCD 2014年第1期117-127,共11页
The homogeneous risk characteristics within a sub-area and the heterogeneous from one sub-area to another are unclear using existing environmental risk zoning methods. This study presents a new zoning method by determ... The homogeneous risk characteristics within a sub-area and the heterogeneous from one sub-area to another are unclear using existing environmental risk zoning methods. This study presents a new zoning method by determining and categorizing the risk characteristics using the k-means clustering data mining technology. The study constructs indices and develops index quantification models for environmental risk zoning by analyzing the mechanism of environmental risk occurrence. We calculate the source risk index, air risk field index, water risk field index, and target vulnerability of the study area with Nanjing Chemical Industrial Park using a 100 m - 100 m mesh grid as the basic zoning unit, and then use k-means clustering to analyze the environmental risk in the area. We obtain the optimal clustering number with the largest average silhouette coefficient by calculating the average silhouette coefficients of clustering at different k-values. The clustering result with the optimal clustering number is then used for the environmental risk zoning, and the zoning result is mapped using the geographic information system. The study area is divided into five sub-areas. The common environmental risk characteristics within the same sub-area, as well as the differences between sub- areas, are presented. The zoning is helpful in risk management and is convenient for decision makers to distribute limited resources to different sub-areas in the design of risk reducing intervention. 展开更多
关键词 environmental risk zoning k-means cluster-ing silhouette coefficient chemical industrial park RISKMANAGEMENT
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部