
K-means Clustering Algorithm Based on Adaptive Feature Weighted

Cited by: 10
Abstract: To improve the accuracy and stability of the traditional K-means algorithm when clustering medical data, this paper proposes a K-means clustering algorithm with adaptive feature weights, named AFW-K-means. The algorithm first selects the initial cluster centers by computing the mean square deviation of each attribute. Then, based on the result of the current iteration, it adjusts the weight of each attribute in the distance formula according to the principle of small within-cluster distance and large between-cluster distance, so that the weighted distance more accurately reflects the true distance between data points in Euclidean space. Finally, the algorithm is validated on UCI data sets, including the Breast Cancer Wisconsin (BCW) data set. The results show that its accuracy and stability are both clearly better than those of the traditional K-means algorithm.
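The abstract outlines three steps (spread-based initialization, weighted distance assignment, weight adjustment) without giving formulas, so the following is only a minimal sketch under stated assumptions, not the paper's AFW-K-means. The function names, the initialization rule (splitting points along the feature with the largest standard deviation) and the weight rule (between-cluster scatter divided by within-cluster scatter, normalized to sum to 1) are all hypothetical choices made for illustration. The sketch also assumes `k` does not exceed the number of samples.

```python
import numpy as np


def init_centers_by_spread(X, k):
    """Hypothetical deterministic initialization: order the points along the
    feature with the largest standard deviation, cut them into k groups of
    (nearly) equal size, and use each group's mean as a starting center."""
    j = np.argmax(X.std(axis=0))
    order = np.argsort(X[:, j], kind="stable")
    groups = np.array_split(order, k)
    return np.vstack([X[g].mean(axis=0) for g in groups])


def update_weights(X, labels, centers, eps=1e-12):
    """Assumed weight rule: a feature gets more weight when clusters are
    compact (small within-cluster scatter) and well separated (large
    between-cluster scatter) along it."""
    n_features = X.shape[1]
    overall = X.mean(axis=0)
    within = np.zeros(n_features)
    between = np.zeros(n_features)
    for c in range(centers.shape[0]):
        members = X[labels == c]
        if len(members) == 0:
            continue  # an empty cluster contributes nothing
        within += ((members - centers[c]) ** 2).sum(axis=0)
        between += len(members) * (centers[c] - overall) ** 2
    ratio = between / (within + eps)
    total = ratio.sum()
    if total <= 0:
        # No separation on any feature: fall back to uniform weights.
        return np.full(n_features, 1.0 / n_features)
    return ratio / total


def afw_kmeans_sketch(X, k, max_iter=100):
    """Feature-weighted K-means loop; returns (labels, centers, weights)."""
    X = np.asarray(X, dtype=float)
    centers = init_centers_by_spread(X, k)
    weights = np.full(X.shape[1], 1.0 / X.shape[1])
    labels = None
    for _ in range(max_iter):
        # Weighted squared Euclidean distance from every point to every center.
        diff = X[:, None, :] - centers[None, :, :]
        dist = (weights * diff ** 2).sum(axis=2)
        new_labels = dist.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            members = X[labels == c]
            if len(members) > 0:
                centers[c] = members.mean(axis=0)
        weights = update_weights(X, labels, centers)
    return labels, centers, weights
```

Calling `afw_kmeans_sketch(X, k)` on a numeric array of shape `(n_samples, n_features)` returns the cluster labels, the final centers and the learned feature weights. Because the initialization is deterministic, repeated runs on the same data give the same result, which is one plausible reading of the stability the abstract refers to.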
Source: Computer Technology and Development (《计算机技术与发展》), 2013, No. 6, pp. 98-101, 105 (5 pages)
Funding: National Natural Science Foundation of China (51069004)
Keywords: K-means; medical data clustering; adaptive feature weight (AFW); cluster evaluation; confusion matrix