期刊文献+

基于样本与特征双加权的自适应FCM聚类算法 被引量:5

Adaptive FCM clustering algorithm based on sample and feature weights
下载PDF
导出
摘要 针对传统FCM算法无法获得令人满意的聚类结果的问题,提出了基于样本与特征双加权的自适应FCM聚类算法。采用特征和样本双加权的策略,以特征权重信息熵作为代价函数,与样本权重、特征权重相融合,通过迭代优化的方法动态计算各属性特征对不同类别的权重系数、每个样本对聚类的重要性权重值,综合考虑各个样本的贡献度和各个特征的重要性,从而达到提高聚类结果质量的目的。使用5个来自UCI的标准机器学习数据集,对聚类算法的有效性进行验证。结果表明,对于具有不同样本贡献度和不同特征重要性的数据集,提出的算法具有较好的聚类效果。 As a classical machine learning algorithm, FCM (fuzzy C-means ) clustering algorithm is widely used in many applications. For datasets with different feature importance and different sample con- tribution, features weight based algorithms are proposed to improve the clustering quality in some degree, but can not obtain the optimal result. Based on the weighting strategy of feature and sample, the cost function of feature weight information entropy, and the combination of sample weight, attribute feature weight and objective function, an adaptive FCM clustering algorithm with both sample and feature weight is proposed, by adopting the iterative optimization to measure the weighting coefficients of each sample feature upon each cluster dynamically, as well as the importance of every sample to the clustering. The algorithm takes full account of the influence of different feature and sample contributions on the clustering results, so as to achieve the purpose of improving the clustering effect. The clustering algorithm is veri- fied by using a set of test data sets from the UCI standard machine. The experimental results show that the proposed algorithm has better clustering accuracy than classical clustering algorithm with different dis-tributions and different feature contributions.
作者 林甲祥 吴丽萍 巫建伟 张泽均 LIN Jiaxiang;WU Liping;WU Jianwei;ZHANG Zejun(College of Computer and Information Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China;Research Center for the Marine Environment Administration and Development Strategy, Third Institute of Oceanography, State Oceanic Administration, Xiamen 361001, China)
出处 《黑龙江大学自然科学学报》 CAS 2018年第2期244-252,共9页 Journal of Natural Science of Heilongjiang University
基金 国家自然科学基金资助项目(41401458) 福建省自然科学基金项目(2016J05148 2016J01753) 中国-东盟海上合作基金资助项目(2020399) 国家海洋局第三海洋研究所项目(2016020)
关键词 样本加权 特征加权 信息熵 模糊C-均值 聚类分析 sample weighted feature weighted information entropy fuzzy C-means cluster analysis
  • 相关文献

参考文献1

二级参考文献8

  • 1潘天红,李少远.基于模糊聚类的PWA系统的模型辨识[J].自动化学报,2007,33(3):327-330. 被引量:8
  • 2Bustince H,Burillo P.Structures on intuitionistic fuzzy realtions [J]. Fuzzy Sets and Systems,2006,78:293-303.
  • 3Bustince H.Construction of intuitionistic fuzzy relation with predetermined properties [J]. Fuzzy Sets and Systems, 2000, 109: 379-403.
  • 4ZHANG Hong-mei, XU Ze-shui, CHEN Qi. On clustering approach to intuitionistic fuzzy sets [C]. Control and Decision, 2007,22(8):882-888.
  • 5Eulalia Szmidt, Janusz Kacprxyk.Entropy for intuitionstic fuzzy sets[J]. Fuzzy Sets and Systems,2001(118):467-477.
  • 6Ioannis K Vlachos,George D Sergiadis.Intuitionistic fuzzy information-Applications to pattern recognition[J].Pattern Recognition Letters,2007(28): 197-206.
  • 7Dunn J C.A graph theoretic analysis of pattern classification via Tamura's fuzzy relation [J]. IEEE Trans SMC, 2000,4 (3): 310-313.
  • 8Bezdek J C.Pattem Recognition with fuzzy objective function algorithms[M].New York: Plenum Press, 2001.

共引文献2

同被引文献39

引证文献5

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部