广义洛伦兹内核函数在模糊C均值聚类中的应用研究

Research on Generalized Lorenz Kernel Function in Fuzzy C Means Clustering

下载PDF

导出

摘要模糊C均值(FCM)算法是数据聚类分析的主要算法。但在嘈杂环境下,对于抽样大小不一的聚类,数目越多准确性越低,上述弊端可通过替代性FCM(AFCM)的高斯内核映射来解决。鉴于AFCM的不足,提出了针对模糊C均值聚类的广义洛伦兹内核函数。利用该算法对鸢尾数据库进行聚类,将其划分成山鸢尾、变色鸢尾和维吉尼亚鸢尾3类。实验结果表明,广义洛伦兹模糊C均值(GLFCM)可实现对离群聚类和大小不等的聚类数据的分类,其结果优于K均值、FCM、替代性C均值(AFCM)、Gustafson-Kessel(GK)和Gath-Geva(GG)方法,收敛迭代次数比AFCM的更少,其分区索引(SC)效果也好于其他方法。 Fuzzy C means（FCM） algorithm is the main algorithm for data clustering analysis. But in a noisy environ- ment, for the clusters of different sampling sizes, accuracy is low when the number of clusters is large. The above disad- vantages can be sloved through the Gauss kernel mapping of alternative FCM（AFCM）. This paper proposed generalized Lorenz kernel function to the fuzzy C means clustering for the deficiency of AFCM. This algorithm was used to analyze the Iris database cluster, to classify the Iris database into three clusters of Iris setosa, Iris versicolour and Iris virginica. Experimental results show that the generalized lorentzian fuzzy C-means（GLFCM） can classify data of outliers and un- equal sized clusters. The GLFCM yields better cluster than K-means（KM）, FCM, alternative fuzzy C-means（AFCM）, Gustafson-Kessel（GK） and Gath-Geva（GG）. It takes less iteration than that of AFCM to converge. Its partition index （SC） is better than the others.

作者王建华李晓峰高巍巍

机构地区哈尔滨师范大学黑龙江外国语学院信息科学系

出处《计算机科学》 CSCD 北大核心 2015年第9期268-271,共4页 Computer Science

基金黑龙江省智能教育与信息工程重点实验室开放基金项目(1155xnc107) 黑龙江省教育厅科学技术研究项目(12543067)资助

关键词广义洛伦兹隶属函数 K均值替代性模糊C均值聚类离群聚类 Generalized lorentzian membership function, K-means, Alternative fuzzy C-means, Clustering, Outlier clustering

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献10

1Kaufman L,Rousseeuw P.Finding Groups in Data[M].Wiley Series in Probability and Statistic,2005:56-67.
2Mirkin B.Clustering for Data Mining:A Data Recovery Approach[M].Chapman and Hall,2005:12-24.
3Wang Xiang,Guo Rui,et al.A Novel Alternative WeightedFuzzy C-means Algorithm and Cluster Validity Analysis [C]∥IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application.2008:130-134.
4Hammerly G,Elkan C.Alternatives to the k-mean algorithm that find better clusterings[C]∥Proceedings of the 11th InternationalConference on Information and Knowledge Management,2002:600-607.
5郭小芳,李锋,宋晓宁,王卫东.基于连续域混合蚁群优化的核模糊C-均值聚类算法研究[J].模式识别与人工智能,2014,27(9):841-846. 被引量：5
6李广原,杨炳儒,刘英华,曹丹阳.基于模糊论的数据挖掘研究综述[J].计算机工程与设计,2011,32(12):4064-4067. 被引量：7
7李丽丽,李明,刘希玉.基于粒子群模糊C-均值聚类的图像分割算法[J].计算机工程与应用,2009,45(31):158-160. 被引量：12
8Liu X,Yang C.Performance research of Gaussian functionweighted fuzzy C-means algorithm[C]∥Proceedings of SPIE.2007.
9Yang M S,Tsai H S.A Gaussian kernel-based fuzzy c-means algotihm with a spatial bias correction[J].Pattern Recognition Letters,2008,29(12):1713-1725.
10Ramathilagam S,Huang Yueh-min.Extended Gaussian kernelversion of fuzzy c-means in the problem of data analyzing[J].Expert Systems with Applications:An International Journal,2011,38(4):3793-3805.

二级参考文献27

1张敏,于剑.基于划分的模糊聚类算法[J].软件学报,2004,15(6):858-868. 被引量：176
2耿新青,王正欧.一种挖掘模糊相似关联规则的新方法[J].计算机应用,2005,25(5):985-988. 被引量：5
3王华秋,曹长修.基于模拟退火的并行粒子群优化研究[J].控制与决策,2005,20(5):500-504. 被引量：45
4刘晓龙,张佑生,谢颖.模拟退火与模糊C-均值聚类相结合的图像分割算法[J].工程图学学报,2007,28(1):89-93. 被引量：17
5马江洪,葛咏.图像线状模式的有限混合模型及其EM算法[J].计算机学报,2007,30(2):288-296. 被引量：12
6Bezdek J C.Pattern recognition with fuzzy objective function algorithms[M].New York : Plenum Press, 1981 : 95-107.
7Madar J,Abonyi J,Szeifert F.Interactive Particle Swarm Optimization[C]//Proceedings of the 2005 5th International Conference on Intelligent Systems Design and Applications,2005.
8Jain A K. Data Clustering: 50 Years Beyond K-means. Pattern Rec- ognition Letters, 2010, 31 (8) : 651-666.
9Ozbay Y, Ceylan R, Karlik B. Integration of Type-2 Fuzzy Cluste- ring and Wavelet Transform in a Neural Network Based ECG Classi- fier. Expert Systems with Applications, 2011, 38(1) : 1004-1010.
10Zhang D Q, Chen S C. A Novel Kernelized Fuzzy C-means Algo- rithm with Application in Medical Image Segmentation. Artificial Intelligence in Medicine, 2004, 32(1): 37-50.

共引文献21

1张小红,宁红梅.基于混沌粒子群和模糊聚类的图像分割算法[J].计算机应用研究,2011,28(12):4786-4789. 被引量：10
2黄新建,牛强.改进的粒子群模糊聚类方法[J].计算机工程与设计,2012,33(3):1132-1135. 被引量：1
3张永成,王洪辉,谭桂花,李小刚.基于模糊C均值聚类的沉积相定量识别——以川西某气田蓬莱镇组为例[J].科学技术与工程,2012,20(26):6570-6574. 被引量：5
4赵艳妮,郭华磊,李敬华.基于粒子群模糊C均值聚类的快速图像分割[J].电子设计工程,2012,20(18):167-169.
5刘教民,李勇征,王雷,王震洲.融合自控粒子群和免疫进化的入侵数据分类[J].计算机工程与应用,2013,49(14):101-104.
6李伟峰.一种新的PSO优化FCM方法在图像分类中的应用[J].软件导刊,2013,12(8):72-75. 被引量：3
7王怡.基于模糊交叉网格的初始聚类中心选取方法[J].福建师大福清分校学报,2015,33(2):26-29. 被引量：1
8孟凯.论数据挖掘在科技评估中的应用[J].中国科技纵横,2015,0(11):22-22.
9董倩.改进遗传算法优化模糊均值聚类中心的图像分割[J].吉林大学学报（理学版）,2015,53(4):680-686. 被引量：9
10顾兆军,韩迎亚.可量化的信息系统安全性水平评估[J].计算机工程与设计,2016,37(7):1729-1733. 被引量：7

1熊兴平.蝴蝶效应与市场营销——寻找引发销售风暴的那只蝴蝶[J].农药市场信息,2008(23):22-23.
2刘健庄,谢维信.一种改进的AFCM聚类算法[J].西安电子科技大学学报,1990,17(3):75-81. 被引量：1
3周丽华,黄成泉,王林.一种自动模糊聚类的算法[J].统计与决策,2014,30(20):16-19. 被引量：5
4红杉.双硬盘无法进系统之谜[J].计算机应用文摘,2009(7):73-73.
5徐雪松,张谓,宋东明,张宏,刘凤玉.基于核的PP主成分分析及其在离群聚类中的应用[J].计算机科学,2007,34(9):131-134. 被引量：1
6李艳灵,李刚,武津刚.基于微分进化算法的FCM图像分割算法[J].数学的实践与认识,2009,39(9):139-143. 被引量：2
7彭代强,李家强,林幼权.基于模糊隶属度空间约束的FCM图像分割[J].计算机科学,2010,37(10):257-259. 被引量：6
8丁恒,李延来,熊升华,陈振颂.基于三角函数的洛伦兹曲线模型构造研究[J].计算机应用研究,2014,31(11):3273-3280. 被引量：4
9徐雪松.非线性数据变换及其在离群聚类中的应用[J].软件导刊,2009,8(10):6-9.
10贾元春,谭跃生,王静宇,顾瑞春,张晓琳.基于Linux内核映射的校园网计费系统研究[J].内蒙古科技大学学报,2007,26(3):246-249.

计算机科学

2015年第9期

浏览历史

内容加载中请稍等...

广义洛伦兹内核函数在模糊C均值聚类中的应用研究

参考文献10

二级参考文献27

共引文献21

相关作者

相关机构

相关主题

浏览历史