期刊文献+

一种改进的模糊连接点聚类算法 被引量:1

An improved fuzzy joint points clustering algorithm
下载PDF
导出
摘要 传统的模糊连接点FJP聚类算法采用基于欧氏距离的最大-最小合成运算法生成传递闭包,该方法所生成的传递闭包存在失真问题,即包含有较多错误的数据关联信息,最终造成算法聚类精度低且计算时间长。针对以上问题,提出一种改进的模糊连接点聚类算法:先用组合核函数计算数据集的模糊相似度矩阵,提高算法对数据非线性特征的辨识能力,并用大顶堆存储之;然后遍历传递闭包矩阵中的空元素,用堆顶的桥元素填充传递闭包的空元素,直至生成传递闭包。在测试数据集上的实验结果表明,本文算法的平均聚类精度较传统FJP算法有20%以上的提升,显著改善了传递闭包的失真问题;另外,在大型数据集上的计算效率亦优于传统FJP算法的,说明本文改进FJP算法的思路是有效的、可行的。 Conventional fuzzy joint points clustering method calculates the transitive closure by the max-min composition based on Euclidean distance,which makes the transitive closure anamorphic and include a lot of false data association information,and low clustering precision and high time complexity.Aiming at these problems,we propose an improved fuzzy joint points clustering algorithm.We first use the combined kernel function to obtain the fuzzy similarity matrix,which can increase the nonlinear recognizing ability.Then we use the max-heap to store fuzzy similarity matrix,traverse the empty elements of the transitive closure matrix,and populate the empty elements by the bridging element.Experimental results on the UCI data sets and artificial data sets show that the proposed approach has a shorter time and 20% higher clustering accuracy compared with the conventional FJP.In addition,the computational efficiency on large data sets is also more excellent than the traditional FJP clustering algorithm,which verifies that the idea of the improved FJP clustering algorithm is effective and feasible.
作者 孙明珊 覃华 苏一丹 SUN Ming-shan;QIN Hua;SU Yi-dan(School of Computer and Electronics Information,Guangxi University,Nanning 530004,China)
出处 《计算机工程与科学》 CSCD 北大核心 2018年第6期1140-1146,共7页 Computer Engineering & Science
基金 国家自然科学基金(61363027)
关键词 模糊连接点聚类算法 传递闭包 桥元素 大顶堆 fuzzy joint points clustering algorithm transitive closure bridging element max -heap
  • 相关文献

参考文献6

二级参考文献72

  • 1冯银厂,彭林,吴建会,朱坦,吕爱华,张克潭.乌鲁木齐市环境空气中TSP和PM_(10)来源解析[J].中国环境科学,2005,25(B06):30-33. 被引量:34
  • 2刘华军,任明武,杨静宇.一种改进的基于模糊聚类的图像分割方法[J].中国图象图形学报,2006,11(9):1312-1316. 被引量:23
  • 3李晓飞,马大玮,范小麟.一种基于SPIHT算法的感兴趣区域编码新方法[J].计算机应用研究,2007,24(2):189-191. 被引量:9
  • 4丁媛媛,司玉娟,郎六琪,罗思维.基于提升小波变换的图像压缩编码的VLSI实现[J].吉林大学学报(工学版),2007,37(3):675-680. 被引量:3
  • 5Bao Yukun,Liu Zhitao.A fast grid search method in support vector regression forecasting time series[C]. Springer Berlin/Hei- delberg:Intelligent Data Engineering and Automated Learning, 2006.
  • 6Lazar A.Income prediction via support vector machine[C].New York:Proceedings of IEEE Machine Learning and Applications, 2004.
  • 7邓乃阳,田英杰.支持向量机.理论、算法与拓展[M].北京:科学出版社,2009:97-101.
  • 8Haasdonk B.Feature space interpretation of SVMs with indefinite kernels [J]. Pattem Analysis and Machine Intelligence, 2005,27(4):482-492.
  • 9Jair Cervantes,Li Xiaoou,Yu Wen,et al.Support vector machine classification for large data sets via minimum enclosing ball clustering[J].Neurocomputing,2008,71 (4):611-619.
  • 10Yang Shuzhong, Luo Siwei. Learning SVM kernel with semidefinite programming [J]. Advances in Natural Computation, 2005,3601(2):710-715.

共引文献52

同被引文献7

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部