期刊文献+

利用测地线距离的改进谱聚类算法 被引量:5

An Improved Spectral Clustering Algorithm Using Geodesic Distance
下载PDF
导出
摘要 针对往复式压缩机故障数据空间分布复杂、常规算法不能有效聚类的问题,提出了一种改进的谱聚类算法.该算法使用新的相似度矩阵计算方式,根据故障数据流形分布的特点引入测地线距离取代欧氏距离作为数据间的关系度量;通过计算各数据点的邻域密度因子有效地识别和剔除了噪声点;利用基于密度的局部欧氏距离调整方法对流形间隙过小的区域进行了处理.在几个人工数据集和往复式压缩机故障数据集上的测试结果表明,改进谱聚类算法对于具有流形分布、多尺度、有噪声、流形间隙过小甚至交叉等特点的数据具有很好的聚类能力,聚类准确率比常规的k-均值和MSCA谱聚类算法分别提高了50.86%和8.6%. An improved spectral clustering algorithm is proposed to focus on the problem that the general clustering algorithms are invalid for reciprocating compressor fault data lying on complex manifold. A new affinity matrix is obtained. The geodesic distance replaces the traditional Euclidian distance to measure the similarity of data, and neighborhood-based density factor is used to identify and to remove noise points. Moreover, density-based local Euclidian distance adjustment is introduced into areas with small gap between manifolds. The proposed method is implemented On several artificial datasets and a real reciprocating compressor fault dataset. Experimental results show that the new algorithm can accomplish the clustering for data with noise and multi-scale character, especially when the manifolds have small gaps or crossover between each other. Its accuracy is 50. 86% and 8. 6~ higher than those of k-means and MSCA respectively.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2012年第8期1-7,共7页 Journal of Xi'an Jiaotong University
基金 国家自然科学基金资助项目(61075001)
关键词 往复式压缩机 谱聚类 测地线距离 欧氏距离调整 reciprocating compressor spectral clustering geodesic distance Euclidian distanceadjustment
  • 相关文献

参考文献21

二级参考文献197

共引文献393

同被引文献38

  • 1Jain A K. Data clustering: 50 years beyond k-means [J]. Pat- tern Recognition Letters, 2010, 31 (8): 651-666.
  • 2Thanh N Tran, Klaudia Drab, Michal Daszykowski. Revised DBSCAN algorithm to cluster data with dense adjacent clusters[J]. Chemometrics and Intelligent Laboratory Systems, 2013, 120 (15): 92-96.
  • 3Rodriguez L A. Clustering by fast search and find of density peaks [J]. Science, 2014, 344 (6191): 1492-1496.
  • 4Chan T M. More algorithms for all-pairs shortest paths in weighted graphs [J]. SIAM Journal on Computing, 2010, 39 (5) ; 2075-2089.
  • 5Zhao H, Min F, Zhu W. Test-cost-sensitive attribute reduc- tion of data with normal distribution measurement errors [J]. Mathematical Problems in Engineering, 2013 (2): 1-12.
  • 6Crawford G C, Aguinis H, Lichtenstein B, et al. Power law distributions in entrepreneurship: Implications for theory and research [J]. Journal of Business Venturing, 2015, 30 (5): 696-713.
  • 7Wauthier F, Jojie N, Jordan M I. Active spectral clustering via iterative uncertainty reduction [C] //Proceedings of ACM Conference on Knowledge Discovery and Data Mining, 2012: 1339-1347.
  • 8Alcala' J, Fernatndez A, Luengo J, et al. Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework [J]. Journal of Multi- pie-Valued Logic and Soft Computing, 2010, 17 (11): 255-287.
  • 9赵海祥,伍卫国,赵增,李海龙.一种应用于远程并行程序调试系统的新型消息聚集机制[J].西安交通大学学报,2009,43(10):27-31. 被引量:2
  • 10贺玲,蔡益朝,杨征.高维数据聚类方法综述[J].计算机应用研究,2010,27(1):23-26. 被引量:42

引证文献5

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部