A High-Order Affinity Propagation Clustering Algorithm Based on Tensor Distance (基于张量距离的高阶近邻传播聚类算法)

Abstract: The affinity propagation (AP) algorithm does not require the number of clusters to be specified in advance; it identifies the cluster centers and the number of clusters automatically at run time. On the same data set, AP produces stable clustering results and is robust. AP can also work with a variety of distance measures and yields accurate clusterings. However, AP in its current form cannot cluster heterogeneous data. To address this problem, the paper proposes a high-order AP clustering algorithm based on tensor distance (HTDAP). The algorithm first represents each heterogeneous data object as a tensor and then introduces tensor distance into AP to measure the similarity between heterogeneous data objects in tensor space. Tensor distance measures not only the numerical differences between objects but also the differences between the coordinate positions of their elements in the high-order space, and so captures the distribution features of heterogeneous data objects. Experimental results show that the proposed high-order AP algorithm clusters heterogeneous data objects effectively.
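The abstract does not state the tensor distance formula, so the following is only a minimal sketch under stated assumptions. It assumes a commonly used definition in which the squared distance is a quadratic form, with each pair of tensor elements weighted by a Gaussian of the distance between their coordinate positions, and it assumes the AP similarity is the negative squared distance. The names `metric_matrix`, `tensor_distance`, `similarity_matrix`, and the parameter `sigma` are hypothetical and do not come from the paper.

```python
import itertools
import math


def metric_matrix(shape, sigma=1.0):
    """Weights g[l][m] that decay with the distance between element positions.

    Assumed Gaussian form:
        g_lm = exp(-||p_l - p_m||^2 / (2 sigma^2)) / (2 pi sigma^2),
    where p_l is the index tuple of element l in a tensor of the given shape.
    """
    coords = list(itertools.product(*(range(n) for n in shape)))  # row-major order
    scale = 1.0 / (2.0 * math.pi * sigma ** 2)
    return [
        [scale * math.exp(-sum((a - b) ** 2 for a, b in zip(p, q)) / (2.0 * sigma ** 2))
         for q in coords]
        for p in coords
    ]


def tensor_distance(x, y, g):
    """Tensor distance between two tensors given as row-major flattened lists."""
    diff = [a - b for a, b in zip(x, y)]
    n = len(diff)
    total = sum(g[l][m] * diff[l] * diff[m] for l in range(n) for m in range(n))
    return math.sqrt(max(total, 0.0))  # clamp tiny negative rounding errors


def similarity_matrix(tensors, g):
    """Pairwise similarities: negative squared tensor distance (an assumption).

    The diagonal is left at 0.0; an AP implementation would overwrite it
    with per-point preference values.
    """
    n = len(tensors)
    s = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            s[i][j] = s[j][i] = -tensor_distance(tensors[i], tensors[j], g) ** 2
    return s


# Three 2x3 tensors, flattened row by row, each with a single nonzero entry.
x = [1, 0, 0, 0, 0, 0]  # entry at position (0, 0)
y = [0, 1, 0, 0, 0, 0]  # entry at (0, 1), adjacent to x's entry
z = [0, 0, 0, 0, 0, 1]  # entry at (1, 2), far from x's entry

g = metric_matrix((2, 3))
s = similarity_matrix([x, y, z], g)
print(tensor_distance(x, y, g), tensor_distance(x, z, g))
```

Euclidean distance treats the pairs (x, y) and (x, z) identically, since each pair differs by 1 in exactly two entries. Under the assumed tensor distance, x is closer to y than to z because the differing entries sit next to each other, which is the position sensitivity the abstract describes. A matrix like `s`, with its diagonal set to preference values, is the kind of input a standard AP implementation accepts, for example scikit-learn's `AffinityPropagation` with `affinity="precomputed"`.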

Authors: 铉岩, 周传生

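Affiliations: Kexin Software College, Shenyang Normal University (沈阳师范大学科信软件学院); College of Educational Technology, Shenyang Normal University (沈阳师范大学教育技术学院)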

Source: Journal of Shenyang Normal University: Natural Science Edition (《沈阳师范大学学报（自然科学版）》), CAS, 2016, No. 1, pp. 96-99 (4 pages)

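Funding: Liaoning Provincial Department of Science and Technology research project on a prediction system for undergraduate program offerings in higher education institutions (辽教函[1008]225号)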

Keywords: clustering; heterogeneous data; tensor distance; affinity propagation (AP algorithm)

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]


