期刊文献+

基于统一标签矩阵的快速多视图聚类

Fast Multi-View Clustering Based on Uniform Label Matrix
下载PDF
导出
摘要 在多视图聚类领域,众多方法直接从原始数据中学习相似矩阵,但是这忽视了原始数据中的噪声所产生的影响;此外,还有一些方法必须对图拉普拉斯矩阵进行特征分解,这导致可解释性的降低且需要k-means等后处理。为解决以上问题,文中提出一种基于统一标签矩阵的快速多视图聚类。首先,从松弛的归一化切割和比率切割的统一观点出发对目标函数增加非负约束;然后,通过指示矩阵对相似矩阵进行结构化图重构,确保获得的图具有强簇内连接和弱簇间连接;此外,通过设置统一的标签矩阵降低迭代次数,从而进一步提升该方法的运行速度;最后,基于交替方向乘子法的策略对问题进行优化求解。算法通过随机选择锚点地址的方法对多视图数据集进行对齐,对齐视图能够大幅提升聚类的精度。在迭代过程中通过使用奇异值分解来替代特征分解,有效地解决了传统谱聚类算法计算复杂度高的问题;通过按行索引指示矩阵的最大元素的列标直接获得标签。在4个真实数据集上的实验结果证明了该算法的有效性,表明其聚类性能优于现有的9种基准算法。 In the field of multi-view clustering,many methods learn the similarity matrix directly from the original data,but this ignores the effect of noise in the original data.In addition,some methods must perform a feature de⁃composition on the graph Laplacian matrix,which leads to reduced interpretability and requires post-processing such as k-means.To address these issues,this paper proposed a fast multi-view clustering based on a unified label matrix.Firstly,a non-negative constraint was added to the objective function from the unified viewpoint of the nor⁃malized cut of the relaxation and the ratio cut.Then,a structured graph reconstruction was performed on the simi⁃larity matrix by the indicator matrix to ensure that the obtained graph has strong intra-cluster connections and weak inter-cluster connections.In addition,the number of iterations was reduced by setting a unified label matrix,thus further improving the speed of the method.Finally,the problem was solved optimally based on an alternating direc⁃tion multiplication strategy.The algorithm aligns the multi-view dataset by randomly selecting the anchor ad⁃dresses,and aligning the views can significantly improve the accuracy of clustering.The problem of the high com⁃putational complexity of traditional spectral clustering algorithms was effectively solved by using singular value de⁃composition instead of feature decomposition in the iterative process.Labels were obtained directly by indicating the column labels of the largest element of the matrix by row index.Experimental results on four real datasets demonstrate the effectiveness of the algorithm,and show that its clustering performance outperformed the nine existing benchmark algorithms.
作者 刘怡俊 王嘉达 钟仕杰 杨晓君 叶武剑 LIU Yijun;WANG Jiada;ZHONG Shijie;YANG Xiaojun;YE Wujian(School of Integrated Circuits,Guangdong University of Technology,Guangzhou 510006,Guangdong,China;School of Advanced Manufacturing,Guangdong University of Technology,Jieyang 515200,Guangdong,China;School of Information Engineering,Guangdong University of Technology,Guangzhou 510006,Guangdong,China)
出处 《华南理工大学学报(自然科学版)》 EI CAS CSCD 北大核心 2023年第9期110-119,138,共11页 Journal of South China University of Technology(Natural Science Edition)
基金 国家重点研发计划项目(2018YFB1802100) 广东省重点领域研发计划项目(2018B030338001,2018B010115002) 广州市基础研究计划基础与应用基础研究项目(202201010595) 广东省教育厅创新人才项目和广东工业大学青年百人项目(220413548)。
关键词 多视图聚类 标签矩阵 图重构 可解释性 multi-view clustering label matrix graph reconstruction interpretability
  • 相关文献

参考文献2

二级参考文献3

共引文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部