基于节点聚类复杂度的图聚类方法

Graph Clustering Algorithm Based on Node Clustering Complexity

下载PDF

导出

摘要图聚类可以发现网络中的社区结构,是复杂网络分析中的一项重要任务。针对不同节点的聚类难度各异的问题,提出了一种基于节点聚类复杂度的图聚类算法(Graph Clustering Algorithm Based on Node Clustering Complexity,GCNCC),用于判断节点的聚类复杂度,为聚类复杂度低的节点赋予伪标签,利用伪标签提供的监督信息降低其他节点的聚类复杂度,进而得到网络聚类结果。GCNCC包括节点表示、节点聚类复杂度判别和图聚类3个主要模块。节点表示模块得到保持网络集聚性的表示;节点聚类复杂度判别模块用于判断网络中的低聚类复杂度节点,并利用低聚类复杂度节点的伪标签信息来优化更新网络中其他节点的聚类复杂度;图聚类模块采用标签传播方法,将低聚类复杂度节点标签传播给高聚类复杂度节点,以得到聚类结果。在3个真实的引文网络和3个生物数据集上与9种经典算法进行对比,算法GCNCC在ACC,NMI,ARI和F1等方面均表现良好。 Graph clustering is an important task in the analysis of complex networks,which can reveal the community structure within a network.However,clustering complexity of nodes varies throughout the network.To address this issue,a graph clustering algorithm based on node clustering complexity(GCNCC)is proposed.It calculates the clustering complexity of nodes and assigns pseudo-labels to nodes with low clustering complexity.Then it uses these pseudo-labels as supervised information to lower the clustering complexity of other nodes to obtain the community structure of the network.The GCNCC algorithm consists of three main modules:node representation,node clustering complexity assessment,and graph clustering.The node representation module represents nodes in a low-dimensional space to maintain the clustering of nodes,the node clustering complexity assessment module identifies low clustering complexity nodes,and assigns them pseudo-labels,which can be used to update the clustering complexity of other nodes.The graph clustering module uses label propagation to spread the pseudo-labels from nodes with low clustering complexity to those with high clustering complexity.Compared with 9 classic algorithms on 3 real citation networks and 3 biological datasets,the proposed GCNCC performed well in terms of ACC,NMI,ARI,and F1.

作者郑文萍王富民刘美麟杨贵 ZHENG Wenping;WANG Fumin;LIU Meilin;YANG Gui(School of Computer and Information Technology,Shanxi University,Taiyuan 030006,China;Key Laboratory of Computation Intelligence and Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,China;Institute of Intelligent Information Processing,Shanxi University,Taiyuan 030006,China)

机构地区山西大学计算机与信息技术学院计算智能与中文信息处理教育部重点实验室(山西大学) 山西大学智能信息处理研究所

出处《计算机科学》 CSCD 北大核心 2023年第11期77-87,共11页 Computer Science

基金国家自然科学基金(62072292) 山西省1331工程项目。

关键词图聚类节点聚类复杂度网络嵌入自监督 Graph clustering Node clustering complexity Network embedding Self-supervised

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1杜秀丽,陶帆,于涵,徐耀耀,吕亚娜.基于超网络的装备保障网络重组的动态演化模型[J].火力与指挥控制,2022,47(5):29-35.
2刘晓扬,韩增林,郭建科.高铁流视角下的环渤海地区城市网络联系[J].资源开发与市场,2023,39(5):580-590. 被引量：1
3钱昭楚.跨境电商贸易网络地位与出口产品技术复杂度关系分析[J].商业经济研究,2023(12):131-134.
4王雪微,范大龙,曹卫东.中国汽车总部与零部件企业供应网络结构演化及影响因素[J].经济地理,2023,43(2):124-135. 被引量：2
5谢亚兰.基于AI智能生成工具Generative Fill的PS课程教学改革[J].中国高新科技,2023(16):158-160.
6周昱伽,贾宇,马文敏,张昊,马红霞.分子对接技术在研究群体感应抑制剂中的进展[J].微生物学通报,2023,50(10):4626-4638.
7王钦,夏雨欣,杨张博.组织合法性、能力两用性与企业创新战略选择[J].科技管理研究,2023,43(17):11-19.
8张佳雪.国外方言态度研究现状与热点分析--基于CiteSpace可视化研究[J].现代商贸工业,2023,44(19):58-60.
9曹锦利.基于多目标优化的火电厂煤炭调度模型[J].辽宁工业大学学报（自然科学版）,2023,43(5):288-292.
10夏婷婷,李鑫阳.基于云计算的分散式畜禽粪污处理信息集成方法[J].信息与电脑,2023,35(15):120-122. 被引量：1

计算机科学

2023年第11期

浏览历史

内容加载中请稍等...

基于节点聚类复杂度的图聚类方法

相关作者

相关机构

相关主题

浏览历史