Semi-supervised representation learning method combining graph auto-encoder and clustering

Cited by: 1


Abstract: Node labels are supervision information that is widely available in complex networks, and they play an important role in network representation learning. Based on this fact, a Semi-supervised Representation Learning method combining Graph AutoEncoder and Clustering (GAECSRL) was proposed. Firstly, a Graph Convolutional Network (GCN) and an inner product function were used as the encoder and the decoder respectively, and a graph auto-encoder was constructed to form an information propagation framework. Then, a k-means clustering module was added on top of the low-dimensional representation generated by the encoder, so that the training of the graph auto-encoder and the partitioning of nodes into categories formed a self-supervised mechanism. Finally, the discriminative information of the node labels was used to guide the category partitioning of the low-dimensional network representation. Network representation generation, category partitioning, and the training of the graph auto-encoder were built into a unified optimization model, yielding an effective network representation that integrates node label information. In simulation experiments, GAECSRL was applied to node classification and link prediction tasks. Experimental results show that, compared with DeepWalk, node2vec, learning Graph Representations with global structural information (GraRep), Structural Deep Network Embedding (SDNE), and Planetoid (Predicting labels and neighbors with embeddings transductively or inductively from data), GAECSRL improved the Micro-F1 index by 0.9 to 24.46 percentage points and the Macro-F1 index by 0.76 to 24.20 percentage points in the node classification task; in the link prediction task, GAECSRL improved the AUC (Area Under Curve) index by 0.33 to 9.06 percentage points. This indicates that the network representations obtained by GAECSRL effectively improve the performance of node classification and link prediction tasks.
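The pipeline described in the abstract (GCN encoder, inner product decoder, k-means step on the embeddings) can be illustrated with a small forward-pass sketch. This is not the authors' implementation. The two-layer encoder, the random untrained weights, the toy 4-node graph, and all function names (`normalize_adjacency`, `gcn_encode`, `inner_product_decode`, `assign_clusters`) are assumptions made for illustration. The joint optimization and the label-guided term are omitted because the abstract does not specify their form.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(adj):
    """Symmetric GCN normalization: D^{-1/2} (A + I) D^{-1/2}."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]  # always > 0 because of the self-loops
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def gcn_encode(adj_norm, features, w0, w1):
    """Assumed two-layer GCN encoder: Z = A_norm ReLU(A_norm X W0) W1."""
    hidden = matmul(adj_norm, matmul(features, w0))
    hidden = [[max(0.0, v) for v in row] for row in hidden]
    return matmul(adj_norm, matmul(hidden, w1))

def inner_product_decode(z):
    """Inner product decoder: edge probability sigmoid(z_i . z_j)."""
    z_t = [list(col) for col in zip(*z)]
    logits = matmul(z, z_t)
    return [[1.0 / (1.0 + math.exp(-v)) for v in row] for row in logits]

def assign_clusters(z, centroids):
    """k-means assignment step: nearest centroid index for each embedding."""
    def sq_dist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))
    return [min(range(len(centroids)), key=lambda k: sq_dist(row, centroids[k]))
            for row in z]

# Toy path graph 0-1-2-3 with one-hot node features (illustrative data only).
adj = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]
features = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

rng = random.Random(0)  # fixed seed; the weights are random, not trained
w0 = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
w1 = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(3)]

adj_norm = normalize_adjacency(adj)
z = gcn_encode(adj_norm, features, w0, w1)   # 4 x 2 node embeddings
recon = inner_product_decode(z)              # 4 x 4 reconstructed adjacency
labels = assign_clusters(z, [z[0], z[3]])    # two centroids seeded from nodes 0 and 3
```

In a trained model, the reconstruction error between `recon` and `adj`, the clustering objective, and the label term would be minimized together. Here the weights stay fixed, so the output only shows how the pieces connect.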

Authors: DU Hangyuan; HAO Sicong; WANG Wenjian (School of Computer and Information Technology, Shanxi University, Taiyuan, Shanxi 030006, China; Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education (Shanxi University), Taiyuan, Shanxi 030006, China)

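Affiliations: School of Computer and Information Technology, Shanxi University; Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education (Shanxi University)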

Source: Journal of Computer Applications (计算机应用), indexed in CSCD and the Peking University Core Journals list, 2022, Issue 9, pp. 2643-2651 (9 pages)

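Funding: National Natural Science Foundation of China (61902227, 61773247); Science and Technology Innovation Project of Higher Education Institutions of Shanxi Province (2019L0039); Natural Science Foundation of Shanxi Province (201901D211192).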

Keywords: network representation learning; network embedding; node label; graph neural network; self-supervised mechanism

Classification number: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]

