附加特征图增强的图卷积神经网络被引量：2

Graph Convolutional Neural Networks with Additional Feature Graph

下载PDF

导出

摘要近年来,图卷积网络(Graph Convolutional Network,GCN)凭借其简单的网络结构、在图上任务中展现出的优异性能,受到了学术界和工业界的广泛关注.然而GCN也存在着在浅层时信息传播范围过小、特征提取不充分的缺陷.针对这一问题,本研究提出附加特征图模型(Additional Feature Graph,AFG).AFG通过引入图的节点结构特征(度特征),对度相同的节点随机增加连边、缩短信息传播距离.AFG并不是独立的图神经网络模型,而是作为一种附加技术与GCN及其相关模型配合使用.实验显示,在Cora、Citeseer、Pubmed数据集上AFG能够对浅层主干模型实现显著性能增益,帮助主干模型性能超越了其他以提升模型特征提取能力、改善欠传播情况为目的进行设计的模型.本研究进一步分析了AFG与DropEdge——一种随机切断原始图连边的附加技术——的区别与联系,并通过实验证明附加特征图模型与DropEdge模型共同使用的可行性,以及两者间存在一定的互补性.结合使用两种附加技术可以实现更大的节点分类准确度增益. Graph structures are suitable for the modeling of complex interactions and relations.Therefore,graphs are widely used in data representation such as molecules,chemical compounds,citation networks,social networks,traffic web,and knowledge graphs.In light of the great success of neural networks in image understanding and natural language processing,there has been a rising interest in Graph Neural Networks(GNN)for the study of learning on graphs.Among the popular GNNs,Graph Convolutional Networks(GCN),highlighted for their simple network structures and excellent performance with graphs,have attracted wide attention and become a promising direction.However,the limitation on message passing distance deteriorates the performance of GCN.To address this problem,people propose constructing deep GCN models to improve propagation.However,as the depth of GCN increases,node features become oversmoothed or over-squashed.This leads to a sharp decrease in the model performance.Though different deep GCN models have been proposed to tackle the over-smoothing or over-squashing problem,the inherent problems of the message passing mechanism are less explored.The problems of message passing in GCN include:1)having unreliable paths in the original input graphs which pass information with low signal-to-noise ratios,2)reaching limited message passing extent which gives rise to the less expressive feature representation,and 3)lacking explicit learning fashion on structural features.To this end,some people seek to directly improve the message passing in shallow GCN concerning the limitation in robustness,message passing extent,or feature diversity.But all these studies fail to tackle three problems in a single model.In this paper,we propose a novel model named AFG(Additional Feature Graph),to improve the message passing over robustness,message passing extent,and feature diversity.Specifically,AFG can inject the structural features of the input graph into the message passing process and randomly add edges between node pairs that have the same degree.The degree feature represents the first-order topological structure of individual nodes.Connecting nodes with the same degree allow explicit learning on structural features.As a lightweight and general technique,our AFG model can be easily plugged into GCN and its related models,bringing extra improvements.Experimental results on three datasets demonstrate the efficiency of AFG.AFG-aided models outperform shallow backbones and related GCN-based models on Cora,Citeseer,and Pubmed.AFG achieves 0.69%averaged improvement on 2-layer models,0.57%on 4-layer models,1.18%on 6-layer models,and 0.99%on 8-layer models.The improvements are proven to be significant with hypothesis testing.We also provide detailed experimental analysis on AFG with both synthetic datasets and real world datasets.The corresponding experimental results demonstrate that AFG can improve the connectivity and shorten the average path length of the input graphs.With AFG,nodes can get more informative features from their neighbors.Compared to GCN,AFG reaches a broader message passing extent in shallow model structures.Furthermore,we provide experiments verifying that AFG and DropEdge(another plug-and-play technique for GCN models)are complementary to each other and can be combined to achieve better performance.

作者孙隽姝王树徽杨晨雪黄庆明郑振刚 SUN Jun-Shu;WANG Shu-Hui;YANG Chen-Xue;HUANG Qing-Ming;Reynold C.K.Cheng(Key Lab of Intelligent Information Processing(CAS,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190;Department of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049;Pengcheng Laboratory,Shenzhen,Guangdong 518055;Agriculture Information Institute of Chinese Academy of Agriculture Sciences,Beijing 100081;Department of Computer Science,The University of Hong Kong,Hong Kong;Guangdong-Hong Kong-Macao Joint Laboratory,Shenzhen University,Shenzhen,Guangdong 518060)

机构地区中国科学院计算技术研究所智能信息处理实验室中国科学院大学计算机科学与技术学院鹏城实验室中国农业科学院农业信息研究所香港大学计算机科学系深圳大学粤港澳智慧城市联合实验室

出处《计算机学报》 EI CAS CSCD 北大核心 2023年第9期1900-1918,共19页 Chinese Journal of Computers

基金科技创新2030-新一代人工智能重大项目:面向跨媒体内容管理的智能分析与推理(No.2018AAA0102000) 国家自然科学基金委员会:跨媒体理解与知识推理(No.62022083) 国家自然科学基金委员会:数据和知识联合驱动的跨媒体语义理解与文本生成(No.62236008) 中国科学院计算技术研究所创新课题(E161060) 香港大学项目(104005858,10400599) 粤港澳联合实验室项目(2020B1212030009) 鹏城实验室重大攻关项目:脑眼融合的智能感知计算技术与平台(PCL2023AS6-1)等项目资助。

关键词图表示学习图神经网络信息传播图卷积网络节点分类 graph representation learning graph neural networks message passing graph convolutional networks node classification

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献9

1胡龙灿,杨帆,樊爱军.手写数学公式的识别研究及在Android上的应用[J].计算机应用与软件,2014,31(8):28-31. 被引量：2
2付鹏斌,彭荆旋,杨惠荣,李建君.基于多重几何特征和CNN的脱机手写算式识别[J].计算机系统应用,2020,29(8):271-279. 被引量：2
3甘晓英,白阳,何晓栋,刘斌.一种并行二值图像连通域标记算法[J].计算机与数字工程,2021,49(5):993-997. 被引量：10
4沈佳伟,周宇昂,赵天宇,周渊,周志豪,张娟.手写数学表达式识别方法研究[J].福建电脑,2021,37(7):59-61. 被引量：1
5雷嘉兴,王伟.二维傅里叶图像预处理对DNN网络的影响研究[J].科学技术创新,2022(11):61-64. 被引量：2
6王巍,周庆华.基于改进Faster R-CNN的算式检测与定位[J].智能计算机与应用,2022,12(12):164-168. 被引量：2
7王佳宇,李楹,马春梅,吴东昊,姜丽芬.融合实体信息的图卷积神经网络的短文本分类模型[J].天津师范大学学报（自然科学版）,2023,43(1):67-72. 被引量：7
8李卓璇,周亚同.改进DBNet的电商图像文字检测算法研究[J].计算机工程与科学,2023,45(11):2008-2017. 被引量：2
9倪波,柯亨进,蔡贤涛.半监督学习下复杂背景图像边缘检测仿真[J].计算机仿真,2023,40(12):269-272. 被引量：1

引证文献2

1王治学.融合实体信息的图卷积神经网络的短文本分类模型分析[J].信息系统工程,2023(9):122-125. 被引量：2
2刘兴豪,陈芷妍,何滨,童保鑫,李文全.基于手机拍照的手写算式识别研究[J].信息技术与信息化,2024(6):68-71.

二级引证文献2

1张德银,黄少晗,赵志恒,李俊佟,张裕尧.基于融合神经网络的飞机蒙皮缺陷检测的研究[J].成都大学学报（自然科学版）,2023,42(4):365-371. 被引量：1
2贡丹均,张志明.基于无人机航测技术的林草区植被快速分类方法研究[J].经纬天地,2024(4):82-85.

1王静红,周志霞,王辉,李昊康.双路自编码器的属性网络表示学习[J].计算机应用,2023,43(8):2338-2344.
2王金娣.税收弹性对中小企业税负的影响[J].纳税,2023(19):37-39.
3李智杰,韩津津,李昌华,张颉.面向图嵌入的改进图注意机制模型[J].计算机工程与应用,2023,59(17):152-158. 被引量：1
4陈坤,孙彬,高仕谦,王学东.IL/Fe_(3)O_(4)-COOH@MIL-101磁性固相萃取-高效液相色谱测定牛奶中的黄曲霉毒素[J].分析科学学报,2023,39(4):398-404. 被引量：1
5陈鑫,侯青山,付艳,张吉康.改进DeepLabV3+下的轻量化烟雾分割算法[J].西安工程大学学报,2023,37(4):118-126. 被引量：1
6唐彦龙,冶忠林,赵海兴,仁青卓么.基于文本注意力机制优化的网络表示学习模型[J].郑州大学学报（理学版）,2023,55(6):41-47.
7马胜位,黄瑞章,任丽娜,林川.基于多层语义融合的结构化深度文本聚类模型[J].计算机应用,2023,43(8):2364-2369. 被引量：2
8宣清良.猪传染性胸膜肺炎的诊断与防治措施[J].今日畜牧兽医,2023,39(8):104-106. 被引量：1
9焦娇.新媒体时代时政新闻传播路径探究[J].传播力研究,2023,7(24):4-6.
10孙传珠,李斌,符朝兴.基于注意力机制与YOLOv5融合的树脂拉链缺陷检测算法研究[J].青岛大学学报（工程技术版）,2023,38(3):23-29.

计算机学报

2023年第9期

浏览历史

内容加载中请稍等...

附加特征图增强的图卷积神经网络被引量：2

同被引文献9

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

附加特征图增强的图卷积神经网络 被引量：2

同被引文献9

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

附加特征图增强的图卷积神经网络被引量：2