基于影响力剪枝的图神经网络快速计算图精简被引量：1

Fast Computation Graph Simplification via Influence-based Pruning for Graph Neural Network

下载PDF

导出

摘要计算图精简是提升图神经网络(Graph Neural Network,GNN)模型训练速度的一种优化技术,它利用节点间存在共同邻居的特性,通过消除聚合阶段的冗余计算,来加速图神经网络模型的训练。但是,在处理大规模图数据时,已有的计算图精简技术存在计算效率低的问题,影响了计算图精简技术在大规模图神经网络中的应用。文中详细分析了当前的计算图精简技术,统计了包括搜索和重构两阶段处理的时间开销,并总结了现有方法的不足。在此基础上,提出了基于影响力剪枝的图神经网络快速计算图精简算法。该算法应用影响力模型刻画各个节点对计算图精简的贡献,并基于影响力对共同邻居的搜索空间进行剪枝,极大地提升了搜索阶段的效率。此外,详细分析了算法复杂度,从理论上证明了该技术期望的加速效果。最后,为验证所提算法的有效性,将所提算法应用到两种主流的计算图精简技术上,选取常见的图神经网络模型在多个数据集上进行测试,实验结果表明所提算法在保证一定冗余计算去除量的前提下,能够显著地提升计算图精简的效率。相比基线计算图精简技术,所提技术在PPI数据集上搜索阶段的加速效果最高提升了3.4倍,全过程最高提升了1.6倍;在Reddit数据集上搜索阶段的加速效果最高提升了5.1倍,全过程最高提升了3.2倍。 Computation graph simplification is a kind of optimization technique to improve the training speed of graph neural network models.It uses the characteristics of common neighbors among nodes and speeds up the training of graph neural network models by eliminating redundant computation in the stage of aggregation.However,when dealing with large-scale graph data,the existing computation graph simplification techniques suffer from the problem of low computation efficiency,which affects the application of computation graph simplification in large-scale graph neural networks.This paper analyzes the current techniques of computation graph simplification in detail by counting the overhead of two phases including searching and reconstruction,and summarizes the shortcomings of existing techniques.On this basis,it proposes an algorithm of fast computation graph simplification via influence-based pruning for graph neural network.This algorithm applies an influence model to describe the contribution of each node to the computation graph simplification and prunes the searching space of common neighbors based on influence,which greatly improves the efficiency of the phase of searching.In addition,this paper analyzes the algorithm complexity and theoretically proves the expected acceleration effect of this technique.Finally,in order to verify the effectiveness of this novel algorithm,the algorithm is applied to two mainstream computation graph simplification technique,and common graph neural network models areselected to test on some data sets.Experimental results demonstrate that the novel algorithm can significantly improve the efficiency of the computation graph simplification on the premise of ensuring a certain amount of redundant computation reduction.Compared with the baseline of computation graph simplification,the proposed technique can speed up to 3.4 times in searching phase and speed up to 1.6 times on the whole process on PPI dataset,while it can speed up to 5.1 times in searching phase and speed up to 3.2 times on the whole process on Reddit dataset.

作者顾希之邵蓥侠 GU Xizhi;SHAO Yingxia(School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China)

机构地区北京邮电大学计算机学院(国家示范性软件学院)

出处《计算机科学》 CSCD 北大核心 2023年第1期52-58,共7页 Computer Science

基金国家自然科学基金(62272054,U1936104,62192784)。

关键词图神经网络计算图精简共同邻居冗余计算剪枝 Graph neural network Computation graph simplification Common neighbors Redundant computation Pruning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

同被引文献25

1侯汉清,黄刚.电子计算机与文献分类[J].现代图书情报技术,1982(1):5-14. 被引量：10
2庞观松,蒋盛益.文本自动分类技术研究综述[J].情报理论与实践,2012,35(2):123-128. 被引量：33
3周丽红,刘勘.基于关联规则的科技文献分类研究[J].图书情报工作,2012,56(4):12-16. 被引量：9
4王方,阮梅花,朱海刚,熊燕,缪有刚.基于向量空间模型的科技文献自动分类研究[J].情报探索,2013(12):1-3. 被引量：5
5蒋昂波,王维维.ReLU激活函数优化研究[J].传感器与微系统,2018,37(2):50-52. 被引量：104
6谢红玲,奉国和,何伟林.基于深度学习的科技文献语义分类研究[J].情报理论与实践,2018,41(11):149-154. 被引量：11
7何浩,杨海棠.一种基于N-Gram技术的中文文献自动分类方法[J].情报学报,2002,21(4):421-427. 被引量：18
8王颖.科技文献内容语义描述模型研究[J].农业图书情报学报,2020,32(8):12-24. 被引量：9
9张晓丹.改进的图神经网络文本分类模型应用研究——以NSTL科技期刊文献分类为例[J].情报杂志,2021,40(1):184-188. 被引量：13
10陈德光,马金林,马自萍,周洁.自然语言处理预训练技术综述[J].计算机科学与探索,2021,15(8):1359-1389. 被引量：39

引证文献1

1安波.结构信息增强的文献分类方法研究[J].农业图书情报学报,2023,35(3):15-24.

1凌杰瑶,顾可恒.基于模糊综合评价的共享单车市场竞争影响力模型构建研究[J].企业改革与管理,2022(20):75-77. 被引量：1
2杨龙霄,杨润鑫,汤伟,李峻翔.知识图谱--从一张“图”看关联[J].科学中国人,2022(19):68-69.
3罗思诗,李茂军,陈满.多尺度融合注意力机制的人脸表情识别网络[J].计算机工程与应用,2023,59(1):199-206. 被引量：11
4孙水发,李小龙,李伟生,雷大江,李思慧,杨柳,吴义熔.图神经网络应用于知识图谱推理的研究综述[J].计算机科学与探索,2023,17(1):27-52. 被引量：14
5毛照昉,张悦晴.考虑新产品观测与体验特征的播种营销策略研究[J].系统工程理论与实践,2022,42(10):2740-2756. 被引量：3
6徐光晶,周坚鑫,舒晴.四叉树分解在海空重力测网交叉点搜索中的应用[J].武汉大学学报（信息科学版）,2022,47(11):1847-1853. 被引量：3
7马涪元,王英,李丽娜,汪洪吉.融合结构和特征的图层次化池化模型[J].计算机科学与探索,2023,17(1):179-186.
8王学滨,薛承宇,岑子豪.基于点-单元接触模式的水平岩层运动连续-非连续方法模拟[J].山东科技大学学报（自然科学版）,2022,41(6):40-49. 被引量：4

计算机科学

2023年第1期

浏览历史

内容加载中请稍等...

基于影响力剪枝的图神经网络快速计算图精简被引量：1

同被引文献25

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于影响力剪枝的图神经网络快速计算图精简 被引量：1

同被引文献25

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于影响力剪枝的图神经网络快速计算图精简被引量：1