摘要
图划分是大规模分布式图处理的首要工作,对图应用的存储、查询、处理和挖掘起基础支撑作用.随着图数据规模的不断扩大,真实世界中的图表现出动态性.如何对动态图进行划分,已成为目前图划分研究的热点问题.从不同动态图划分算法的关注点和特点出发,系统性地介绍当前可用于解决动态图划分问题的各类算法,包括流式图划分算法、增量式图划分算法和图重划分算法.首先介绍图划分的3种不同的划分策略及问题定义、图的两种不同的动态性来源以及动态图划分问题;然后介绍3种不同的流式图划分算法,包括基于Hash的划分算法、基于邻居分布的划分算法以及基于流的优化划分算法;其次介绍单元素增量式划分和批量增量式划分这两种不同的增量式图划分算法;再次,分别介绍针对图结构动态的重划分算法和针对图计算动态的重划分算法;最后,在对已有方法分析和比较的基础上,总结目前动态图划分面临的主要挑战,提出相应的研究问题.
Graph partitioning is the primary work of large-scale distributed graph processing,which plays a fundamental role in storage,query,processing,and mining of graph applications.Since graph data in the real world are always dynamic,the research of dynamic graph partitioning is a hot topic.This study systematically introduces the current algorithms for dynamic graph partitioning,which including streaming graph partitioning algorithm,incremental graph partitioning algorithm,and graph repartitioning algorithm.Firstly,the study introduces three different partitioning strategies,two different dynamic sources of graph and dynamic graph partitioning problem.Then,three different streaming graph partitioning algorithms are introduced,including hash algorithm,neighbor distribution-based algorithm,and novel algorithm.Secondly,two different incremental graph partitioning algorithms,single element incremental graph partitioning,and batch incremental graph partitioning are introduced.Thirdly,the repartitioning algorithm for graph structure and the repartitioning algorithm for graph computation are introduced,respectively.Finally,based on the analysis and comparison of the existing methods,the main challenges of dynamic graph partitioning are summarized and the corresponding research problems are proposed.
作者
李贺
刘延娜
袁航
杨舒琪
韵晋鹏
乔少杰
黄健斌
崔江涛
LI He;LIU Yan-Na;YUAN Hang;YANG Shu-Qi;YUN Jin-Peng;QIAO Shao-Jie;HUANG Jian-Bin;CUI Jiang-Tao(School of Computer Science and Technology,Xidian University,Xi’an 710126,China;ByteDance,Hangzhou 310020,China;School of Software Engineering,Chengdu University of Information Technology,Chengdu 610225,China)
出处
《软件学报》
EI
CSCD
北大核心
2023年第2期539-564,共26页
Journal of Software
基金
国家自然科学基金(61602354,61876138)
四川省科技计划(2021JDJQ0021,2022YFG0186)
成都市技术创新研发项目(2021-YF05-00491-SN)
成都市重大科技创新项目(2021-YF08-00156-GX)
成都市“揭榜挂帅”科技项目(2021-YF08-00156-GX)
中央高校基本科研业务费专项。
关键词
图划分
动态图
分布式图处理
图算法
graph partitioning
dynamic graphs
distributed graph processing
graph algorithms