期刊文献+

大规模图数据的k^2-MDD表示方法与操作研究 被引量:4

Representation and Operations Research of k^2-MDD in Large-Scale Graph Data
下载PDF
导出
摘要 对包含亿万个顶点和边的图数据进行高效、紧凑的表示和操作是大规模图数据分析处理的基础.针对该问题提出了基于决策图的大规模图数据的一种表示方法——k^2-MDD,给出了k^2-MDD的构造过程以及图的边查询、外(内)邻查询、出(入)度查询、添加(删除)边等基本操作.该表示方法在k^2树的基础上进行优化与改进,对图的邻接矩阵进行k^2划分后,采用多值决策图进行存储,从而达到存储结构更为紧凑的目的.通过对来自米兰大学LAW实验室的一系列真实网页图和社交网络图数据的实验结果可以看出,k^2-MDD结构在节点数上仅为k^2树的2.59%~4.51%,达到了预期效果.通过对随机图的实验结果可以看出,k^2-MDD结构不仅适用于稀疏图,同样也适用于稠密图.图数据的k^2-MDD表示,既具有k^2树表示的紧凑型和查询的高效性,又能实现符号决策图表示下图模式的高效操作,从而实现了描述和计算能力的统一. Efficient and compact representation and operation of graph data which contains hundreds ofmillions of vertices and edges are the basis of analyzing and processing the large scale of graph data.Aiming at the problem,this paper proposes a representation of large-scale graph data based on thedecision diagram,that is^2-MDD,providing the initialization of^2-MDD and the basic operation suchas the edge query,inner(outer)neighbor query,finding out(in)-degree,adding(deleting)edge,etc.The representation method is optimized and improved on the basis of k2tree,and after dividing theadjacency matrix of graph into k2,it is stored with the multi valued decision diagram,so as to achievea more compact storage structure.According to the experimental results of a series of real Web graphand the social network graph data(cnr-2000,dewiki-2013,etc.)derived from the LAW laboratory atthe University of Milan,it can be seen that the number of^2-MDD5nodes is only2.59%-4.51%ofthe k2tree,which achieving the desired effect.According to the experimental results of randomgraphs,it can be seen that the^2-MDD structure is not only suitable for sparse graphs,but also fordense graphs.The graph data of^2-MDD shows that both containing the compact and query efficiencyrepresentation of k2tree and realizing the efficient operation of the graph model can thus achieve theunity of description and computing power.
作者 董荣胜 张新凯 刘华东 古天龙 Dong Rongsheng;Zhang Xinkai;Liu Huadong;Gu Tianlong(Guangxi Key Laboratory of Trusted Software(Guilin University of Electronic Technology),Guilin ^Guangxi 541004)
出处 《计算机研究与发展》 EI CSCD 北大核心 2016年第12期2783-2792,共10页 Journal of Computer Research and Development
基金 国家自然科学基金项目(U1501252,61363070,61572146,61363030) 广西高等学校高水平创新团队及卓越学者计划 桂林电子科技大学创新团队资助项目~~
关键词 图数据 存储优化 々2_ M D D 々2 决策图 graph data storage optimization ^2-MDD k2 tree decision diagram
  • 相关文献

参考文献2

二级参考文献9

  • 1AGGARWAL C, XIE Yan, YU P S. GConnect: a connectivity index for massive disk-resident graphs[ C ] //Proc of the International Conference on Very Large Data Bases. [ S. l. ] : VLDB Endowment Inc, 2009:862873.
  • 2BRISABOA N R, LADRA S, NAVARRO G. K2-trees for compact Web graph representation [ C ]//Proc of the 16th International Symposium on String Processing and Information Retrieval. Berlin : Springer, 2009 : 1830.
  • 3BOLDI P, VIGNA S. The Webgraph framework I: compression techniques[ C ]//Proc of the 13th International Conference on World Wide Web. New York : ACM Press,2004:595-602.
  • 4XIAO Yang-hua, MacARTHUR B D, WANG Hui, et al. Network quotients: structural skeletons of complex systems[ J]. Physical Review E,2008,78(4) :046102-046108.
  • 5KNUTH D E. The Stanford graphbase:a platform for combinatorial computing [ M ]. New York : ACM Press, 1993.
  • 6BATAGELJ V, MRVAR A. Pajek datasets[ EB/OL]. (2007-02-24) [ 2010-10-22 ]. http ://vlado. fmf. uni-lj, si/pub/networks/data/.
  • 7NEWMAN M E J. The structure of scientific collaboration networks [J]. Proceedings of the National Academy Sciences of the USA,2001,98(2) : 404-409.
  • 8ComScore:Facebook 3月独立访问用户达4.84亿[EB/OL].(2010-04-22)[2010-06-28].http://www.cctime.com/html/2010-4-22/201042291342486.htm.
  • 9于戈,谷峪,鲍玉斌,王志刚.云计算环境下的大规模图数据处理技术[J].计算机学报,2011,34(10):1753-1767. 被引量:97

共引文献13

同被引文献8

引证文献4

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部