期刊文献+

基于邻域的大规模图数据动态分割算法 被引量:2

Large Scale Dynamic Graph Partitioning Based on Neighborhood Algorithm
下载PDF
导出
摘要 随着图数据的规模日益增大,出现大量以动态图数据为基础的分布式处理需求,划分问题在动态图数据分布式处理领域尤为重要.对大规模动态图数据上的划分问题进行研究,根据图结构性质及动态图特点,提出并实现基于邻域的动态图分割算法.算法分为静态切分和动态调整两个阶段,其中基于割边算法整合现有最优化策略提出了大规模图数据的静态切割算法.在优化后的静态切割算法的基础上,根据图数据的动态扩张的特性提出动态分割算法.根据迁移顶点所达到的最小负载值进行顶点迁移,并在此基础上进行性能及割边控制优化操作.最后,改进算法在各类图数据集上进行了验证,验证的结果显示在平衡度和割边等指标上优化后的算法效果显著,提高了划分的合理性,并且在保证割边不增加的情况下提高了图分割的平衡度. To solve the problem of graph data distributed processing, graph partitioning is more and more important. This paper studies the method of partitioning a large-scale dynamic graph. According to the characteristics of graph structure and dynamic graph, a dynamic graph partitioning algorithm based on neighborhood is proposed and implemented. The algorithm consists of two stages, static partition and dynamic adjustment. A static graph partitioning combines the cut-edges and the existing partition tools to get an optimized method. The dynamic adjustment focuses on the transferring of vertexes to its neighborhood. It calculates the value of vertexes transferring load and optimize the operations. Experiments on a series of common graph datasets show significant effect on the indexes of balance and cut-edges, which means the partition is reasonable and balanced.
出处 《计算机系统应用》 2016年第9期193-199,共7页 Computer Systems & Applications
基金 中国科学院先导专项(XDA06010600) 国家自然科学基金(61303163 91318301)
关键词 图数据 动态图 图分割 负载均衡 graph dynamic graph graph partition load balance
  • 相关文献

参考文献20

  • 1于戈,谷峪,鲍玉斌,王志刚.云计算环境下的大规模图数据处理技术[J].计算机学报,2011,34(10):1753-1767. 被引量:97
  • 2Borthakur D. HDFS architecture guide. HADOOP APACHEPROJECT. http://hadoop. apache, org/common/docs/current/hdfs design, pdf.
  • 3Chang F, Dean J, Ghemawat S,et al. Bigtable: A distributedstorage system for structured data. ACM Trans, on ComputerSystems (TOCS), 2008,26(2): 4.
  • 4Webber J. A programmatic introduction to neo4j. Proc. of the3rd Annual Conference on Systems, Programming, andApplications: Software for Humanity. ACM.2012. 217-218.
  • 5Xin RS, Gonzalez JE, Franklin MJ,et al. Graphx: A resilientdistributed graph system on spark. First InternationalWorkshop on Graph Data Management Experiences andSystems. ACM. 2013. 2.
  • 6Low Y,Bickson D, Gonzalez J, et al. Distributed GraphLab: Aframework for machine learning and data mining in the cloud.Proc. of the VLDB Endowment, 2012, 5(8): 716-727.
  • 7Malewicz Q Austem MH, Bik AJC, et al. Pregel: A systemfor large-scale graph processing. Proc. of the 2010 ACMSIGMOD International Conference on Management of Data.ACM. 2010. 135-146.
  • 8Shao B, Wang H, Li Y. Trinity: A distributed graph engine ona memory cloud. Proc. of the 2013 ACM SIGMODInternational Conference on Management of Data. ACM.2013.505-516.
  • 9Garey MR, Johnson DS, Stockmeyer L. Some simplifiedNP-complete graph problems. Theoretical Computer Science,1976,1(3): 237-267.
  • 10Kemighan BW, Lin S. An efficient heuristic procedure forpartitioning graphs. Bell System Technical Loumal, 1970,49(2): 291-307.

二级参考文献60

  • 1Amazon SimpleDB. http://aws, amazon, com/simpledb/, 2011-8-10.
  • 2Connor Alexander G, Chrysanthis Panos K, Labrinidis Alexandros. Key key-value stores for efficiently processing graph data in the cloud//Proceedings of the GDM. Hannover, Germany, 2011:88-93.
  • 3Lordanov Borislav. HyperGraphDB: A generalized graph database//Proceedings of the IWGD. JiuZhai Valley, China, 2010:25-36.
  • 4Eifrem Emil. NOSQL: Scaling to size and scaling to complexity, http://blogs, neotechnology, com/emil/2009/11/ nosql-scaling tosize-and-scaling-to-complexity, html, 2009- 1-15.
  • 5Wu Sai, Jiang Da-Wei, Ooi Beng Chin et al. Efficient B-tree based indexing for cloud data proeessing//Proeeedings of the VLDB. Singapore, 2010: 1207-1218.
  • 6Wang Jin-Bao, Wu Sai, Gao Hong et al. Indexing multi dimensional data in a cloud system//Proceedings of the SIGMOD. Indianapolis, Indiana, USA, 2010: 591-602.
  • 7Tsatsanifos George, Sacharidis Dimitris, Sellis Timos et al. MIDAS: Multi-attribute indexing for distributed architecture systems//Proceedings of the SSTD. Minneapolis, MN, USA, 2011:168-185.
  • 8Aguilera M K, Golab W, Shah M A. A practical scalable distributed B-tree//Proceedings of the VLDB. Auckland, New Zealand, 2008: 598-609.
  • 9Zhang Xiang-Yu, Ai Jing, Wang Zhong-Yuan, Lu Jia-Heng et al. An efficient multi-dimensional index for cloud data management//Proceedings of the CloudDB. Hong Kong, China, 2009:17-24.
  • 10InfiniteGraph, the Distributed Graph Database. http:// www. infinitegraph, com/, 2011 -7 -29.

共引文献96

同被引文献4

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部