期刊文献+

基于结构冗余性校准的在线式社会网络压缩 被引量:2

Compressing Online Social Networks by Calibrating Structure Redundancy
下载PDF
导出
摘要 随着社交网络、移动互联网等新兴服务的不断涌现,在线社会网络正以前所未有的速度增长并且呈现出极强的演化特性.网络压缩技术能够将大规模网络压缩成规模更小、结构信息更简洁的网络,在数据存储和网络可视化领域发挥着重要作用.现有的压缩算法为了优化压缩损失,重复比对原始网络与压缩网络之间的差异导致过高的时间开销,并且算法仅局限于静态网络,无法满足在线社会网络的演变要求.针对上述问题,提出一种解决演化网络压缩问题的高效算法,首先设计了基于局部化判定的结构合并贡献函数及其快速调整算法,将网络的首次压缩复杂度控制在O(n)到O(mn)之间;其次,设计了一种面向演化网络压缩的动态校准算法,参照网络演化前后拓扑结构的变化,校准前一时刻的压缩表达以避免网络的重复压缩,在满足在线社会网络演变要求的同时提高了压缩效率;最后,通过对真实数据集的实验分析,验证了算法的有效性. Online social networks are growing in unprecedented speed and emerge strong evolutionary characteristic which is caused by emerging new services such as social network and mobile Internet. Network compression is a new technique for representing a large-scale network by a concise network that can capture the structural and attributive information of the original large network. Network compression plays an important role in data storage and the visualization of networks. The previous network compression algorithm leads to exorbitant costs by comparing the difference between the original network and the compression one repeatedly when optimizing the compression loss. Furthermore, it can only deal with static network but not the evolutionary ones. To solve these problems, this paper proposes an efficient algorithm to deal with the evolutionary network compression issue. A function to measure structure merging contribution based on localized decision and its quick-adjusting algorithm are first designed, its time complexity of initial network compression is controlled between O (n) and O (ran). Then a dynamic calibrating algorithm for evolutionary network compression is proposed, which takes advantage of the changes of the topological structure in each snapshot to adjust the previous compression expression, and avoids redundantly compressing while achieving high efficiency. Finally, competitive experiments on real datasets demonstrate the effectiveness and efficiency of the proposed algorithm.
出处 《计算机研究与发展》 EI CSCD 北大核心 2013年第12期2504-2519,共16页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61073041 61073043 61202274 61370083) 高等学校博士学科点专项科研基金项目(20112304110011 20122304110012)
关键词 在线社会网络 网络压缩 结构合并贡献 损失优化 压缩表达校准 online social networks network compression structure merging contribution lossoptimization compression expression calibrating
  • 相关文献

参考文献20

  • 1Toivonen H, Zhou F, Hartikainen A, et al. Compression of weighted graphs [C] //Proc of the 17th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2011:965-973.
  • 2Zhang N, Tian Y Y, Patel J M. Discovery-driven graph summarization [C] //Proe of the 26th Int Conf on Data Engineering. Piscataway, NJ: IEEE, 2010: 880-891.
  • 3Tian Y Y, Hankina R A, Patel J M. Efficient aggregation for graph summarization [C] //Proc of the 2008 ACM SIGMOD Int Conf on Management Data. New York: ACM, 2008:567-580.
  • 4Navlakha S, Rastogi R, Shrivastava N. Graph summarization with bound error [C] //Proc of the 2008 ACM SIGMOD Int Conf on Management Data. New York: ACM, 2008:419-432.
  • 5Blandford D K, Blelloch G E, Kash I A. Compact representations of separable graphs [C] //Proc of the 4th Annual ACM-SIAM Symp on Discrete Algorithms. Philadelphia, PA: Society for Industrial and Applied Mathematics, 2003:679-688.
  • 6Raghavan S, Molina G H. Represent web graphs [(;] //Proe of the 19th Int Conf on Data Engineering. Piscataway, NJ: IEEE, 2003:405-416.
  • 7Boldi P, Vigna S. The webgraph framework I: Compression techniques [C] //Proc of the 13th Int Con[ on World Wide Web. New York: ACM, 2004:595-602.
  • 8Tan G, Poletto M, Guttag J, et al. Role classification of hosts within enterprise networks based on connection patterns [C] //Proc of the 2003 USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2003: 15 -28.
  • 9Chakrabarti D, Faloutsos C, Zhan Y. Visualization of large networks with min-cut plots, A Plots and R-MAT [J]. International Journal of Man machine Studies, 2007, 65(5): 434-445.
  • 10Newman M E J, Girvan M. Finding and evaluating community structure in networks [J]. Physical Review E, 2004, 69(2): 026113.

二级参考文献34

  • 1CNN Facebook nearly as large as U. S. population [OL]. (2009-09 -16)[2011-04-27]. http..//edition, cnn. com/2009/ TECH/09/16/facebook. profit/.
  • 2Raghavan S, Molina G H. Representing Web graphs [C] // Proc of Int Conf on Data Engineering 2003. Piscataway, NJ: IEEE, 2003:405-416.
  • 3Rjeili A A, Karypis G. Multilevel algorithms for partitioning power-law graphs [C] //Proc of Int Parallel and Distributed Processing Symp 2006. Piscataway, NJ: IEEE, 2006: 16- 575.
  • 4Tian Y Y, Hankins R A, Patel M J. Effcient aggregation for graph summarization [C] //Proc of ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2008 : 567-580.
  • 5Zhang N, Tian Y Y, Patel M J. Discovery-driven graph summarization [C] //Proc of Int Conf on Data Engineering 2010. Piscataway, NJ: IEEE, 2010: 880-891.
  • 6Chakrabarti D, Faloutsos C. Graph mining: Laws, generators, and algorithms [J]. ACM Computing Surveys, 2006, 38(1): article No. 2.
  • 7Newman M E J, The structure and function of complex networks [J]. ACM Sigcsim Installation Management Review, 2003, 45: 167-256.
  • 8Chakrabarti D, Faloutsos C, Zhan Y. Visualization of large networks with min-cut plots, A-plots and R MAT [J]. Int Journal of Man-machine Studies, 2007, 65(5): 434-445.
  • 9Jun H, Wang W, Prins J, et al. Spin: Mining maximal frequent subgraphs from graph databases [C] //Proc of Knowledge Discovery and Data Mining 2004. New York: ACM, 2004:581-586.
  • 10Macropol K, Singh A K. Scalable discovery of best clusters on large graphs [C] //Proc of Int Conf on Very Large Data Bases 2010. New York: ACM, 2010: 693-702.

共引文献41

同被引文献16

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部