期刊文献+

一种改进的邻接关系可查询压缩算法

An Improved Compression Algorithm Supporting Neighbor Query
下载PDF
导出
摘要 目前多数数据压缩算法不能直接在压缩结果上进行数据查询,大数据的线性化压缩算法虽然可直接在压缩后的数据上进行邻接关系查询,但压缩率较低。针对该问题,对线性化压缩的实现原理进行研究,分析MPk线性化算法在不同社会网络样本下的压缩效率,发现线性化压缩结果中存在冗余信息,并针对该情况设计改进算法,删去原有数据结构中的冗余部分,进一步提高压缩率。实验结果证明,改进算法的时间复杂度与原算法相同,压缩率平均提升23%。 Nowadays,most data compression algorithms do not support performing query directly on compressed data.Though the compression algorithm can perform query neighbor relations on compressed result,the compression ratio is relatively low. To solve the problem,this paper does some research on the principle of linearization compression algorithm. It analyzes the compression ratio of MPkalgorithm in different sample social network and finds the redundant information in compressed result. To eliminate these redundant information,it improves the original data structure,removes the unnecessary bits and improves the compression ratio. Experiments show that the proposed algorithm has same time complexity compared with primal algorithm,but the compression ratio can be increased by 23% in average.
作者 高圣巍 彭超
出处 《计算机工程》 CAS CSCD 北大核心 2015年第1期61-64,共4页 Computer Engineering
基金 国家自然科学基金资助项目(91118008 61232006) 国家"863"计划基金资助重点项目(SQ2010AA0101016001) 上海市教育委员会科研创新基金资助项目(44440590)
关键词 线性化压缩算法 大数据 社会网络 启发式算法 Eulerian数据结构 linearization compression algorithm big data social network heuristic algorithm Eulerian data structure
  • 相关文献

参考文献12

  • 1马帅,李佳,刘旭东,怀进鹏.大数据时代的图搜索技术[J].信息通信技术,2013,7(6):44-51. 被引量:3
  • 2Adler M,Mitzenmacher M.Towards Compressing Web Graphs[C]//Proceedings of2001Data Compression Conference.Snowbird,USA:IEEE Press,2001:202-213.
  • 3Apostolico A,Drovandi G.Graph Compression by BFS[J].Algorithms,2009,2(3):1031-1044.
  • 4郑翠芳.几种常用无损数据压缩算法研究[J].计算机技术与发展,2011,21(9):73-76. 被引量:45
  • 5Boldi P,Santini M,Vigna S.Permuting Web Graphs,Algorithms and Models for the Web-Graph[M].Berlin,Germany:Springer,2009:116-126.
  • 6Boldi P,Vigna S.The Web Graph Framework I:Compression Techniques[C]//Proceedings of the13th International Conference on World Wide Web.New York,USA:ACM Press,2004:595-602.
  • 7Boldi P,Vigna S.The Web Graph Framework II:Codes for the World-wide Web[C]//Proceedings of2004Data Compression Conference.Snowbird,USA:IEEE Press,2004:528.
  • 8Boldi P,Rosa M,Santini M,et al.Layered Label Propagation:A Multiresolution Coordinate-free Ordering for Compressing Social Networks[C]//Proceedings of the20th International Conference on World Wide Web.Hyderabad,India[s.n.],2011:587-596.
  • 9Chierichetti F,Kumar R,Lattanzi S,et al.On Compressing Social Networks[C]//Proceedings of the15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Paris,France:ACM Press,2009:219-228.
  • 10Maserrat H,Pei J.Neighbor Query Friendly Compression of Social Networks[C]//Proceedings of the16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Washington D.C.,USA:[s.n.]:2010:533-542.

二级参考文献58

  • 1王忠效,姜丹.关于Lempel-Ziv 77压缩算法及其实现的研究[J].计算机研究与发展,1996,33(5):329-340. 被引量:19
  • 2王刚,刘立柱.ZIP文件压缩编码分析[J].微计算机信息,2006(05X):283-285. 被引量:8
  • 3Shannon C E. A mathematical Theory of Communication [J]. The Bell System Technical Journal, 1948,27 ( 7 ) : 379-423.
  • 4袁玫,袁文.数据压缩技术及其应用[M].北京:电子工业出版社,1994.
  • 5Rissanen J, Langdon G G. Universal modeling and coding[ J ]. IEEE Trans on Information Theory,1981, 27(1 ) :12-23.
  • 6Ziv J, Lempel A. A Universal Algorithm for Sequential Data Compression[ J ]. IEEE Transactions on Information Theory, 1977, 23(3) :337-343.
  • 7Ziv J, Lempel A. Compression of Individual Sequences via Variable Rate Coding [ J ]. IEEE Transactions on Information Theory, 1978,24(5 ) :530-536.
  • 8Welch. A Technique for High Performance Data Compression [ J ]. IEEE Compuler, 1984,17 (6) :8-19.
  • 9马帅,李佳,刘旭东,等.图查询:社会计算时代的新型搜索[J].中国计算机学会通讯,2012,8(11):26-32.
  • 10Adam Schenke r,Mark Last,Horst Bunk ,et al.Classification of Web Documents Using Graph Matching[C]//IJPRAI Conference,2004.

共引文献46

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部