期刊文献+

XSLC:分层编码并面向查询的XML数据压缩算法

XSLC:Layered Coding and Query-Oriented XML Data Compression Algorithm
下载PDF
导出
摘要 XML(extensible markup language)文档已经被广泛用作应用程序的一个数据交换格式,针对XML数据的压缩技术也逐渐成为新的研究领域。提出XSLC(XMLstream layered-coding compression)算法,通过预先扫描DTD对数据模式进行分析,继而根据元素的父子关系进行子元素层面的编码;同时根据数据类型进行数据压缩,能够在压缩之后的文档上进行查询,因为仅需一遍压缩扫描所以可以应用于数据流环境。实验表明:XSLC算法的压缩比率和压缩时间均优于传统算法。 XML documents have been widely used as a data exchange format. XML (extensible markup language) data compression technology has become a new field of research. A compression method called XSLC (XML stream layered-coding compression) is proposed to compress and decompress XML stream in real time. When DTD (document type definition) is available, XSLC can analyze the data model and encode elements according to the relationship of father node and son node, compress data part according to its type, and support query operations applied on compressed files, as for only one time of scanning data is needed, all the processes can be implemented in XML data stream environment. Experimental results show that XSLC outperforms other methods in compression ratio and compression efficiency.
出处 《计算机科学与探索》 CSCD 2010年第2期145-152,共8页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金No.60673113 国家高技术研究发展计划(863)No.2007AA01Z191 2009AA01Z150 教育部科技创新工程重大项目培育资金项目No.708001~~
关键词 可扩展标记语言 压缩 文档类型定义 数据流 extensible markup language(XML) compression document type definition(DTD) data stream
  • 相关文献

参考文献5

  • 1Hartmut L, Dan S. XMilI: An efficient compressor for XML data[C]//Proc of the SIGMOD 2000. Texas: ACM Press, 2000: 153-164.
  • 2Pankaj M T, Jayant R H. XGRIND: A query friendly XML compressor[C]//Proc of the ICDE 2002. San Jose: IEEE Computer Society, 2002: 225-234.
  • 3Jun K M, Myung J P, Chin W C. XPRESS: A queriable compression for XML data[C]//Alon Y, Zachary G. Proc of the SIGMOD 2003. San Diego: ACM Press, 2003: 122-133.
  • 4王腾蛟,高军,杨冬青,唐世渭,刘云峰.面向XPath执行的XML数据流压缩方法[J].软件学报,2005,16(5):869-877. 被引量:17
  • 5Jefery S V. Design and analysis of dynamic Huffman codes[J].Journal of the ACM, 1987,34(4) : 825-845.

二级参考文献10

  • 1Hartmut L, Dan S. XMill: An efficient compressor for XML data. In: Weidong C, Jeffrey F, eds. Proc. of the SIGMOD 2000. Texas;ACM Press, 2000. 153-164.
  • 2Pankaj MT, Jayant RH. XGRIND: A query friendly XML compressor. In: Proc. of the ICDE 2002. San Jose: IEEE Computer Society, 2002. 225-234.
  • 3Jun KM, Myung JP, Chin WC. XPRESS: A queriable compression for XML data. In: Alon Y, Zachary G, eds. Proc. of the SIGMOD 2003. San Diego: ACM Press, 2003. 122-133.
  • 4Jacob Z, Abraham L. A universal algorithm for sequence data compression. IEEE Trans. on Information Theory, 1977,23(3):337-343.
  • 5Jeffery SV. Design and analysis of dynamic Huffman codes. Journal of the ACM, 1987,34(4):825-845.
  • 6Jean LG. GZIP. 2003. HTTP://www.gzip.com
  • 7SwissProt Data Set. 1998. http://www.cs.washington.edu/research/xmldatasets/data/SwissProt/SwissProt.xml
  • 8NASA Data Set. 2001. http://www.cs.washington.edu/research/xmldatasets/data/nasa/nasa.xml
  • 9Tree Bank Data Set. 2002. http://www.cs.washington.edu/research/xmldatasets/data/treebank/treebank_e.xml
  • 10Angel LD, Douglas L. XML generator. 1999. http://www.alphaworks.ibm.com/tech/xmlgenerator

共引文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部