期刊文献+

基于BWC的XML文本数据索引技术研究

Research on Indexing Technique of XML Text Data Based on BWC
下载PDF
导出
摘要 在XML文档中,相当大的部分是由文本数据组成的,针对XML文本数据占用空间较大、对压缩文本数据有效搜索效率较低的难点,基于BWC提出了压缩XML文本数据索引的技术,通过构造全文本数据模型,并利用整体压缩自索引存储XML文档的文本数据,实验结果表明,该技术不仅有效支持XPath查询语言文本搜索,而且内存消耗相对较小,实现了中小规模数据的内存搜索. A large number of fractions of an XML document are composed of text data.Considering the problems of the size of large XML document and less efficiency of effective searching on compressed text data,an index technology for compressed XML text data based on BWC is presented.The proposed technique is implemented by constructing a full text data model and in which the text data of XML document is stored with global compressed self-index.Experimental results shows,the proposed technique not only supports XPath query language search text effectively,but also needs fewer consumption of the memory so as to realize small and medium-scale data memory search.
出处 《昆明学院学报》 2011年第3期60-63,共4页 Journal of Kunming University
基金 安徽省自然科学研究资助项目(KJ2010B280)
关键词 自索引 后向搜索 文本数据 BWC self-index backward searching text data Burrows-Wheeler Compression
  • 相关文献

参考文献9

  • 1SCHOLTZ R. XML Mind products: Qizx XML query engine [ EB/ OL]. [ 2007 - 07 - 12 ]. http ://www. xmhnind, com/qizx.
  • 2SIGNUM T. Digital humanities [ EB/OL]. [ 2008 - 06 - 29 ]. http :// tauro, signum, sns. it.
  • 3BONCZ P A, GRUST T, KEULEN V M, et al. MonetDB/XQuery: A fast XQuery processor powered by a re|alional engine[ C]. ACM SIC- MOD international conference on Managemeut of data, 2006:479 - 490.
  • 4KAY M. Ten reasons why Saxon XQuery is fast[ J]. IEEE Data Engi- neering Bulletin,2008,31 ( 4 ) : 65 - 74.
  • 5FERRAGINA P, LUCCIO F, MANZINI G, et al. Structuring labeled trees for optimal succinctness, and beyond [ J ]. IEEE Symposium on Foundations of Computer Science, 2005,42 : 184 - 196.
  • 6BURROWS M, WHEELER D J. A block-sorting lossless data com- pression algorithm[ J ]. Digital Systems Research Center Research Re-- port, I994,124:3 - 10.
  • 7MANZINI G. An analysis of the Burrows-Wheeler transform [ J ]. J ACM,2001,48(3) :407 -430.
  • 8FERRAGINA P, MANZINI G, MAKINEN V, et al. Compressed rep- resentations of sequences and full-text indexes [ J ]. ACM TALG, 2007,3(2) :203 -215.
  • 9NAVARRO G, MAKINEN V. Compressed full-text indexes [ J ]. ACM Comp Surv,2007,39(1 ) :312 -323.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部