期刊文献+

基于传统文本检索系统的XML索引实现研究 被引量:6

XML indexing Based on Traditional IR System
下载PDF
导出
摘要 作为重要的信息交换与存储标准,XML得到学者们越来越多的重视。作为XML检索研究的重要组成部分,XML索引机制与实现的研究已经取得了一定的研究成果。然而,大部分研究都是基于数据库及专门的半结构化管理器之上的。本文提出了如何在传统文本检索系统Okapi的基础上构建XML索引的方法。首先介绍了Okapi的索引结构。在此基础上,深入探讨了XML索引的存储结构及实现。并对索引的性能进行了评价。 Being an important data exchange and information storage standard, XML gained much attention and much work has been done on XML indexing. However, most of the research is based on database system and specialized semi-structured data management system. In this paper, we propose a comprehensive method for XML indexing based on traditional IR system Okapi. Firstly, Okapi and its indexing structure are introduced. Then we fully exploit the index structure, indexing algorithm and performance evaluation of this method.
作者 陆伟
出处 《情报学报》 CSSCI 北大核心 2006年第6期679-685,共7页 Journal of the China Society for Scientific and Technical Information
基金 国家社会科学基金资助项目(编号 04CTQ005)和湖北省科技攻关项目(编号:2004AA101C99)成果之一.
关键词 文本检索系统 Okapi XML索引实现 traditional IR system, Okapi, XML, index structure and algorithm.
  • 相关文献

参考文献13

  • 1Cooper B F,Sample N,Franklin M J,Hjaltason G R,Shadmon M.A fast index for semistructured data.In:Proceedings of the 27th VLDB Conference,Roma,Italy,September,2001.Morgan Kaufmann,2001.341~350
  • 2Deutsch A,Fernandez M,Suciu D.Storing semistructured data with STORED.In:Proceedings of ACM SIGMOD,Philadelphia,PN,1999.431~442
  • 3Harding P J,Li Q,Moon B.XISS/R:XML Indexing and storage system using RDBMS.In:Proceedings of the 29th VLDB Conference,Berlin,Germany,September,2003.Morgan Kaufmann,2003.1073~1076
  • 4McHugh J.Lore:a database management system for semistructured data.SIGMOD Record,1997,26(3):54~66
  • 5Software AG.Tamino XML database.http://www.softwareag.com/tamino/,2006-03-29
  • 6XYZFind.XML Database.http://www.xyzfind.com,2006-03-29
  • 7Okapi Documentation.http://soi.city.ac.uk/~andym/OKAPI-PACK,2006-03-29
  • 8INEX Web Site.http://inex.is.informatik.uni-duisburg.de/,2006-03-18
  • 9Expat XML Parser.http://sourceforge.net/projects/expat/,2006-03-18
  • 10Fuhr N,Govert N.Index compression vs.retrieval time of inverted files for XML documents.In:CIKM'02,McLean,Virginia,USA,November,2002.662~664

同被引文献73

引证文献6

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部