期刊文献+

基于双索引结构的XML文档查询设计及优化

Query design and optimization of XML document content based on dual index structure
下载PDF
导出
摘要 为了解决大型XML文档检索时间长、响应速度慢、内存资源消耗大等问题,设计了类B树形结构的双索引结构,提出了基于双索引结构快速定位目标内容的查询方法。采用基于路径的倒排索引结构,降低了检索内容之间逐个比较Dewey编码的时间消耗。同时针对XML文档内容进行分词处理构建数据单元,通过数据单元间的逻辑关系建立Path Guide索引库,避免对查询内容无关节点的访问。多组对比实验结果表明,基于内容的双索引结构查询方法及优化方案在查询效率上表现出明显的优越性。 In order to solve problems about large XML documents, such as time-consuming retrieval, slow response speed and excessive resource consumption, the dual index structure based on B tree is designed, and a query method based on dual index structure is proposed to quickly locate the target content. The inverted index structure based on the path is adopted for reducing effectively time consumption of the content retrieval by comparing the Dewey encoding. At the same time, for XML document contents, the data units are constructed by the process of word segmentation, and the PathGuide index data- base is established through the logical relationship between the data units. The index database can effectively avoid the meaningless access to the irrelevant nodes of the query content. Through multiple sets of comparative experiments, the re- sults indicate that the proposed method and the optimization solution show obvious superiority in the query efficiency.
出处 《桂林电子科技大学学报》 2017年第2期111-115,共5页 Journal of Guilin University of Electronic Technology
基金 国家自然科学基金(61362021 61661017) 广西科技创新能力与条件建设计划(桂科能1598025-21) 广西自然科学基金(2013GXNSFDA019030 2014GXNSFDA118035 2016GXNSFAA380149) 认知无线电教育部重点实验室基金(CRKL150103 2011KF11)
关键词 可扩展标记语言 内容查询 数据单元 倒排索引 双索引结构 extensible markup language content query data unit inverted index dual index structure
  • 相关文献

参考文献3

二级参考文献117

  • 1孔令波,唐世渭,杨冬青,王腾蛟,高军.XML数据索引技术[J].软件学报,2005,16(12):2063-2079. 被引量:55
  • 2陆伟,Stephen Robertson.基于域加权词频法的XML文档级检索实现与评价[J].中国图书馆学报,2006,32(6):57-60. 被引量:8
  • 3李晓光,于戈,龚剑,王大玲,鲍玉斌.有效的非完全结构XML查询[J].计算机学报,2007,30(1):57-67. 被引量:8
  • 4孔令波,唐世渭,杨冬青,王腾蛟,高军.XML信息检索中最小子树根节点问题的分层算法[J].软件学报,2007,18(4):919-932. 被引量:23
  • 5QUIN L.Extensible markup language (XML)[EB/OL].(2009-04-16)[2009-06-22].http://ww.w3.org/XML.
  • 6GUO L,SHAO F,BOTEV C,et al.XRANK:Ranked keyword search over XML documents[C]// Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.New York:ACM Press,2003:16-27.
  • 7XU Y,PAPAKONSTANTINOU Y.Efficient keyword search for smallest LCAs in XML databases[C]// Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data.Baltimore:ACM Press,2005:527-538.
  • 8GUO L,FENG J,WANG J,et al.Effective keyword search for valuable LCAs over XML documents[C]// Proceedings of the 16th ACM Conference on Information and Knowledge Management.New York:ACM Press,2007:31-40.
  • 9COHEN S,MAMOU J,KANZA Y,et al.XSEarch:A semantic search engine for XML[C]// Proceedings of the 29th International Conference on Very Large Data Bases.Berlin,Germany:VLDB Endowment,2003:45-56.
  • 10XU Y,PAPAKONSTANTINOU Y.Efficient LCA based keyword search in XML data[C]// Proceedings of the 11th International Conference on Extending Database Technology.New York:ACM Press,2008:535-546.

共引文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部