Structure Summary for Keyword Search over XML Documents (Chinese title: 一种基于XML文档关键字检索的结构索引). Cited by: 5
Abstract: Indexing has a major effect on the retrieval efficiency of XML data. After analyzing existing XML structure indexes and considering the characteristics of XML documents, this paper proposes LSS (Level Structure Summary), a structure index for keyword search. LSS merges the nodes of the XML tree that share the same label path, which lets it decide efficiently whether two nodes are homogeneous or heterogeneous. The paper implements CSCAN, an algorithm that constructs the LSS index, and designs LSSearch, an XML keyword search algorithm built on LSS. Guided by the LSS index, LSSearch splits each keyword's original inverted list into subsets of different types and then runs the query over all of the subsets. Experimental results show that LSS helps reduce the size of the keyword inverted lists for an XML document and improves retrieval efficiency.

Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2010, No. 12, pp. 120-124 (5 pages)

Funding: National 863 Program key project (2009AA1Z134); National Natural Science Foundation of China (60803043, 60720106001)

Keywords: XML; keyword search; index; inverted list
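The two ideas named in the abstract, merging nodes that share a label path and splitting a keyword's inverted list into subsets, can be illustrated with a small Python sketch. This is not the paper's CSCAN or LSSearch algorithm, whose details are not given here. The function names `summarize` and `split_by_path`, the preorder node numbering, and the sample document are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document, used only for illustration.
DOC = """<bib>
  <book><title>XML search</title><author>Li</author></book>
  <book><title>Index design</title><author>Wang</author></book>
  <paper><title>XML index</title></paper>
</bib>"""


def summarize(xml_text):
    """Walk the tree once and build two maps:
    label path -> node ids (nodes with the same label path are merged),
    keyword    -> postings of (node id, label path)."""
    root = ET.fromstring(xml_text)
    path_to_nodes = defaultdict(list)
    inverted = defaultdict(list)
    next_id = 0

    def visit(elem, parent_path):
        nonlocal next_id
        node_id = next_id          # preorder numbering (an assumption)
        next_id += 1
        path = parent_path + "/" + elem.tag
        path_to_nodes[path].append(node_id)
        # Index only the element's own leading text, lowercased.
        for word in (elem.text or "").lower().split():
            inverted[word].append((node_id, path))
        for child in elem:
            visit(child, path)

    visit(root, "")
    return path_to_nodes, inverted


def split_by_path(postings):
    """Split one keyword's inverted list into subsets keyed by label path."""
    subsets = defaultdict(list)
    for node_id, path in postings:
        subsets[path].append(node_id)
    return dict(subsets)


if __name__ == "__main__":
    paths, inverted = summarize(DOC)
    print(dict(paths))
    print(split_by_path(inverted["xml"]))
```

In this sketch, two nodes count as homogeneous when they fall under the same label path key, so the check is a dictionary lookup instead of a tree comparison. A search procedure could then process each subset separately, which is the general direction the abstract describes for LSSearch.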