A Fast Semantic Search Algorithm for XML Data (一种快速的XML语义检索算法)

Cited by: 6
Abstract: Traditional keyword-based search engines cannot make full use of the structural information in XML documents, so their results are often imprecise, while XML search techniques that require both structural information and keywords are unsuitable for ordinary users. Keyword-based semantic search over XML overcomes both drawbacks, but its retrieval efficiency needs to be improved. This paper analyzes in depth the semantics latent in the structure of XML documents and proposes a new index structure together with a decision function for determining whether two nodes are semantically related. Building on these, it presents a fast semantic search algorithm for XML that greatly reduces the number of pairwise semantic-relatedness checks between nodes. Experiments on real data sets demonstrate the effectiveness of the new algorithm.
Source: Acta Electronica Sinica (《电子学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2007, No. 11, pp. 2220-2225 (6 pages).
Keywords: XML document; semantic search; index structure; information retrieval
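
The abstract refers to a decision function for whether two nodes are semantically related but does not define it. As a rough illustration of what such a pairwise check can look like, the sketch below implements the "interconnection" test used by the XSEarch engine (Cohen et al., 2003): two nodes are related if the tree path between them contains no two distinct nodes with the same label, except possibly the two endpoints themselves. This is a stand-in technique, not the paper's own function or index; the `Node` class and the names `tree_path` and `interconnected` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)  # identity-based equality: each object is one XML node
class Node:
    label: str
    parent: Optional["Node"] = None
    children: List["Node"] = field(default_factory=list)

    def add(self, label: str) -> "Node":
        """Create a child element with the given label and return it."""
        child = Node(label, parent=self)
        self.children.append(child)
        return child


def ancestors_or_self(n: Optional[Node]) -> List[Node]:
    """Return the chain n, parent(n), ..., root."""
    chain = []
    while n is not None:
        chain.append(n)
        n = n.parent
    return chain


def tree_path(a: Node, b: Node) -> List[Node]:
    """Nodes on the path from a up to the lowest common ancestor and down to b."""
    up = ancestors_or_self(a)
    down = ancestors_or_self(b)
    index_in_up = {id(n): i for i, n in enumerate(up)}
    for j, n in enumerate(down):
        if id(n) in index_in_up:  # first shared ancestor is the LCA
            i = index_in_up[id(n)]
            return up[: i + 1] + down[:j][::-1]
    raise ValueError("nodes belong to different trees")


def interconnected(a: Node, b: Node) -> bool:
    """XSEarch-style test (assumed here, not taken from the paper)."""
    seen = {}
    for n in tree_path(a, b):
        other = seen.get(n.label)
        if other is not None and {id(other), id(n)} != {id(a), id(b)}:
            return False  # a repeated label that is not just the two endpoints
        seen[n.label] = n
    return True


# Tiny document: <bib><book><title/><author/><author/></book><book><author/></book></bib>
bib = Node("bib")
book1, book2 = bib.add("book"), bib.add("book")
title1 = book1.add("title")
author1a, author1b = book1.add("author"), book1.add("author")
author2 = book2.add("author")

print(interconnected(title1, author1a))    # same book
print(interconnected(title1, author2))     # path crosses two "book" nodes
print(interconnected(author1a, author1b))  # only the endpoints share a label
```

Each call walks the full path between one pair of nodes, so checking every candidate pair is costly; reducing the number of such checks is the efficiency gain the abstract claims for the proposed algorithm.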