摘要
传统基于关键词的搜索引擎不能充分利用XML文档的结构信息,搜索结果往往不精确;而基于结构信息和关键词的XML搜索技术又不适用于普通用户.基于关键词的XML语义检索克服了以上缺点,但需要提高检索效率.本文深入分析了XML文档结构潜藏的语义,提出了新的索引结构及两结点语义相关的判断函数,在此基础上提出了一种快速的XML语义检索算法,该算法大大减少了结点对语义相关的判断次数.对实际数据集的测试实验结果显示出新算法的有效性.
Traditional keyword-based search engine does not consider the additional information provided by the structure of XML documents,it returns imprecise results often;searching according to keywords and structure information of XML documents inputted is not suitable for contain users.Semantic search for XML data based on tag-keywords overcomes the limitations above, but its efficiency needs to be improved.This paper analyzes semantic information provided by the structure of XML documents deeply.It puts forward a new index structure for XML data and semantic related decision function between two nodes.Based on this,it proposes a fast semantic search algorithm for XML data.The search algorithm reduces the times to decide semantic correlation greatly.The experimental results with real data sets illustrate the effectiveness of the proposed algorithm.
出处
《电子学报》
EI
CAS
CSCD
北大核心
2007年第11期2220-2225,共6页
Acta Electronica Sinica
关键词
XML文档
语义检索
索引结构
信息检索
XML document
semantic search
index structure
information retrieval