Abstract
XML keyword search enables users to retrieve information from XML data without knowing the data schemas. Most previous XML keyword search engines present the resulting XML fragments in a one-off manner, so users cannot manipulate them further to explore for the intended information. Such engines would have to return exactly the intended information every time, which is impossible given the ambiguity of keyword queries. In this paper, we present a new XML keyword search engine, XWord, that provides comprehensive support for effective user interaction, automatic return unit identification, and flexible matching and ranking semantics. XWord provides flexible input methods, allows users to expand a result XML fragment to include its nearby fragments, and gives users effective query-biased suggestions. Another key feature of XWord is its universal and automatic techniques, which work on arbitrary XML data without additional user intervention; this is very important for retrieving information from large amounts of heterogeneous XML data. We conduct extensive experiments to show the effectiveness and efficiency of XWord.
Source
《计算机应用与软件》
CSCD
Peking University Core Journal (北大核心)
2012, No. 11, pp. 141-147 (7 pages)
Computer Applications and Software
Funding
Science and Technology Commission of Shanghai Municipality projects (10511516005, 10dz1500107)
Specialized Research Fund for the Doctoral Program of Higher Education (20100071120033)