期刊文献+

Keyword Searches in Data-Centric XML Documents Using Tree Partitioning 被引量:1

Keyword Searches in Data-Centric XML Documents Using Tree Partitioning
原文传递
导出
摘要 This paper presents an effective keyword search method for data-centric extensive markup language (XML) documents. The method divides an XML document into compact connected integral subtrees, called self-integral trees (SI-Trees), to capture the structural information in the XML document. The SI-Trees are generated based on a schema guide. Meaningful self-integral trees (MSI-Trees) are identified, which contain all or some of the input keywords for the keyword search in the XML documents. Indexing is used to accelerate the retrieval of MSI-Trees related to the input keywords. The MSI-Trees are ranked to identify the top-k results with the highest ranks. Extensive tests demonstrate that this method costs 10-100 ms to answer a keyword query, and outperforms existing approaches by 1-2 orders of magnitude. This paper presents an effective keyword search method for data-centric extensive markup language (XML) documents. The method divides an XML document into compact connected integral subtrees, called self-integral trees (SI-Trees), to capture the structural information in the XML document. The SI-Trees are generated based on a schema guide. Meaningful self-integral trees (MSI-Trees) are identified, which contain all or some of the input keywords for the keyword search in the XML documents. Indexing is used to accelerate the retrieval of MSI-Trees related to the input keywords. The MSI-Trees are ranked to identify the top-k results with the highest ranks. Extensive tests demonstrate that this method costs 10-100 ms to answer a keyword query, and outperforms existing approaches by 1-2 orders of magnitude.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第1期7-18,共12页 清华大学学报(自然科学版(英文版)
基金 Partly Supported by the National High-Tech Research and Development (863) Program of China (No. 2007AA01Z152) the Basic Research Foundation of Tsinghua National Laboratory for Information Science and Technology (TNList) 2008 HP Labs Innovation Research Program
关键词 keyword searches extensive markup language (XML) self-integral trees RANKING INDEXING keyword searches extensive markup language (XML) self-integral trees ranking indexing
  • 相关文献

参考文献10

  • 1Li G,,Feng J,Wang J, et al.RACE: Finding and rankingcompact connected trees for keyword proximity search over XML documents[].Proceedings of the th Interna- tional Conference on World Wide Web.2008
  • 2Li G,Feng J,Wang J, et al.SAILER: An effective search engine for unified retrieval of heterogeneous XML and web documents[].Proceedings of the th International Conference on World Wide Web.2008
  • 3Li G,Feng J,Wang J, et al.Efficient keyword search for valuable lcas over XML documents[].Proceedings of the Sixteenth ACM Conference on Information and Knowl- edge Management.2007
  • 4Li G,Feng J,Zhou L.Efficient keyword search over data-centric XML documents[].Proceedings of Advances in Data and Web Management Joint th Asia-Pacific Web Conference and th International Conference on Web-Age Information Management.2007
  • 5Botev C,Amer-Yahia S,Shanmugasundaram J.Expres- siveness and performance of full-text search languages[].Proceedings of Advances in Database Technology - EDBT th International Conference on Extending Data- base Technology.2006
  • 6Cohen S,Kanza Y,Kimelfeld B, et al.Interconnection semantics for keyword search in XML[].Proceedings of the ACM CIKM International Conference on Infor- mation and Knowledge Management.2005
  • 7Hristidis V,Koudas N,Papakonstantinou Y, et al.Keyword proximity search in XML trees[].IEEE Transaction Knowl- edge Data Engineering.2006
  • 8Pradhan S.An algebraic query model for effective and efficient retrieval of XML fragments[].Proceedings of the nd International Conference on Very Large Data Bases.2006
  • 9Agrawal S,Chaudhuri S,Das G.Dbxplorer: A system for keyword-based search over relational databases[].Pro- ceedings of the th International Conference on Data En- gineering.2002
  • 10Bhalotia G,Hulgeri A,Nakhe C, et al.Keyword searching and browsing in databases using banks[].Proceedings of the th International Conference on Data Engineering.2002

同被引文献13

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部