
Efficiently Processing XML Path Queries Using Automata (cited by: 6)
Abstract: In XML query processing, path expressions, which are used in most XML query languages, are highly expressive for locating and querying XML data and the structural relationships within it. Because XML data is semi-structured, processing path expression queries raises characteristics and challenges that differ from those of traditional database query processing. Several existing query processing techniques can handle path expressions, but the large volume of intermediate results they generate causes query efficiency to drop sharply on large XML documents and complex path expressions. This paper uses automata to design SAM, an efficient method for processing XML path expression queries. The basic idea of SAM is to transform a path expression query into a fully equivalent automaton and then match that automaton against the schema paths abstracted from the XML document. The paper also presents an effective SAM-based solution for evaluating the "//" operator in path expressions. Experimental results show that SAM is an efficient and practical method for evaluating complex path expression queries over large volumes of XML data.
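Source: Chinese Journal of Computers (计算机学报; indexed in EI, CSCD, PKU Core), 2007, No. 9, pp. 1520-1532 (13 pages)
Funding: Supported by the National Natural Science Foundation of China (60573089) and the National Basic Research Program of China ("973" Program) (2006CB303103)
Keywords: XML path expression; automata; query processing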
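The abstract's core idea, compiling a path expression into an automaton and running it over schema paths, can be illustrated with a minimal sketch. This is not the SAM algorithm from the paper, whose details are not given here. It is a plain nondeterministic automaton simulation, assuming a path language with only the "/" and "//" axes, name tests, and the "*" wildcard. The names `parse_path`, `matches`, and `select` are hypothetical, as is the representation of a schema path as a list of element labels from the root.

```python
from typing import List, Set, Tuple

Step = Tuple[str, str]  # (axis, label); axis is "/" or "//"


def parse_path(expr: str) -> List[Step]:
    """Split a path such as '/site//item/name' into (axis, label) steps."""
    steps: List[Step] = []
    i = 0
    while i < len(expr):
        if expr.startswith("//", i):
            axis, i = "//", i + 2
        elif expr.startswith("/", i):
            axis, i = "/", i + 1
        else:
            raise ValueError(f"expected '/' or '//' at offset {i}")
        j = i
        while j < len(expr) and expr[j] != "/":
            j += 1
        if j == i:
            raise ValueError(f"empty step at offset {i}")
        steps.append((axis, expr[i:j]))
        i = j
    return steps


def matches(steps: List[Step], schema_path: List[str]) -> bool:
    """Simulate the automaton; state k means the first k steps are matched."""
    states: Set[int] = {0}
    for label in schema_path:
        nxt: Set[int] = set()
        for k in states:
            if k == len(steps):
                continue  # accepting state has no outgoing transitions
            axis, name = steps[k]
            if axis == "//":
                nxt.add(k)  # self-loop: "//" may skip any intermediate label
            if name == "*" or name == label:
                nxt.add(k + 1)  # consume the label and advance one step
        states = nxt
        if not states:
            return False
    # Accept only if the whole schema path was consumed by the whole query.
    return len(steps) in states


def select(expr: str, schema_paths: List[List[str]]) -> List[List[str]]:
    """Return the schema paths accepted by the automaton built from expr."""
    steps = parse_path(expr)
    return [p for p in schema_paths if matches(steps, p)]


if __name__ == "__main__":
    paths = [
        ["site", "regions", "asia", "item", "name"],
        ["site", "item", "name"],
        ["site", "regions", "item"],
    ]
    for p in select("/site//item/name", paths):
        print("/" + "/".join(p))
```

In this sketch the "//" axis becomes a self-loop on the current state, so a single pass over each schema path is enough and no intermediate result sets are materialized. Because the automaton runs over distinct schema paths rather than over every document node, the work depends on the number of distinct paths, which is typically far smaller than the document itself.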

