期刊文献+

基于列表的可扩展标记语言流数据查询处理方法 被引量:2

Query processing method of XML streaming data using list
下载PDF
导出
摘要 针对半结构化可扩展标记语言(XML)流数据实时在线到达,顺序性一次访问及处理时效性高、缓存量小的需求,以及目前算法在大规模数据处理中查询表达式的能力有限、效率尚不能满足实际应用的现状,基于SAX解析,提出以列表及关系指针组合处理XPath查询的QXSList方法。首先定义数据模型,给出算法实现的整体框架,然后分别针对两个不同的XPath查询片段重点考虑了谓词判断条件和通配符的处理方法;该方法通过层次值计算判断节点的结构关系,利用关系指针链接多个候选节点列表,获取查询查询结果;最后分析给出优化算法,进一步减少缓存管理。通过实验对该方法与QStream++方法及Monet DB和SAXON查询引擎的运行时间和内存占比进行分析,得出与同类算法相比,随着数据量级的增加,效率提升在30%以上,且运行过程中内存占比接近于常量。 Focused on the characteristics of processing semi-structure e Xtensible Markup Language( XML) streaming data such as the stream real-time arriving continuously,requiring to be read sequentially and only once into memory,the query must be processed on the fly and usable buffer size is very little,and concerned the current status for limitation of query expression and inefficiency in practical applications of processing large scale data,QXList method was proposed for massive data processing based on SAX parsing XML. Data model and algorithm integrated framework were defined firstly. The integrated methods to process predicate and wildcard were discussed in detail. Layer value was used to determine the relationship of two elements and relational pointer was constructed to link multiple candidate nodes' lists to get query results in this method. Two optimal points were analyzed for decreasing buffer size. The experimental results show that the proposed approach is effective and efficient to this problem,and outperforms the state-of-the-art algorithms about 30 percent such as QStream + + and query engines Monet DB and SAXSON especially for large processed data. At the same time,memory usage is nearly constant.
出处 《计算机应用》 CSCD 北大核心 2016年第3期665-669,686,共6页 journal of Computer Applications
基金 北京市自然科学基金资助项目(4122011) 国家自然科学基金青年基金资助项目(61202074) 河北省教育厅青年基金资助项目(QN2014178) 北华航天工业学院校级科技创新团队资助项目(XJTD20140)~~
关键词 流数据 查询处理 XPATH查询 关系指针 缓存管理 streaming data query processing XPath query relational pointer buffer management
  • 相关文献

参考文献14

  • 1CLARK J, DEROSE S. XML Path language (XPath) Version 1.0 [ EB/OL]. [ 1999-11-16]. http://www.w3.org/TR/xpath/.
  • 2BOAG S, CHAMBERLIN D, FERNANDEZ M F, et al. XQuery 1.0: an XML query language [ EB/OL]. [ 2011- 01-03 ]. http:// www.w3. org/TR/xquery/.
  • 3WU X, THEODORATOS D. A survey on XML streaming evaluation techniques [ J]. The VLDB Journal, 2013, 22(2) : 177 - 202.
  • 4PENG F, CHAWATHE S S. XSQ: a streaming XPath engine [ J]. ACM Transactions on Database Systems, 2005, 30(2):577 -623.
  • 5JOSIFOVSKI V, FONTOURA M, BARTA A. Querying XML streams [J]. The VLDB Journal, 2005, 14(2) : 197 -210.
  • 6HAN W S, JIANG H, HO H, et al. StreamTX: extracting tuples from streaming XML data [ J]. Proceedings of the VLDB Endowment, 2008, 1(1) : 289 - 300.
  • 7RYU B G, HA J W, LEE S K. XQStream ++ : fast tuple extraction algorithm for streaming XML data [ J]. Information Sciences, 2015, 314:311 -326.
  • 8BROWNELL D, MEGGINSON D. SAX: simple API for XML [ EB/ OL]. [2004-04-27]. http://www.saxproject.org/.
  • 9Department of Computer and Information Science. What are real DTDs like [ R]. Philadelphia: University of Pennsylvania, 2002: 17.
  • 10BONCZ P, GRUST T, VAN KEULEN M, et al. MonetDB/ XQuery: a fast XQuery processor powered by a relational engine [ C] // Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data. New York: ACM, 2006:479 - 490.

同被引文献26

引证文献2

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部