一种基于YFilter的XML数据流查询的改进算法

An extended algorithm based YFilter of XML stream querying

导出

摘要利用XML文档中各路径之间相似的特点,只扫描一次XML文档,把重复的XML标记所生成的事件组合成一个聚合事件,并在基于共享前缀的NFA算法YFilter基础上,提出一种改进算法PolYFilter,实现了聚合事件的谓词计算.实验表明,与YFilter相比,PolYFilter算法减少了大量相同的有限自动机状态转移的中间状态,避免了状态集的重复计算.特别是当XML文档比较大,且重复标记比较多的时候,PolYFilter有较好的过滤性能. There are similarities between the various paths in the XML document,so an improved algorithm PolYFilter is proposed based on YFilter of NFA algorithm of shared prefix.PolYFilter combines all of the same events into a polymeric event running through the XML parser at most once,and it also implement evaluating predicate on polymeric event.The experiment shows that the improved PolYFilter can reduce numbers of operations in the set of intermediate states.Particularly,the algorithm is efficient to deal with large XML documents that have more similar events.

作者蔡俊仁俞建家

机构地区福州大学数学与计算机科学学院

出处《福州大学学报（自然科学版）》 CAS CSCD 北大核心 2010年第6期824-829,共6页 Journal of Fuzhou University(Natural Science Edition)

关键词数据流查询 XML文档聚合事件算法 data stream querying XML document polymeric event algorithm

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献11

1杨卫东,施伯乐.XML流管理研究综述[J].计算机研究与发展,2009,46(10):1721-1728. 被引量：8
2Altinel M, Franklin M J. Efficient filtering of XML documents for selective dissemination of information[ C]//Proceedings of the 26th VLDB Conference. Cairo : [ s. n. ], 2000 : 53 - 64.
3Diao Y, Fischer P, Franklin M J, et al. YFilter : efficient and scalable filtering of XML documents[ C]//ICDE. 2002 : 341 - 345.
4Diao Y, Altinel M, Franklin M J, et al. Path sharing and predicate evaluation for high -performance XML filtering[ J]. ACM Trans on Database Systems, 2003, 28 (4): 467-516.
5Chan C, Felber P, Garofalakis M N, et al. Efficient filtering of XML documents with XPath expressions [ C ]//ICDE. 2002: 235 - 244.
6Gupta A, Suciu D. Stream processing of XPath queries with predicates[ C] //SIGMOD. 2003:419-430.
7Green T, Gupta A, Miklau G, et al. Processing XML streams with deterministic automata and stream index[J]. ACM Trans on Database Systems, 2004, 29 (4) : 752 -788.
8高军,杨冬青,唐世渭,王腾蛟.基于树自动机的XPath在XML数据流上的高效执行[J].软件学报,2005,16(2):223-232. 被引量：33
9International Press Telecommunications Council (IPTC). MTF DTD [ EB/OL]. [ 2000 - 09 - 21 ]. http ://xml. coverpages. org/nitf.html.
10Busse R, Carey M, Florescu D, et al. Xmark: a benchmark for XML data management[ C ]//Proceedings of the International Conference on Very Large Data Bases (VLDB'02). Hong Kong: China Morgan Kaufmann, 2002:974 -985.

二级参考文献43

1高军,杨冬青,唐世渭,王腾蛟.一种基于DTD的XPath逻辑优化方法[J].软件学报,2004,15(12):1860-1868. 被引量：17
2高军,杨冬青,唐世渭,王腾蛟.基于树自动机的XPath在XML数据流上的高效执行[J].软件学报,2005,16(2):223-232. 被引量：33
3杨卫东,王清明,施伯乐.针对XML流数据的复杂Twig Pattern查询处理[J].软件学报,2007,18(4):893-904. 被引量：9
4Babcock B, Babu S, Datar M, et al. Models and issues in data slreams [C] //Popa L. Proc of the 21st ACM SIGACTSIGMOD-SIGART Syrup on Principles of Database Systems. New York: ACM, 2002: 1-16.
5Altinel M, Franklin M J. Efficient filtering of XML documents for selective dissemination of information [C] // Proc of VLDB 2000. San Francisco, CA: Morgan Kaufmann, 2000:53-64.
6Yanlei Diao, Mehmet Altinel, et al. Path sharing and predicate evaluation for high-performance XML filtering [J]. ACM Trans on Database System, 2003, 28(4): 467-516.
7Green T J, Miklau G, Onizuka M, et al. Processing XML streams with deterministic automata and stream indexes [J]. ACM Trans on Database Systems (TODS), 2004, 17(4): 752-788.
8Feng Peng, Sudarshan S, Chawathe. XSQ: A streaming XPatb engine [J]. ACM Trans on Database Systems, 2005, 30(2) : 577-623.
9Su Hong, Jian Jinhui, Rundensteiner Elke A. RAINDROP: A uniform and layered algebraic framework for XQueries on XML streams [C] //Proc of CIKM'03. New York: ACM, 2003, 279-286.
10Joonho Kwon, Praveen Rao, Bongki Moon, et al. FIST: Scalable XML document filtering by sequencing twig patterns [C]//Proc of VLDB. New York: ACM, 2005:217-228.

共引文献37

1崔屹.基于XML数据查询问题的研究[J].辽宁大学学报（自然科学版）,2006,33(1):89-92. 被引量：1
2杜成龙,关佶红,王治.GML空间数据流压缩算法研究[J].计算机工程,2007,33(1):98-100. 被引量：4
3杨卫东,王清明,施伯乐.针对XML流数据的复杂Twig Pattern查询处理[J].软件学报,2007,18(4):893-904. 被引量：9
4郑祥毅,翟磊,陈继明,江曼,潘金贵.XML路由算法分类研究[J].计算机科学,2007,34(4):95-99.
5桂智明,廖湖声.基于扩展XQuery引擎的空间数据流查询方法研究[J].计算机应用研究,2007,24(12):72-73.
6梁冰,刘群.基于自动机模型数据关联性能评估算法[J].电子科技大学学报,2008,37(4):606-609. 被引量：1
7王宏志,李建中,骆吉洲.XML数据流上的高效聚集算法[J].软件学报,2008,19(8):2032-2042. 被引量：2
8杨卫东,王清明,施伯乐.XML流数据查询结果的缓存管理[J].软件学报,2008,19(8):2080-2088. 被引量：3
9张晓琳,李宏辉,崔敏,谭跃生.XML数据流查询处理技术[J].情报杂志,2008,27(9):13-15.
10廖小平,王志坚,刘山.基于XML的发布/订阅型系统中过滤算法的改进[J].电脑开发与应用,2008,21(12):16-17. 被引量：1

1焦亚冰.基于RFID技术的物流信息跟踪系统构建[J].计算机工程与设计,2013,34(10):3690-3694. 被引量：4
2太子.走，到“WORD减肥中心”去[J].软件指南,2005(5):46-46.
3一次性保存多个Word文件[J].电脑爱好者,2013(4):48-48.
4Win7无法连接网络打印机[J].电脑迷,2010(3):90-90.
5陈孟奇,王珒,张祖平.一种基于Bloom Filter算法的广播认证方案[J].现代通信技术,2013(4):23-25.
6邹莉莉.构建高可扩展性的物流装备管理系统[J].现代工业经济和信息化,2016,6(12):103-105.
7John Nofsinger.完美风暴[J].现代制造,2008(27):67-68.
8王春梅.基于Bloom Filter的网络爬虫URL消重算法研究[J].产业与科技论坛,2011,0(18):55-56. 被引量：1
9孙彪,张赛男.亚像素图像ContourExtractor2DimageFilter算法研究[J].通讯世界（下半月）,2015(4):231-231.
10吴年,张昱.带谓词的XPath查询的即时处理[J].计算机工程,2006,32(13):58-60. 被引量：1

福州大学学报（自然科学版）

2010年第6期

浏览历史

内容加载中请稍等...

一种基于YFilter的XML数据流查询的改进算法

参考文献11

二级参考文献43

共引文献37

相关作者

相关机构

相关主题

浏览历史