使用区间路径处理XML查询

XML query processing using RegionPath

下载PDF

导出

摘要近年来,在XML查询处理方法中发表了一些基于节点流栈连接的高效的分枝连接算法。然而,这些算法普遍存在这样的问题:由于它们必须扫描查询中出现的每一个元素对应的节点流,当XML节点数量很大时,查询处理的输入代价很大,效率变得低下。为了解决这个问题,提出了一个新型的标记法记为区间路径,不同于节点流的区间标记法,区间路径可以把具有相同路径的节点集索引到一个集合中。继而提出了分枝点连接算法用于XML查询处理。同基于节点流栈的分枝连接算法相比,该算法有以下优势:节点集的祖先信息直接位于区间路径中;只有和查询结果相关的节点集会被扫描到,大大降低了输入代价;支持查询通配符;对于类型为根路径的查询,只需一次输入操作代价完成查询处理。实验结果表面该算法在输入代价,执行时间和延展性方面都优于基于节点流的分枝连接算法。 In recent years,a number of stack-based twig join algorithms have been proposed to process XQueries based on region encoded tag streams.However,these algorithms are I/O sensitive and inefficient for large documents because they need to scan and process all nodes whose tags appear in a given query.To address this problem,this paper proposes a novel labeling scheme called RegionPath.Unlike previous schemes for query processing,this scheme can group all nodes with the same path into one label.Based on the labeling,an efficient algorithm,called BranchPointJoin,is proposed to process queries.Compared with existing stack-based algorithms,our algorithm has four main advantages：one is the ancestors of the same structured nodes can be obtained from the labels directly;the second is only part of nodes associated with query are scanned and joined;the third queries with wild-cards are supported;and the forth is only one I/O cost is needed for each root path query.The experimental results on various datasets and queries show that the proposed algorithm outperforms stack based methods on I/O cost,execution time and scalability.

作者张蔚王洪强

机构地区解放军第沈阳军区总医院

出处《信息技术》 2011年第6期105-108,111,共5页 Information Technology

关键词 XML XQUERY 区间路径分枝点连接 XML XQuery RegionPath BranchPointJoin

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献8

1Nicolas Brano, Nick Koudas, Divesh Srivastava. Holistic twig joins: opti- mal xml pattern matching[C]// Proc. of SIGMOD, 2002:310-321.
2Songting Chen, HuaGang Li, Jun' ichi Tatemura, et al. Tvdg2Stack : Bottom-up processing of generalized tree pattern queries over xml doc- uments[C]//Proc. of VLDB, 2006:283-294.
3Ting Chen, Jiaheng Lu, and Tok Wang Ling. On boosting holism in xml twig pattern matching[C]//Proe. of SIGMOD, 2005:455-466.
4Haifeng Jiang, Wei Wang, Hongjun Lu, et al. Holistic twig joins onindexed xml documents[C]//Proc. of VLDB, 2003:273-284.
5Jiaheng Lu, Ting Chen, Tok Wang Ling. Efficient processing of xml twig patterns with parent child edges: a look-ahead approach[C]// Proceedings of CIKM, 2004:533-542.
6Jiaheng Lu, Tok Wang Ling, Chee Yong Chan, et al. From region encoding to extended dewey: On efficient processing of xml twig pat- tern matching[C]//Proc. of VLDB, 2003:193-204.
7Xmark : The xml-benchmark project [EB/OL ]. http ://monpetdb. cwi. nl/xml.
8Dblp dataset [EB/OL]. http ://dblp. uni-trier, de/xml/.

1夏有为,林正浩.逻辑综合中对关键路径处理方法的研究[J].电子设计应用,2005(6):81-82. 被引量：1
2买军,王海涛,陈晖.PMI特权路径处理分析及改进方案设计[J].保密科学技术,2011(10):46-51.
3耿蓉,李喆.AD Hoc网络中的一种QoS-AWARE多径路由协议[J].系统仿真学报,2009(5):1390-1394. 被引量：2
4董军,李娟.P2P智能节点流媒体内容存取技术[J].陕西教育（综合版）,2009(3):26-26.
5朱参世,李响,朱琳.基于流数据分类挖掘算法在入侵检测的应用[J].微计算机信息,2010,26(12):80-81.
6金升平,陈定方.指纹细节匹配的遗传算法[J].交通与计算机,2002,20(2):30-32. 被引量：1
7盘点杀毒软件日常使用的几种技巧[J].计算机与网络,2010(9):33-33.
8王勇,吴昊.PMI中的证书路径处理机制的优化[J].科学技术与工程,2006,6(12):1706-1709.
9蔡健荣,孙海波,李永平,孙力,陆化珠.基于双目立体视觉的果树三维信息获取与重构[J].农业机械学报,2012,43(3):152-156. 被引量：76
10江为强,杨义先,黄正全.基于J2EE体系结构的CA认证系统的研究[J].武汉理工大学学报,2008,30(2):105-109. 被引量：1

<12 >

信息技术

2011年第6期

使用区间路径处理XML查询

参考文献8

相关作者

相关机构

相关主题

使用区间路径处理XML查询

参考文献8

相关作者

相关机构

相关主题

微信扫一扫：分享