期刊文献+

分布式XML Twig查询处理方法 被引量:1

Distributed XML Twig query processing method
下载PDF
导出
摘要 在单机环境下,难以处理半结构化XML大数据查询,为此分析Twig查询的结构匹配特征,基于MapReduce编程模型,提出TwigMRR算法对XML Twig查询进行分布式处理。对XML数据进行Dewey编码,水平切分后存储于分布式文件系统,通过执行Map-Reduce-Reduce任务对Twig分解后的线性路径查询进行分布式并行计算以取得结果。实验结果验证了该算法的有效性和完整性,与类似算法的比较结果表明了其在处理效率方面的优势。 To deal with the difficulties of large-scale XML query processing in single computer unit,Twig query structure matching features were analyzed,TwigMRR algorithm was proposed for XML Twig queries distributed processing based on MapReduce program model.XML data were stored into distributed file system after encoded using Dewey and partitioned horizontally.Map-Reduce-Reduce tasks were executed to evaluate the linear path queries generated with the divided Twig.The experimental results show that the proposed approach is efficient and scalable,and it is more effective than other related works.
出处 《计算机工程与设计》 北大核心 2016年第1期123-126,210,共5页 Computer Engineering and Design
基金 北京市自然科学基金项目(4122011) 河北省教育厅青年基金项目(QN2014178) 北华航天工业学院科研基金项目(KY-2014-09) 校级科技创新团队基金项目(XJTD20140)
关键词 分布式计算 TWIG查询 MAPREDUCE模型 XML数据 HADOOP平台 distributed computing Twig query MapReduce model XML data Hadoop platform
  • 相关文献

参考文献11

  • 1Dean J,Ghemawat S.MapReduce:Simplified data processing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
  • 2Kling P,zsu MT,Daudjee K.Generating efficient execution plans for vertically partitioned XML databases[C]//Proceedings of the VLDB Endowment,2010:1-11.
  • 3Cong G,Fan W,Kementsietsidis A,et al.Partial evaluation for distributed XPath query processing and beyond[J].ACM Transactions on Database Systems,2012,37(4):1-43.
  • 4Choi H,Lee KH,Kim SH,et al.HadoopXML:A suite for parallel processing of massive XML data with multiple twig pattern queries[C]//Proceedings of the 21st ACM International Conference on Information and Knowledge Management,2012:2737-2739.
  • 5Bidoit N,Colazzo D,Malla N,et al.Processing XML queries and updates on map/reduce clusters[C]//Proceedings of the16th International Conference on Extending Database Technology,2013:745-748.
  • 6Khatchadourian S,Mariano C,Siméon J.ChuQL:Processing XML with XQuery using Hadoop[C]//Proceedings of Conference of the Center for Advanced Studies on Collaborative Research,2011:74-83.
  • 7Fegaras L,Li C,Gupta U,et al.XML query optimization in Map-Reduce[C]//Proceedings of 14th International Workshop on the Web and Databases,2011.
  • 8Damigos M,Gergatsoulis M,Plitsos S.Distributed processing of XPath queries using MapReduce[J].New Trends in Databases and Information Systems,2014,241:69-77.
  • 9Bruno N,Koudas N,Srivastava D.Holistic twig joins:Optimal XML pattern matching[C]//Proceedings of SIGMOD,2002:310-321.
  • 10White T.Hadoop:The definitive guide[M].2nd ed.Sebastopol:O'Reilly Media/Yahoo Press,2010:9-12.

同被引文献5

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部