期刊文献+

XPSort——树形数据多核并行外存排序算法 被引量:1

XPSort:A Multi-core Parallel Sorting Algorithm for Hierarchical Data in External Memory
下载PDF
导出
摘要 XML数据处理中一个基本问题是树形数据排序.本文针对已有算法的不足提出了一种XML文档多核并行外存排序算法——XPSort.XPSort扫描XML文档产生相互独立的排序任务,利用多核CPU对任务进行并行处理;同时,利用数据压缩、单临时文件以及避免子树匹配等策略,有效地减少磁盘I/O,提高排序性能;它克服了NEXSORT算法没能有效利用内存空间、存在大量随机I/O的问题以及难以处理"右深树"的缺陷,也克服了HERMES的数据冗余、大量磁盘开销等缺点.文章对不同特性的XML文档开展了大量比较实验,结果表明XPSort优于已有算法,所提优化方法是有效可行的. A fundamental problem in XML data handling is hierarchical data sorting .This paper focuses on this problem and proposes an effective sorting algorithm called XPSort for XML document .XPSort exploits multi-core CPU to parallelize the execu-tions of the mutually independent tasks generated by scanning the XML document ;and effectively reduces disk I/Os through data compression ,single temporary-file storage and avoidance of tree-matching .XPSort overcomes the issues of inefficient space utiliza-tion ,large number of random I/Os and inability to handle right-deep tree in NEXSORT ,and avoids data redundancy and large disk I/O costs in HERMES .Extensive experiments on XML documents with different characteristics show that XPSort outperforms other existing sorting algorithms .
出处 《电子学报》 EI CAS CSCD 北大核心 2014年第2期292-300,共9页 Acta Electronica Sinica
基金 国家自然科学基金(No.61070042) 浙江省自然科学基金(No.Y1090096 No.Y13F020110)
关键词 XML文档 树形数据 排序算法 并行算法 XML document hierarchical data sorting algorithm parallel algorithm
  • 相关文献

参考文献16

  • 1G Graefe. Implementing sorting in database systems[ J]. ACM Computing Survey, 2006,38 (3) : 1 - 37.
  • 2M-C Albutiu, A Kemper, T Neumann. Massively parallel sort- merge joins in main memory multi-core database systems[ J ]. PVLDB,2012,5(10) : 1064 - 1075.
  • 3徐海渊,吴泉源,王怀民,贾焰.基于相容关系的XML索引机制[J].电子学报,2003,31(8):1155-1159. 被引量:3
  • 4衡星辰,覃征,邵利平,曹玉辉,高洪江.基于两阶段查询重写的XML近似查询算法[J].电子学报,2007,35(7):1271-1278. 被引量:6
  • 5P Buneman, et al. Archiving scientific data[J] .ACM Transac- tions on Database Systems,2004,29(1):2- 42.
  • 6P Buneman, et al. Keys for XML[ J]. Computer Networks, 2002,39 (5) : 473 - 487.
  • 7S Chawathe,et al. Change detection in hierarchically structured information[ A]. Proceedings of the ACM SIGMOD Conference [ C ]. New York: ACM, 1996.493 - 504.
  • 8XQuery 1. O: An XML Query Language[ OL]. http://www. w3. org/TR/xquery.
  • 9A Silberstein, J Yang. NEXSORT: Sorting XML in external memory[ A]. Proceedings of the ICDE Conference[ C]. Piscat- away, NJ: IEEE, 21304. 695 - 707.
  • 10Koltsidas, H Muller, S Viglas. Sorting hierarchical data in extemal memory for archiving[ A]. Proceedings of the VLDB Conference[ C] .New York: ACM,2008,1 ( 1 ) : 1205 - 1216.

二级参考文献47

  • 1回首05多核之路:AMD英特尔Sun的技术攻坚战[EB/OL] http://news.pconline.com.cn/hy/0512/742256.html,2005-12.
  • 2A Agarwal,M Levy.The kill rule for multicore[A].Proc of IEEE Design Automation Conference[C].San Diego,2007.750-753.
  • 3科学家开发千核处理器运算速度提升20倍[EB/OL] http://www.it com.cn/news/cyxw/gjyj/010123016/952063.html,2010-12-30.
  • 4G Seshadri,R Jain,A Mittal.Parallelization of principal component analysis[A].IEEE Advance Computing Conference[C].Patiala,2010.44-49.
  • 5Xian-he Sun,Yong Chen.Reevaluating Amdahl' s law in the multicore era[J].Journal of Parallel and Distributed Computing,2010,70(2):183-188.
  • 6Mark D Hill,Michael R Marty.Amdahl's law in the multicore era[J].Computer,2008,41 (7):33-38.
  • 7J L Gustafson.Reevaluating Amdahl' s law[J].Communications of ACM,1988,31(5):532-533.
  • 8P Chrstie,D Stroobandt.The interpretation and application of Rent's rule[J].IEEE Trans on VLSI Systems,Special Issue on System-Level Interconnect Prediction,2000,8(6):639-648.
  • 9Daniel Greenfield,Amab Banerjee,Jeong-Gun Lee,Simon Moore.Implication of Rent's rule for NOC design and its faulttolerance[J].Networks-on-Chip,2007,7 (9):283-294.
  • 10Wei Huang,Mircea R Stan,Karthik Sankaranarayanan,Robert J Ribando,Kevin Skadron.Many-core design from a thermal perspective[A].IEEE Design Automation Conference[C].Anaheim,2008.7.46-749.

共引文献17

同被引文献15

  • 1Miki Y,Takahashi D,Mori M.Highly scalable implementation of an N-body code on a GPU cluster[J].Computer Physics Communications,2013,184:2159-2168.
  • 2Liu Handan,Tafti D K,Li Tingwen.Hybrid parallelism in MFIX CFD-DEM using Open MP[J].Powder Technology,2014,259:22-29.
  • 3Perla F,Zanetti P.Performance analysis of an hybrid Open MP/MPI ALM software for life insurance policies on multicore architectures[C]//8th International Workshop on Open MP,Rome,Italy,2012:250-253.
  • 4Bull J M,Enright J P,Ameer N.A microbenchmark suite for mixed-mode Open MP/MPI[C]//LNCS 5568:IWOMP 2009.Heidelberg:Springer,2009:118-131.
  • 5Fowler R,Gamblin T,Porterfied A,et al.Performance engineering challenges:the view from RENCI[J].J Physics:Conf Ser,2008,125(1):1-6.
  • 6Fedorova A.Opratings system scheduling for chip multiprocessor architectures[D].[S.l.]:Harvard University,2006.
  • 7Beckmann B M,Wood D A.Managing wire delay in large chip-multiprocessor caches[C]//Proceedings of the37th International Symposium on Microarchitecture,2004:319-330.
  • 8Kanan H,Guo F,Zhao L,et al.From chaos to Qo S:case studies in CMP resource management[J].ACM SIGARCH Computer Architecture News,2007,35(1):21-30.
  • 9Diacu F.The solution of the N-Body problem[J].Mathematical Intelligencer,1996,18(3):66-70.
  • 10Aversa R,Di Martino B,Mazzocca N,et al.Performance analysis of hybrid Open MP/MPI N-Body application[C]//LNCS 3349:WOMPAT,2005:12-18.

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部