期刊文献+

基于全级C阶矩模型并行流数预测的广域大数据吞吐量优化 被引量:2

Wide Area Big Data Throughput Optimization Based on Full C-order Moment Model with Parallel Flow Prediction
下载PDF
导出
摘要 针对传统大数据密集型的可扩展计算系统在数据源利用和数据传输方面效率不高的问题,提出基于并行流数预测的应用层吞吐量优化模型。为提高并行流数预测精度,以提高瓶颈链路的利用效率为目的,设计等效并行流数选取方式。借鉴部分C阶矩模型和完全二阶矩模型,构建全级C阶矩模型,并且设计低采样吞吐量优化框架,降低计算复杂度。在不同大小数据集上的实验结果表明,全级C阶矩并行流数的预测模型更适合大数据传输,且效率更高。 In order to solve the problem of low efficiency in the use of data sources and data transmission of the traditional data intensive scalable computing systems, this paper proposes a big data throughput optimization algorithm based on full C-order moment model with parallel flow prediction. To improve the prediction accuracy of parallel flow, it takes the utilization efficiency improvement of the bottleneck link for the purpose, and designs the method of equivalent parallel flow algorithm. According to the partial C-order moment model and full second-order moment model, it constructs the full C-order moment model, and designs the low sample throughput optimization framework, which can reduce the computational complexity. Experimental results on the data set with different size show that the full C-order moment model with parallel flow is more suitable for the transmission of big data, and more efficient.
作者 李芝 龙敏
出处 《计算机工程》 CAS CSCD 北大核心 2016年第4期295-300,306,共7页 Computer Engineering
基金 湖南省自然科学基金资助项目(2015JJ2007) 湖南省研究生科研创新基金资助项目(CX2013B376)
关键词 C阶矩模型 二阶矩模型 大数据 并行流数预测 吞吐量 C-order moment model second-order moment model big data parallel flow prediction throughput
  • 相关文献

参考文献13

  • 1Gu Lin.Zeng Dcze. Li Peng. Cost Minimization for Big Data Processing in Geo-distributed Data Centers[J]. IEEE Transactions on Emerging Topics in Computilag, 2014,2(3) :314-323.
  • 2金澈清,钱卫宁,周敏奇,周傲英.数据管理系统评测基准:从传统数据库到新兴大数据[J].计算机学报,2015,38(1):18-34. 被引量:68
  • 3张滨,乐嘉锦.基于列存储的MapReduce并行连接算法[J].计算机工程,2014,40(8):70-75. 被引量:5
  • 4Sandryhaila A, Moura J M. Big Data Analysis with Signal Processing on Graphs: Representation and Processing of Massive Data Sets with Irregular Structure[J]. IEEE Signal Processing Magazine,2014, 31(5) :80-90.
  • 5Gunay C, Edgerton J R, Li Su. Database Analysis of Simulated and Recorded Electrophysiological Datasets with PANDORA' s Toolbox [J].Neuroinformatics, 2009,7(2) :93-111.
  • 6Rathinasamy S, Raju R. Sequencing and Scheduling of Nonuniform Flow Pattern in Parallel Hybrid Flow Shop[J]. International Journal of Advanced Manu- facturing Technology, 2010,49 ( 1 ) : 213-225.
  • 7Yin Dengpan,Yildirim E, Kosar T. A Data Throughput Prediction and Optimization Service for Widely Distributed Many-task Computing [ J ]. IEEE Transac- tions on Parallel & Distributed Systems, 2011,22 ( 6 ) : 899 -909.
  • 8Yildirim E, Yin Dengpan, Kosar T. Prediction of Optimal Parallelism Level in Wide Area Data Trans- fers[J]. IEEE Transactions on Parallel & DistributedSystems ,2011,22 ( 12 ) :765 -779.
  • 9Hacker T J, Athey B D, Noble B. The End-to-End Performance Effects of Parallel TCP Sockets on a Lossy Wide Area Network [ C]//Proceedings of the 16th International Parallel and Distributed Processing Symposium. Washington D. C. , USA : IEEE Press ,2002 : 314-322.
  • 10Kola G,Vernon M K. Target Bandwidth Sharing Using Endhost Measures [ J ]. Performance Evaluation, 2007, 64(9-12) :948-964.

二级参考文献19

  • 1Dean J,Ghemawat S.MapReduce:Simplified Data Processing on Large Clusters[C]//Proc.of OSDI' 04.San Francisco:[s.n.],2004:137-150.
  • 2Abadi D J,Madden S R,Hachem N.Column-stores vs.Row-stores:How Different Are They Really?[C]// Proc.of ACM SIGMOD' 08.Vancouver,Canada:ACM Press,2008:967-980.
  • 3Stonebraker M,Abadi D J,Batkin A,et al.C-store:A Column-oriented DBMS[C]//Proc.of VLDBConference.Trondheim,Norway:[s.n.],2005:553-564.
  • 4Boncz P,Zukowski M,Nes N.MonetDB/X100:Hyperpipelining Query Execution[C]//Proc.of CIDR' 05.Asilomar,USA:ACM Press,2005:251-264.
  • 5Blanas S,Patel J M,Ercegovac V,et al.A Comparison of Join Algorithms for Log Processing in MapReduce[C]//Proc.of ACM SIGMOD International Conference on Management of Data.Indianapolis,USA:ACMPress,2010:975-986.
  • 6Abouzeid A,Bajda-Pawlikowski K,Abadi D J,et al.HadoopDB:An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads[C]//Proc.of VLDB Conference.Lyon,France:[s.n.],2009:922-933.
  • 7Bajda-Pawlikowski K,Abadi D J,Silberschatz A,et al.Efficient Processing of Data Warehousing Queries[C]// Proc.of ACM SIGMOD International Conference on Management of Data.Athens,Greece:ACM Press,2011:1165-1176.
  • 8Lin Yuting,Agrawal D,Chen Chun,et al.Llama:Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework[C]//Proc.of ACM SIGMOD International Conference on Management of Data.Athens,Greece:ACM Press,2011:961-972.
  • 9Floratou A,Patel J M,Shekita E J.Column-oriented Storage Techniques for MapReduce.The VLDB Journal,2011,4(7):419-429.
  • 10ThusooA,Sarma J S,Jain N,et al.Hive--A Warehousing Solution over a Map-reduce Framework[C]//Proc.of VLDB Conference.Lyon,France:[s.n.],2009:1626-1629.

共引文献71

同被引文献17

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部