期刊文献+

公有云中基于系统工作流的并行策略性能评估

Performance evaluation of parallel strategies based on system workflow in public clouds
下载PDF
导出
摘要 分析系统发育基因组工作流程并行处理性能,提出了一种适用于云计算平台中SciPhylomics执行的性能评估工作流程。首先,介绍了映射化简模型的应用实现Hadoop;然后,呈现了SciCumulus云工作流程引擎;最后,在亚马逊EC2云上使用两种并行执行方法(SciCumulus和Hadoop)实施工作流程。实验结果表明,尽管系统发育基因组学实验对计算环境要求严格,但实验仍然适合在云中执行。此外,所评估的工作流程呈现出几组数据密集型工作流程的许多特征,本方法可以扩展到其他实验类型。 The performance of parallel execution of phylogenomic tree is studied.A performance evaluation for SciPhylomics exe-cutions in a real cloud environment is proposed.Firstly,the Hadoop,a MapReduce model implementation is introduced.Then, the SciCumulus workflow engine is explained.Finally,the workflow is executed using two parallel execution approaches (SciCu-mulus and Hadoop)at the Amazon EC2 cloud.The experiment results demonstrate that the bioinformatics experiment is suitable to be executed in the cloud despite its need for high performance capabilities.Many features of the evaluated workflow are same as other data intensive workflows.Thus,proposed method could be used to analyze other experiments.
出处 《中国科技论文》 CAS 北大核心 2014年第10期1091-1098,共8页 China Sciencepaper
基金 浙江省教育技术研究规划课题(JB083) 浙江中烟工业有限责任公司杭州卷烟厂核心业务课题(8100375)
关键词 公有云 系统工作流程 并行策略 系统发育基因组学 分布式文件系统 public clouds system workflow parallel strategies phylogenomic Hadoop
  • 相关文献

参考文献14

  • 1Tablan V, Roberts I, Cunningham H, et al. GATE- Cloud. net: a platform for large-scale, open-source text processing on the cloud [J] Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 2013, 371 ( 1983 ): article No. 20120071.
  • 2李乔,郑啸.云计算研究现状综述[J].计算机科学,2011,38(4):32-37. 被引量:432
  • 3秦秀磊,张文博,魏峻,王伟,钟华,黄涛.云计算环境下分布式缓存技术的现状与挑战[J].软件学报,2013,24(1):50-66. 被引量:75
  • 4Mateescu G, Gentzsch W, Ribbens C J. Hybrid com- putin:where HPC meets grid and cloud computing [J]. Future Generation Computer Systems, 2011, 27 (5) : 440-453.
  • 5Hartman A L, Riddle S, McPhillips T, et al. Introdu- cing WATERS: a workflow for the alignment, taxono- my, and ecology of ribosomal sequences[J]. BMCbioinformatics, 2010, 11(1): 317-327.
  • 6Wu M, Scott A J. Phylogenomic analysis of bacterial and archaea sequences with AMPHORA2 [J]. Bioin- formatics, 2012, 28(7): 1033-1034.
  • 7Lord E, Leclercq M, Boc A, et al. Armadillo 1.1: an original workflow platform for designing and conducting phylogenetic analysis and simulations [J] PloS one, 2012, 7(1): 456-473.
  • 8Moustafa A, Bhattacharya D, Allen A E. iTree: a high-throughput phylogenomic pipeline [C]//5th Cairo International Biomedical Engineering Conference. Cai- ro, E:vDt, 2010: 103-107.
  • 9柴学智,曹健.面向云计算的工作流技术[J].小型微型计算机系统,2012,33(1):90-95. 被引量:37
  • 10江小平,李成华,向文,张新访.云计算环境下朴素贝叶斯文本分类算法的实现[J].计算机应用,2011,31(9):2551-2554. 被引量:21

二级参考文献150

  • 1杜晓丽,蒋昌俊,徐国荣,丁志军.一种基于模糊聚类的网格DAG任务图调度算法[J].软件学报,2006,17(11):2277-2288. 被引量:48
  • 2王勇,胡春明,杜宗霞.服务质量感知的网格工作流调度[J].软件学报,2006,17(11):2341-2351. 被引量:60
  • 3林伟伟,齐德昱,李拥军,王振宇,张志立.树型网格计算环境下的独立任务调度[J].软件学报,2006,17(11):2352-2361. 被引量:29
  • 4DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters [ J] // Communications of the ACM: 50th anniversary issue, 2008, 51(1): 107-113.
  • 5Apache Hadoop. Hadoop[ EB/OL]. [2011-03- 15]. http://hadoop. apache, org.
  • 6CHU C-T, KIM S K, LIN Y-A, et al. Map-reduce for machine learning on multicore[ C]// NIPS 2006: Proceedings of Neural Information Processing Systems Conference. Cambridge, MA: MIT, 2006:281-288.
  • 7JASON D, LAWRENCE S, JAIME T, et al. Tracking the poor assumptions of Naive Bayes text classifiers[ C]// ICML 2003: Proceedings of the Twenty International Conference on Machine Learning. Washington, DC: [s. n. ], 2003:616-693.
  • 8中国科学院计算技术研究所.ICTCLAS汉语分词系统【EB/OL】.[2011-02—16】.http://ictclas.org/.
  • 9University of Waikato. Weka 3: data mining software in Java [ EB/ OL]. [2011 -03 - 15]. http://www, cs. waikato, ac. nz/ml/weka/.
  • 10WEGENER D, MOCK M, ADRANALE D, et al. Toolkit-based high-performance data mining of large data on MapReduce clusters [ C]// ICDM: IEEE International Conference on Data Mining. Washington, DC: IEEE Computer Society, 2009:296 -301.

共引文献598

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部