期刊文献+

大数据处理平台Spark及其生物医学应用 被引量:8

Big Data Processing Platform Spark and Its Biomedical Applications
下载PDF
导出
摘要 随着生命科学和医疗信息化的快速发展,生物医学数据出现了爆炸式增长趋势,其处理面临数据量大、维度关系复杂和交互式响应要求高等问题。传统的数据库以及Hadoop框架在处理生物医学大数据方面都存在一些不足。Spark是一个新兴的基于内存计算的开源大数据平台,具有丰富的编程接口、通用的处理框架和多元化的运行模式。本文介绍了Spark的关键技术和特性,以及不同来源生物医学大数据特点和成功案例,表明Spark在生物医学大数据处理中的适用性和潜在优势。 With the rapid development of life sciences and medical informatization, an explosive growthtrend of biomedical data has appeared, whose processing has the problems of a large mount of data, complexmulti-dimensional relations and high interactive response demands. There are some defects in biomedical bigdata processing by using traditional database and Hadoop. Spark is a novel open-source big data platform basedon memory computation, which has abundant programming interfaces, general processing framework andpluralistic operation modes. This article introduced the key technologies and features of Spark, combinedanalysis of characteristics of biomedical big data and successful cases of Spark, and discussed the applicabilityand potential advantages of Spark in the biomedical big data processing.
出处 《中国中医药图书情报杂志》 2015年第2期1-5,共5页 Chinese Journal of Library and Information Science for Traditional Chinese Medicine
基金 军队后勤科技"十二五"重点项目(BS211R008 BS212J009)
关键词 大数据 SPARK 医学研究 生物医学信息学 big data Spark medical research biomedical informatics
  • 相关文献

参考文献20

  • 1MW Davidson, DA Haim, JM Radin. Using Networks to Combine “BigData” and Traditional Surveillance to Improve InfluenzaPredictions[J/OL].[2015-01-29].http://www.nature.com/srep/2015/150129/srep08154/full/srep08154.html.
  • 2Miguel Helft. 大数据能治愈癌症吗? [EB/OL].[2014-10-29].http://www.fortunechina.com/business/c/2014-10/29/content_225089.htm.
  • 3OBAMA ADMINISTRATION: Big Data Research and DevelopmentInitiative[EB/OL].[2012-03-30].http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release.pdf.
  • 4RC Taylor. An overview of the Hadoop/MapReduce/HBase frameworkand its current applications in bioinformatics[J].BMCBioinformatics,2010,11(Suppl 12):S1.
  • 5MC Schatz. Cloudburst: Highly Sensitive Read Mapping withMapreduce[J]. Bioinformatics,2009,25(11):1363-1369.
  • 6S Leo, F Santoni, G Zanetti. Biodoop: Bioinformatics onhadoop[C]//Proceedings of the 38th International Conference onParallel Processing Workshops (ICPPW 2009). Vienna, Austria:2009:415-422.
  • 7A Matsunaga, M Tsugawa, J Fortes. CloudBLAST: CombiningMapreduce and Virtualization on Distributed Resources forBioinformatics Applications[C]//IEEE 4th InternationalConference on eScience (eScience 2008). Indiana,USA:2008:222-229.
  • 8H Lee, Y Yang, H Chae, et al. BioVLAB-MMIA: A cloud environmentfor microRNA and mRNA integrated analysis (MMIA) on AmazonEC2[J]. IEEE Transactions on NanoBioscience,2012,11(3):266-272.
  • 9AP Heath, M Greenway, R Powell, et al. Bionimbus: a cloud formanaging, analyzing and sharing large genomics datasets[J]. JAm Med Inform Assoc,2014,21(6):969-975.
  • 10J Zhang, DW Chen, JH Zhao, et al. RASS: A Portable Real-timeAutomatic Sleep Scoring System[C]// Proceedings of 2012 IEEE33rd Conference on Real-Time Systems Symposium(RTSS).Washington D.C., USA:2012:105-114.

同被引文献68

引证文献8

二级引证文献49

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部