摘要
随着生物信息学的不断发展,生物医学领域积累了大量的数据,大数据已经贯穿基础研究、临床诊断、医药开发、健康管理等生物医学领域的各个环节。如何有效存储、管理、分析这些海量数据面临严峻的而挑战。基于超级计算机的计算分析和存储能力,在生物医学大数据处理的异构融合架构,面向生物医学大数据的层次式存储系统,生物医学大数据处理的异构并行计算和多源数据的汇聚机制与分析方法,突破生物医学大数据的汇聚、存储、分析等方面的关键技术,构建一个计算、分析处理和存储融合平台,以满足多种类型生物医学大数据应用的不同需求。
With the rapid advancement of bioinformatics, a large volume of data has been accumulated in the biomedical field. Big data has been incorporated into all aspects related to biomedical science, including academic research, clinical diagnostics, pharmaceutical development, health management, etc. However, the storage, management, and analysis of such large amounts of data confront with tremendous challenges. In this work, we propose to build a general-purpose platform to tackle with biomedical big data. Theresearch issues involved in our work includes: The heterogeneous incorporated platform aiming at biomedical big data analytics based on Tianhe-2; The hierarchical storage system for biomedical big data; The parallel processing of biomedical big data on heterogeneous infrastructure; The aggregation and analytics of multi-source biomedical big data. Through the research works mentioned above, we expect to propose some key technologies and solutions for the aggregation, storage, analytics of biomedical big data, and build an incorporated platform based on Tianhe II to meet different requirements of a variety of biomedical big data applications.
作者
卢宇彤
陈志广
杜云飞
Lu Yutong Chen Zhiguang Du Yunfei(National Supercomputer Center in GuangZhou, Sun Yat-sen University, Guangzhou, Guangdong 510006, China)
出处
《科研信息化技术与应用》
2017年第1期3-9,共7页
E-science Technology & Application
基金
国家自然科学基金重点项目(U1611261
U1611263)
关键词
融合大数据平台
生物医药数据
异构并行编程
Hybrid bigdata platform
Biomedical data
Heterogeneous parallel programming