摘要
随着高通量技术的发展,生物数据大爆发式地增长。如何有效地利用生物大数据成为现代生物学的机遇和挑战。大数据和传统数据相比,呈现出很多不同的特点,包括常被提到的3个v(volume,variety,velocity即数据量的巨大、数据类型的多样和数据采集和处理的快速)。本文针对生物医学研究,详细介绍了大数据的杂乱性、可重复利用性、开放性等几个特点。同时结合微生物组学在元分析方面的最新进展,并用实例来阐述了我们在大数据采集方面应该有前瞻性的考虑,提出了在数据管理上如何保护隐私的挑战,探讨了对大数据进行分析的工具和方法。
With the development of high-throughput technologies, biomedical data has been increasing exponentially in an explosive manner. This brings enormous opportunities and challenges to biomedical researchers on how to effectively utilize big data. Big data is different from traditional data in many ways, described as "3Vs" - volume, variety and velocity. From the perspective of biomedical research, here I introduced the characteristics of big data, such as its messiness, re-usage and openness. Focusing on microbiome research of meta-analysis, the author discussed the prospective principles in data collection, challenges of privacy protection in data management, and the scalable tools in data analysis with examples from real life.
出处
《南方医科大学学报》
CAS
CSCD
北大核心
2015年第2期159-162,共4页
Journal of Southern Medical University