Abstract
Omics big data is a foundational and strategic national resource that plays an important role in supporting basic research and applied innovation in the life sciences, promoting the innovative development of the bioeconomy, and safeguarding national security. As the volume of omics data keeps growing, the problem of managing it securely has become increasingly prominent. To meet the major strategic needs of China's population health and sustainable social development, the National Genomics Data Center (NGDC) has established a research system for the submission and storage, security management, open sharing, and integrative mining of life and health big data, and has developed a series of policies and measures for data security management. This paper focuses on security management across the full lifecycle of omics big data: it discusses a security management framework for omics big data, analyzes the security management issues involved in data submission, storage, management, and sharing, and summarizes NGDC's achievements in this area. Finally, it outlines future directions for the security management of omics big data, including improving the data classification and grading system, advancing technologies for tiered data security management, and strengthening off-site disaster recovery, with the aim of achieving secure management and sustainable development of omics big data.
Authors
WANG YanQing (王彦青)
CHEN TingTing (陈婷婷)
ZHANG SiSi (张思思)
ZHU JunWei (朱军伟)
CHEN HuanXin (陈焕新)
XIAO JingFa (肖景发)
SONG ShuHui (宋述慧)
ZHANG Zhang (章张)
ZHAO WenMing (赵文明)
BAO YiMing (鲍一明)
National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China; Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
Source
Journal of Agricultural Big Data (《农业大数据学报》)
2024, Issue 3, pp. 325-332 (8 pages)
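Funding
National Key Research and Development Program of China (2023YFC2605700, 2023YFC2604400, 2021YFF0703704)
Operation and Maintenance of the Genomics Data Center, Chinese Academy of Sciences (CAS-WX2022SDCXK05)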