摘要
目前,国内外已有许多动物基因组学数据库,却还未有专门针对蒙古高原家畜基因组信息构建的数据库。此外,传统的基因组数据库平台一般采用关系型数据库存储数据,但在面对海量的基因组数据时出现了读写性能差、可靠性低、不易扩展等问题。为解决上述问题,收集整合了牛、绵羊、山羊、骆驼等蒙古高原家畜的基因组数据,应用非关系型数据库,设计并实现了基于MongoDB存储架构的蒙古高原家畜基因组大数据管理系统。该系统的实现为蒙古高原家畜分子生物学研究提供了一个良好的数据平台,也解决了海量基因组数据的存储与管理问题。
At present,there are many animal genomics databases at home and abroad,but there is no database specially built for the genome information of livestock on the mongolian plateau.In addition,the traditional genome database platform generally uses relational databases to store data,but in the face of massive genome data,problems such as poor read-write performance,low reliability,and di?cult expansion have emerged,In order to solve the above problems,we collected and integrated the genome data of cattle,sheep,goats,camels and other livestock on the mongolian plateau,and designed and implemented a big data storage system for the genome of livestock on the mongolian plateau based on the MongoDB storage architecture using a non relational database.The implementation of this system provides a good data platform for the molecular biology research of livestock on the Mongolian plateau,and also solves the problem of storage and management of massive genome data.
作者
邬学敏
高静
WU Xuemin;GAO Jing(Department of Computer and Information Engineering,Baotou Vocational and Technical College,Baotou Inner Mongolia 014000;College of Computer and Information Engineering,Inner Mongolia Agricultural University,Hohhot Inner Mongolia 010000)
出处
《软件》
2022年第12期4-8,14,共6页
Software
基金
内蒙古自治区科学技术厅研究课题蒙古高原家畜遗传资源库与信息平台建设及种质资源开发利用(2020ZD0007)
内蒙古自然科学基金全基因组重测序数据的变异检测和转录组差异表达分析的高效并行化算法和软件研究(2019MS03014)。