期刊文献+
共找到18篇文章
< 1 >
每页显示 20 50 100
GSA-Human:人类遗传资源数据管理的公共系统 被引量:10
1
作者 张思思 陈旭 +16 位作者 陈婷婷 朱军伟 唐碧霞 王安可 董丽莉 张哲文 孙艳玲 俞彩霞 翟爽 孙玉彬 陈焕新 杜政霖 肖景发 章张 鲍一明 王彦青 赵文明 《遗传》 CAS CSCD 北大核心 2021年第10期988-993,共6页
GSA-Human是人类遗传资源数据汇交、存储、管理与共享的数据库系统,可提供人类遗传资源数据的上传、下载、浏览、检索等公共服务,并有效支撑了国家重点研发计划科技项目数据的汇交与管理工作。系统具有符合《中华人民共和国人类遗传资... GSA-Human是人类遗传资源数据汇交、存储、管理与共享的数据库系统,可提供人类遗传资源数据的上传、下载、浏览、检索等公共服务,并有效支撑了国家重点研发计划科技项目数据的汇交与管理工作。系统具有符合《中华人民共和国人类遗传资源管理条例》数据安全管理策略,提供公开访问和受控访问相结合的数据使用模式。公开访问数据允许用户自由下载与获取;受控访问数据采用申请-审核的模式,即需要通过数据管理委员会(Data Access Committee,DAC)的授权方可获得下载和使用权限。系统自上线以来,截至2021年7月,汇集数据总量已超5.27 PB。 展开更多
关键词 人类遗传资源数据管理系统 组学数据 数据汇交 数据共享
下载PDF
数据驱动的公共卫生安全 被引量:1
2
作者 李翠萍 吴林寰 +3 位作者 舒畅 鲍一明 马俊才 宋述慧 《科学通报》 EI CAS CSCD 北大核心 2024年第9期1156-1163,共8页
在现代社会中,数据已成为重要的生产要素和国家基础性战略资源,是维护国家生物安全与社会稳定的利器[1].正确处理和利用数据,有助于我们及时、快速、有效地发现和解决问题,进一步维护社会的和谐稳定发展.在公共卫生安全领域,公共卫生安... 在现代社会中,数据已成为重要的生产要素和国家基础性战略资源,是维护国家生物安全与社会稳定的利器[1].正确处理和利用数据,有助于我们及时、快速、有效地发现和解决问题,进一步维护社会的和谐稳定发展.在公共卫生安全领域,公共卫生安全事件成为21世纪人类生存面临的非传统安全威胁之一,特别是新发再发传染病对全球公共卫生和社会经济安全构成严重威胁[2,3]. 展开更多
关键词 公共卫生安全 数据驱动 非传统安全威胁 全球公共卫生 和谐稳定发展 传染病 21世纪 基础性
原文传递
Plant genomic resources at National Genomics Data Center:assisting in data-driven breeding applications
3
作者 Dongmei Tian Tianyi Xu +14 位作者 Hailong Kang Hong Luo Yanqing Wang Meili Chen Rujiao Li Lina Ma Zhonghuang Wang Lili Hao Bixia Tang Dong Zou Jingfa Xiao Wenming Zhao yiming bao Zhang Zhang Shuhui Song 《aBIOTECH》 EI CAS CSCD 2024年第1期94-106,共13页
Genomic data serve as an invaluable resource for unraveling the intricacies of the higher plant systems,including the constituent elements within and among species.Through various efforts in genomic data archiving,int... Genomic data serve as an invaluable resource for unraveling the intricacies of the higher plant systems,including the constituent elements within and among species.Through various efforts in genomic data archiving,integrative analysis and value-added curation,the National Genomics Data Center(NGDC),which is a part of the China National Center for Bioinformation(CNCB),has successfully established and currently maintains a vast amount of database resources.This dedicated initiative of the NGDC facilitates a data-rich ecosystem that greatly strengthens and supports genomic research efforts.Here,we present a comprehensive overview of central repositories dedicated to archiving,presenting,and sharing plant omics data,introduce knowledgebases focused on variants or gene-based functional insights,highlight species-specific multiple omics database resources,and briefly review the online application tools.We intend that this review can be used as a guide map for plant researchers wishing to select effective data resources from the NGDC for their specific areas of study. 展开更多
关键词 Plant-omics data Data repositories Data integration KNOWLEDGEBASE Plant genomics
原文传递
Database Commons:A Catalog of Worldwide Biological Databases 被引量:2
4
作者 Lina Ma Dong Zou +7 位作者 Lin Liu Huma Shireen Amir AAbbasi Alex Bateman Jingfa Xiao Wenming Zhao yiming bao Zhang Zhang 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2023年第5期1054-1058,共5页
Biological databases serve as a global fundamental infrastructure for the worldwide scientific community,which dramatically aid the transformation of big data into knowledge discovery and drive significant innovations... Biological databases serve as a global fundamental infrastructure for the worldwide scientific community,which dramatically aid the transformation of big data into knowledge discovery and drive significant innovations in a wide range of research fields.Given the rapid data production,biological databases continue to increase in size and importance.To build a catalog of worldwide biological databases,we curate a total of 5825 biological databases from 8931 publications,which are geographically distributed in 72 countries/regions and developed by 1975 institutions(as of September 20,2022).We further devise a z-index,a novel index to characterize the scientific impact of a database,and rank all these biological databases as well as their hosting institutions and countries in terms of citation and z-index.Consequently,we present a series of statistics and trends of worldwide biological databases,yielding a global perspective to better understand their status and impact for life and health sciences.An up-to-date catalog of worldwide biological databases,as well as their curated meta-information and derived statistics,is publicly available at Database Commons(https://ngdc.cncb.ac.cn/databasecommons/). 展开更多
关键词 Biological database CATALOG Database Commons CITATION z-index
原文传递
On the collection and integration of SARS-CoV-2 genome data 被引量:1
5
作者 Lina Ma Wei Zhao +4 位作者 Tianhao Huang Enhui Jin Gangao Wu Wenming Zhao yiming bao 《Biosafety and Health》 CAS CSCD 2023年第4期204-210,共7页
Genome data of severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)is essential for virus diagnosis,vaccine development,and variant surveillance.To archive and integrate worldwide SARS-CoV-2 genome data,a serie... Genome data of severe acute respiratory syndrome coronavirus 2(SARS-CoV-2)is essential for virus diagnosis,vaccine development,and variant surveillance.To archive and integrate worldwide SARS-CoV-2 genome data,a series of resources have been constructed,serving as a fundamental infrastructure for SARS-CoV-2 research,pandemic prevention and control,and coronavirus disease 2019(COVID-19)therapy.Here we present an over-view of extant SARS-CoV-2 resources that are devoted to genome data deposition and integration.We review deposition resources in data accessibility,metadata standardization,data curation and annotation;review integrative resources in data source,de-redundancy processing,data curation and quality assessment,and variant annotation.Moreover,we address issues that impede SARS-CoV-2 genome data integration,including low-complexity,inconsistency and absence of isolate name,sequence inconsistency,asynchronous update of genome data,and mismatched metadata.We finally provide insights into data standardization consensus and data submission guidelines,to promote SARS-CoV-2 genome data sharing and integration. 展开更多
关键词 SARS-CoV-2 resource Genome data Data deposition Data integration Data curation
原文传递
RCoV19:A One-stop Hub for SARS-CoV-2 Genome Data Integration,Variant Monitoring,and Risk Pre-warning
6
作者 Cuiping Li Lina Ma +9 位作者 Dong Zou Rongqin Zhang Xue Bai Lun Li Gangao Wu Tianhao Huang Wei Zhao Enhui Jin yiming bao Shuhui Song 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2023年第5期1066-1079,共14页
The Resource for Coronavirus 2019(RCoV19)is an open-access information resource dedicated to providing valuable data on the genomes,mutations,and variants of the severe acute respiratory syndrome coronavirus 2(SARS-Co... The Resource for Coronavirus 2019(RCoV19)is an open-access information resource dedicated to providing valuable data on the genomes,mutations,and variants of the severe acute respiratory syndrome coronavirus 2(SARS-CoV-2).In this updated implementation of RCoV19,we have made significant improvements and advancements over the previous version.Firstly,we have implemented a highly refined genome data curation model.This model now features an automated integration pipeline and optimized curation rules,enabling efficient daily updates of data in RCoV19.Secondly,we have developed a global and regional lineage evolution monitoring platform,alongside an outbreak risk pre-warning system.These additions provide a comprehensive understanding of SARS-CoV-2 evolution and transmission patterns,enabling better preparedness and response strategies.Thirdly,we have developed a powerful interactive mutation spectrum comparison module.This module allows users to compare and analyze mutation patterns,assisting in the detection of potential new lineages.Furthermore,we have incorporated a comprehensive knowledgebase on mutation effects.This knowledgebase serves as a valuable resource for retrieving information on the functional implications of specific mutations.In summary,RCoV19 serves as a vital scientific resource,providing access to valuable data,relevant information,and technical support in the global fight against COVID-19.The complete contents of RCoV19 are available to the public at https://ngdc.cncb.ac.cn/ncov/. 展开更多
关键词 SARS-CoV-2 Mutation VARIANTS Surveillance Pre-warning
原文传递
From BIG Data Center to China National Center for Bioinformation
7
作者 yiming bao Yongbiao Xue 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2023年第5期900-903,共4页
Background Big data challenges In the late 1980s and early 1990s,three major international biological data centers were created:the DNA Database of Japan(DDBJ)[1],the European Bioinformatics Institute(EMBL-EBI)in the ... Background Big data challenges In the late 1980s and early 1990s,three major international biological data centers were created:the DNA Database of Japan(DDBJ)[1],the European Bioinformatics Institute(EMBL-EBI)in the United Kingdom(UK)[2],and the National Center for Biotechnology Information(NCBI)in the United States(US)[3]. 展开更多
关键词 CENTERS CENTER created
原文传递
微生物组测序与分析专家共识 被引量:6
8
作者 段云峰 王升跃 +26 位作者 陈禹保 杨瑞馥 李后开 朱怀球 童贻刚 杜文斌 付钰 胡松年 王军 辛玉华 赵方庆 鲍一明 张雯 李娟 曾明 牛海涛 周欣 李岩 崔生辉 袁静 李俊桦 王加义 刘东来 倪铭 孙青 邓晔 朱宝利 《生物工程学报》 CAS CSCD 北大核心 2020年第12期2516-2524,共9页
在过去的十几年,微生物组相关研究和应用持续升温。微生物组逐渐成为生命科学、环境科学和医学等领域的研究焦点。与此同时,全球多个国家和组织也都积极发起各自的微生物组计划,进行多方面的布局,力争在这一具有广阔前景的领域获得战略... 在过去的十几年,微生物组相关研究和应用持续升温。微生物组逐渐成为生命科学、环境科学和医学等领域的研究焦点。与此同时,全球多个国家和组织也都积极发起各自的微生物组计划,进行多方面的布局,力争在这一具有广阔前景的领域获得战略地位。此外,无论是科研还是产业应用已经迎来了研究高潮和投融资热潮,微生物组相关产品和服务也不断出现。然而,行业在快速发展的同时,也存在一些不足。由于微生物组测序和分析相关技术和方法发展迅速,各国研究和应用尚未在技术、方案和数据等标准上达成统一,国内行业参与者对微生物组也存在认识不足,对微生物组相关新方法、新技术、新理论等还未能充分掌握和使用。除此之外,已有的一些标准和指南,内容过于简单,实操性也不足,这不仅给科研数据的整合造成了困难和资源浪费,还给相关企业进行不良竞争、以次充好提供了机会。更重要的是,我国尚缺乏微生物组相关的国家标准,国家微生物组计划仍处于筹备过程。在此背景下,中国生物工程学会、中国科学院微生物研究所于2019年6月至2020年3月,共同设立了“微生物组测序与分析专家共识”专项研究课题。中国生物工程学会组织了微生物组相关领域的27位专家以及来自行业内的30多位专业人员,通过分成4个项目小组、召开4轮研讨会后,最终形成了涵盖从微生物采集与保存、DNA提取与建库、高通量基因测序和数据分析以及质控标准品等全流程的“微生物组测序与分析专家共识”。本专家共识具有较强可参考性和可操作性,不仅能指导国内科研和产业机构规范进行微生物组相关产、学、研,还能为国家相关职能部门提供可参考的技术依据,保障规模型和规范化的企业利益,加强行业自律,避免不规范的企业扰乱市场,最终促进微生物组相关产业的良性发展。 展开更多
关键词 微生物组 高通量基因测序 专家共识 国家标准 微生物组计划
原文传递
The Genome Sequence Archive Family: Toward Explosive Data Growth and Diverse Data Types 被引量:86
9
作者 Tingting Chen Xu Chen +18 位作者 Sisi Zhang Junwei Zhu Bixia Tang Anke Wang Lili Dong Zhewen Zhang Caixia Yu Yanling Sun Lianjiang Chi Huanxin Chen Shuang Zhai Yubin Sun Li Lan Xin Zhang Jingfa Xiao yiming bao Yanqing Wang Zhang Zhang Wenming Zhao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2021年第4期578-583,共6页
The Genome Sequence Archive(GSA)is a data repository for archiving raw sequence data,which provides data storage and sharing services for worldwide scientific communities.Considering explosive data growth with diverse... The Genome Sequence Archive(GSA)is a data repository for archiving raw sequence data,which provides data storage and sharing services for worldwide scientific communities.Considering explosive data growth with diverse data types,here we present the GSA family by expanding into a set of resources for raw data archive with different purposes,namely,GSA(https://ngdc.cncb.ac.cn/gsa/),GSA for Human(GSA-Human,https://ngdc.cncb.ac.cn/gsa-human/),and Open Archive for Miscellaneous Data(OMIX,https://ngdc.cncb.ac.cn/omix/).Compared with the 2017 version,GSA has been significantly updated in data model,online functionalities,and web interfaces.GSA-Human,as a new partner of GSA,is a data repository specialized in human genetics-related data with controlled access and security.OMIX,as a critical complement to the two resources mentioned above,is an open archive for miscellaneous data.Together,all these resources form a family of resources dedicated to archiving explosive data with diverse types,accepting data submissions from all over the world,and providing free open access to all publicly available data in support of worldwide research activities. 展开更多
关键词 Genome Sequence Archive GSA GSA-Human OMIX
原文传递
Genome Warehouse: A Public Repository Housing Genome-scale Data 被引量:27
10
作者 Meili Chen Yingke Ma +11 位作者 Song Wu Xinchang Zheng Hongen Kang Jian Sang Xingjian Xu Lili Hao Zhaohua Li Zheng Gong Jingfa Xiao Zhang Zhang Wenming Zhao yiming bao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2021年第4期584-589,共6页
The Genome Warehouse(GWH)is a public repository housing genome assembly data for a wide range of species and delivering a series of web services for genome data submission,storage,release,and sharing.As one of the cor... The Genome Warehouse(GWH)is a public repository housing genome assembly data for a wide range of species and delivering a series of web services for genome data submission,storage,release,and sharing.As one of the core resources in the National Genomics Data Center(NGDC),part of the China National Center for Bioinformation(CNCB;https://ngdc.cncb.ac.cn),GWH accepts both full and partial(chloroplast,mitochondrion,and plasmid)genome sequences with different assembly levels,as well as an update of existing genome assemblies.For each assembly,GWH collects detailed genome-related metadata of biological project,biological sample,and genome assembly,in addition to genome sequence and annotation.To archive high-quality genome sequences and annotations,GWH is equipped with a uniform and standardized procedure for quality control.Besides basic browse and search functionalities,all released genome sequences and annotations can be visualized with JBrowse.By May 21,2021,GWH has received 19,124 direct submissions covering a diversity of 1108 species and has released 8772 of them.Collectively,GWH serves as an important resource for genomescale data management and provides free and publicly accessible data to support research activities throughout the world.GWH is publicly accessible at https://ngdc.cncb.ac.cn/gwh. 展开更多
关键词 Genome submission Genome sequence Genome annotation Genome Warehouse Quality control
原文传递
The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR 被引量:14
11
作者 Shuhui Song Lina Ma +27 位作者 Dong Zou Dongmei Tian Cuiping Li Junwei Zhu Meili Chen Anke Wang Yingke Ma Mengwei Li Xufei Teng Ying Cui Guangya Duan Mochen Zhang Tong Jin Chengmin Shi Zhenglin Du Yadong Zhang Chuandong Liu Rujiao Li Jingyao Zeng Lili Hao Shuai Jiang Hua Chen Dali Han Jingfa Xiao Zhang Zhang Wenming Zhao Yongbiao Xue yiming bao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2020年第6期749-759,共11页
On January 22,2020,China National Center for Bioinformation(CNCB)released the 2019 Novel Coronavirus Resource(2019nCoVR),an open-access information resource for the severe acute respiratory syndrome coronavirus 2(SARS... On January 22,2020,China National Center for Bioinformation(CNCB)released the 2019 Novel Coronavirus Resource(2019nCoVR),an open-access information resource for the severe acute respiratory syndrome coronavirus 2(SARS-CoV-2).2019nCoVR features a comprehensive integration of sequence and clinical information for all publicly available SARS-CoV-2 isolates,which are manually curated with value-added annotations and quality evaluated by an automated in-house pipeline.Of particular note,2019nCoVR offers systematic analyses to generate a dynamic landscape of SARS-CoV-2 genomic variations at a global scale.It provides all identified variants and their detailed statistics for each virus isolate,and congregates the quality score,functional annotation,and population frequency for each variant.Spatiotemporal change for each variant can be visualized and historical viral haplotype network maps for the course of the outbreak are also generated based on all complete and high-quality genomes available.Moreover,2019nCoVR provides a full collection of SARS-CoV-2 relevant literature on the coronavirus disease 2019(COVID-19),including published papers from PubMed as well as preprints from services such as bioRxiv and medRxiv through Europe PMC.Furthermore,by linking with relevant databases in CNCB,2019nCoVR offers data submission services for raw sequence reads and assembled genomes,and data sharing with NCBI.Collectively,SARS-CoV-2 is updated daily to collect the latest information on genome sequences,variants,haplotypes,and literature for a timely reflection,making 2019nCoVR a valuable resource for the global research community.2019nCoVR is accessible at https://bigd.big.ac.cn/ncov/. 展开更多
关键词 2019nCoVR SARS-CoV-2 DATABASE Genomic variation HAPLOTYPE
原文传递
Population Genetics of SARS-CoV-2: Disentangling Effects of Sampling Bias and Infection Clusters 被引量:6
12
作者 Qi Liu Shilei Zhao +8 位作者 Cheng-Min Shi Shuhui Song Sihui Zhu Yankai Su Wenming Zhao Mingkun Li yiming bao Yongbiao Xue Hua Chen 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2020年第6期640-647,共8页
A novel RNA virus,the severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),is responsible for the ongoing outbreak of coronavirus disease 2019(COVID-19).Population genetic analysis could be useful for investiga... A novel RNA virus,the severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),is responsible for the ongoing outbreak of coronavirus disease 2019(COVID-19).Population genetic analysis could be useful for investigating the origin and evolutionary dynamics of COVID-19.However,due to extensive sampling bias and existence of infection clusters during the epidemic spread,direct applications of existing approaches can lead to biased parameter estimations and data misinterpretation.In this study,we first present robust estimator for the time to the most recent common ancestor(TMRCA)and the mutation rate,and then apply the approach to analyze 12,909 genomic sequences of SARS-CoV-2.The mutation rate is inferred to be 8.69×10^(−4) per site per year with a 95%confidence interval(CI)of[8.61×10^(−4),8.77×10^(−4)],and the TMRCA of the samples inferred to be Nov 28,2019 with a 95%CI of[Oct 20,2019,Dec 9,2019].The results indicate that COVID-19 might originate earlier than and outside of Wuhan Seafood Market.We further demonstrate that genetic polymorphism patterns,including the enrichment of specific haplotypes and the temporal allele frequency trajectories generated from infection clusters,are similar to those caused by evolutionary forces such as natural selection.Our results show that population genetic methods need to be developed to efficiently detangle the effects of sampling bias and infection clusters to gain insights into the evolutionary mechanism of SARS-CoV-2.Software for implementing VirusMuT can be downloaded at https://bigd.big.ac.cn/biocode/tools/BT007081. 展开更多
关键词 COVID-19 SARS-CoV-2 Phylogenetic divergence Infection cluster Sampling bias
原文传递
The Elements of Data Sharing 被引量:3
13
作者 Zhang Zhang Shuhui Song +3 位作者 Jun Yu Wenming Zhao Jingfa Xiao yiming bao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2020年第1期1-4,共4页
Data and their tailored characteristics are inheritable and longlived,surpassing their analyzed results and conclusions regardless if they are produced by their generators or users.Aside from designing experiments for... Data and their tailored characteristics are inheritable and longlived,surpassing their analyzed results and conclusions regardless if they are produced by their generators or users.Aside from designing experiments for the new acquisition,scientific researchers always begin with a thorough synthesis of the existing data,especially those that have been demonstrated authentic and timely. 展开更多
关键词 authentic PASSING SYNTHESIS
原文传递
Genomic Perspectives on the Emerging SARS-CoV-2 Omicron Variant 被引量:2
14
作者 Wentai Ma Jing Yang +7 位作者 Haoyi Fu Chao Su Caixia Yu Qihui Wang Ana Tereza Ribeiro de Vasconcelos Georgii A.Bazykin yiming bao Mingkun Li 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2022年第1期60-69,共10页
A new variant of concern for SARS-CoV-2,Omicron(B.1.1.529),was designated by the World Health Organization on November 26,2021.This study analyzed the viral genome sequencing data of 108 samples collected from patient... A new variant of concern for SARS-CoV-2,Omicron(B.1.1.529),was designated by the World Health Organization on November 26,2021.This study analyzed the viral genome sequencing data of 108 samples collected from patients infected with Omicron.First,we found that the enrichment efficiency of viral nucleic acids was reduced due to mutations in the region where the primers anneal to.Second,the Omicron variant possesses an excessive number of mutations compared to other variants circulating at the same time(median:62 vs.45),especially in the Spike gene.Mutations in the Spike gene confer alterations in 32 amino acid residues,more than those observed in other SARS-CoV-2 variants.Moreover,a large number of nonsynonymous mutations occur in the codons for the amino acid residues located on the surface of the Spike protein,which could potentially affect the replication,infectivity,and antigenicity of SARS-CoV-2.Third,there are 53 mutations between the Omicron variant and its closest sequences available in public databases.Many of these mutations were rarely observed in public databases and had a low mutation rate.In addition,the linkage disequilibrium between these mutations was low,with a limited number of mutations concurrently observed in the same genome,suggesting that the Omicron variant would be in a different evolutionary branch from the currently prevalent variants.To improve our ability to detect and track the source of new variants rapidly,it is imperative to further strengthen genomic surveillance and data sharing globally in a timely manner. 展开更多
关键词 Omicron GENOMICS MUTATION Variant of concern SARS-CoV-2
原文传递
Genomic Epidemiology of SARS-CoV-2 in Pakistan 被引量:2
15
作者 Shuhui Song Cuiping Li +23 位作者 Lu Kang Dongmei Tian Nazish Badar Wentai Ma Shilei Zhao Xuan Jiang Chun Wang Yongqiao Sun Wenjie Li Meng Lei Shuangli Li Qiuhui Qi Aamer Ikram Muhammad Salman Massab Umair Huma Shireen Fatima Batool Bing Zhang Hua Chen Yun-Gui Yang Amir Ali Abbasi Mingkun Li Yongbiao Xue yiming bao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2021年第5期727-740,共14页
COVID-19 has swept globally and Pakistan is no exception.To investigate the initial introductions and transmissions of the SARS-CoV-2 in Pakistan,we performed the largest genomic epidemiology study of COVID-19 in Paki... COVID-19 has swept globally and Pakistan is no exception.To investigate the initial introductions and transmissions of the SARS-CoV-2 in Pakistan,we performed the largest genomic epidemiology study of COVID-19 in Pakistan and generated 150 complete SARS-CoV-2 genome sequences from samples collected from March 16 to June 1,2020.We identified a total of 347 mutated positions,31 of which were over-represented in Pakistan.Meanwhile,we found over 1000 intra-host single-nucleotide variants(iSNVs).Several of them occurred concurrently,indicating possible interactions among them or coevolution.Some of the high-frequency iSNVs in Pakistan were not observed in the global population,suggesting strong purifying selections.The genomic epidemiology revealed five distinctive spreading clusters.The largest cluster consisted of 74 viruses which were derived from different geographic locations of Pakistan and formed a deep hierarchical structure,indicating an extensive and persistent nation-wide transmission of the virus that was probably attributed to a signature mutation(G8371T in ORF1ab)of this cluster.Furthermore,28 putative international introductions were identified,several of which are consistent with the epidemiological investigations.In all,this study has inferred the possible pathways of introductions and transmissions of SARS-CoV-2 in Pakistan,which could aid ongoing and future viral surveillance and COVID-19 control. 展开更多
关键词 SARS-CoV-2 VIRUS Molecular evolution Haplotype network Pakistan
原文传递
MPoxVR: A comprehensive genomic resource for monkeypox virus variant surveillance 被引量:3
16
作者 Yingke Ma Meili Chen +2 位作者 yiming bao Shuhui Song MPoxVR Team 《The Innovation》 2022年第5期31-32,共2页
Monkeypox is a viral zoonotic disease endemic in Central and West Africa.Since January 1,2022,3413 laboratory-confirmed monkeypox cases and one death have been reported from 50 countries/territories in five WHO region... Monkeypox is a viral zoonotic disease endemic in Central and West Africa.Since January 1,2022,3413 laboratory-confirmed monkeypox cases and one death have been reported from 50 countries/territories in five WHO regions(as of June 22,2022;https://www.who.int/emergencies/disease-outbreak-news/item/2022-DON396),and 1310 new cases and eight new countries have been reported in the past week.Genomic epidemiology is vital to determine the similarity between viruses and suggest possible links between cases,origins of infection,and transmission dynamics when combined with epidemiological information.However,one of the priority evidence gaps relating to the monkeypox outbreak is genome sequencing and in-host variation analysis.1 So,timely sharing both raw sequence data and consensus genomic data are useful to public health investigators and academic partners undertaking related studies. 展开更多
关键词 EPIDEMIOLOGY SIMILARITY MONKEY
原文传递
Ongoing Positive Selection Drives the Evolution of SARS-CoV-2 Genomes
17
作者 Yali Hou Shilei Zhao +7 位作者 Qi Liu Xiaolong Zhang Tong Sha Yankai Su Wenming Zhao yiming bao Yongbiao Xue Hua Chen 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2022年第6期1214-1223,共10页
SARS-CoV-2 is a new RNA virus affecting humans and spreads extensively throughout the world since its first outbreak in December,2019.Whether the transmissibility and pathogenicity of SARS-CoV-2 in humans after zoonot... SARS-CoV-2 is a new RNA virus affecting humans and spreads extensively throughout the world since its first outbreak in December,2019.Whether the transmissibility and pathogenicity of SARS-CoV-2 in humans after zoonotic transfer are actively evolving,and driven by adaptation to the new host and environments is still under debate.Understanding the evolutionary mechanism underlying epidemiological and pathological characteristics of COVID-19 is essential for predicting the epidemic trend,and providing guidance for disease control and treatments.Interrogating novel strategies for identifying natural selection using within-species polymorphisms and 3,674,076 SARSCoV-2 genome sequences of 169 countries as of December 30,2021,we demonstrate with population genetic evidence that during the course of SARS-CoV-2 pandemic in humans,1)SARS-CoV-2 genomes are overall conserved under purifying selection,especially for the 14 genes related to viral RNA replication,transcription,and assembly;2)ongoing positive selection is actively driving the evolution of 6 genes(e.g.,S,ORF3a,and N)that play critical roles in molecular processes involving pathogen–host interactions,including viral invasion into and egress from host cells,and viral inhibition and evasion of host immune response,possibly leading to high transmissibility and mild symptom in SARS-CoV-2 evolution.According to an established haplotype phylogenetic relationship of 138 viral clusters,a spatial and temporal landscape of 556 critical mutations is constructed based on their divergence among viral haplotype clusters or repeatedly increase in frequency within at least 2 clusters,of which multiple mutations potentially conferring alterations in viral transmissibility,pathogenicity,and virulence of SARS-CoV-2 are highlighted,warranting attention. 展开更多
关键词 COVID-19 SARS-CoV-2 Viral evolution Natural selection Darwinian selection
原文传递
GenBase: A Nucleotide Sequence Database
18
作者 Congfan Bu Xinchang Zheng +10 位作者 Xuetong Zhao Tianyi Xu Xue Bai Yaokai Jia Meili Chen Lili Hao Jingfa Xiao Zhang Zhang Wenming Zhao Bixia Tang yiming bao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS 2024年第3期107-112,共6页
The rapid advancement of sequencing technologies poses challenges in managing the large volume and exponential growth of sequence data efficiently and on time.To address this issue,we present GenBase(https://ngdc.cncb... The rapid advancement of sequencing technologies poses challenges in managing the large volume and exponential growth of sequence data efficiently and on time.To address this issue,we present GenBase(https://ngdc.cncb.ac.cn/genbase),an open-access data repository that follows the International Nucleotide Sequence Database Collaboration(INSDC)data standards and structures,for efficient nucleotide sequence archiving,searching,and sharing.As a core resource within the National Genomics Data Center(NGDC)of the China National Center for Bioinformation(CNCB;https://ngdc.cncb.ac.cn),GenBase offers bilingual submission pipeline and services,as well as local submission assistance in China.GenBase also provides a unique Excel format for metadata description and feature annotation of nucleotide sequences,along with a real-time data validation system to streamline sequence submissions.As of April 23,2024,GenBase received 68,251 nucleotide sequences and 689,574 annotated protein sequences across 414 species from 2319 submissions.Out of these,63,614(93%)nucleotide sequences and 620,640(90%)annotated protein sequences have been released and are publicly accessible through GenBase’s web search system,File Transfer Protocol(FTP),and Application Programming Interface(API).Additionally,in collaboration with INSDC,GenBase has constructed an effective data exchange mechanism with GenBank and started sharing released nucleotide sequences.Furthermore,GenBase integrates all sequences from GenBank with daily updates,demonstrating its commitment to actively contributing to global sequence data management and sharing. 展开更多
关键词 Nucleotide sequence Database GenBase GenBank INSDC
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部