The next generation sequencing (NGS) is an important process which assures inexpen- sive organization of vast size of raw sequence dataset over any traditional sequencing systems or methods. Various aspects of NGS s...The next generation sequencing (NGS) is an important process which assures inexpen- sive organization of vast size of raw sequence dataset over any traditional sequencing systems or methods. Various aspects of NGS such as template preparation, sequencing imaging and genome alignment and assembly outline the genome sequencing and align- ment. Consequently, de Bruijn graph (dBG) is an important mathematical tool that graphically analyzes how the orientations are constructed in groups of nucleotides. Basi- cally, dBG describes the formation of the genome segments in circular iterative fashions. Some pivotal dBG-based de novo algorithms and software packages such as T-IDBA, Oases, IDBA-tran, Euler, Velvet, ABYSS, AllPaths, SOAPde novo and SOAPde novo2 are illustrated in this paper. Consequently, overlap layout consensus (OLC) graph-based algorithms also play vital role in NGS assembly. Some important OLC-based algorithms such as MIRA3, CABOG, Newbler, Edena, Mosaik and SHORTY are portrayed in this paper. It has been experimented that greedy graph-based algorithms and software pack- ages are also vital for proper genome dataset assembly. A few algorithms named SSAKE, SHARCGS and VCAKE help to perform proper genome sequencing.展开更多
Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edge...Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edged fields of epigenomics and epitranscriptomics.Developing chemical and biological tools to detect specific modifications in the genome or transcriptome has greatly facilitated their study.Here,we review the recent technological advances in this rapidly evolving field.We focus on high-throughput detection methods and biological findings for these modifications,and discuss questions to be addressed as well.We also summarize third-generation sequencing methods,which enable long-read and single-molecule sequencing of DNA and RNA modification.展开更多
文摘The next generation sequencing (NGS) is an important process which assures inexpen- sive organization of vast size of raw sequence dataset over any traditional sequencing systems or methods. Various aspects of NGS such as template preparation, sequencing imaging and genome alignment and assembly outline the genome sequencing and align- ment. Consequently, de Bruijn graph (dBG) is an important mathematical tool that graphically analyzes how the orientations are constructed in groups of nucleotides. Basi- cally, dBG describes the formation of the genome segments in circular iterative fashions. Some pivotal dBG-based de novo algorithms and software packages such as T-IDBA, Oases, IDBA-tran, Euler, Velvet, ABYSS, AllPaths, SOAPde novo and SOAPde novo2 are illustrated in this paper. Consequently, overlap layout consensus (OLC) graph-based algorithms also play vital role in NGS assembly. Some important OLC-based algorithms such as MIRA3, CABOG, Newbler, Edena, Mosaik and SHORTY are portrayed in this paper. It has been experimented that greedy graph-based algorithms and software pack- ages are also vital for proper genome dataset assembly. A few algorithms named SSAKE, SHARCGS and VCAKE help to perform proper genome sequencing.
基金This work was supported by the National Natural Science Foundation of China(Grant No.31861143026 to C.Y.)the Ministry of Science and Technology of China(Grant Nos.2019YFA0110902 and 2019YFA08002501 to C.Y.)the Ludwig Institute for Cancer Research(C-X.S.),Cancer Research UK(C63763/A26394 and C63763/A27122 to C-X.S.)NIHR Oxford Biomedical Research Centre(to C-X.S.)and Emerson Collective(to C-X.S.).L-Y.Z.is supported by China Scholarship Council.The views expressed are those of the authors and not necessarily those of the NHS,the NIHR or the Department of Health.We apologize for not being able to cite all the publications related to this topic due to space constraints of the journal.
文摘Over 17 and 160 types of chemical modifications have been identified in DNA and RNA,respectively.The interest in understanding the various biological functions of DNA and RNA modifications has lead to the cutting-edged fields of epigenomics and epitranscriptomics.Developing chemical and biological tools to detect specific modifications in the genome or transcriptome has greatly facilitated their study.Here,we review the recent technological advances in this rapidly evolving field.We focus on high-throughput detection methods and biological findings for these modifications,and discuss questions to be addressed as well.We also summarize third-generation sequencing methods,which enable long-read and single-molecule sequencing of DNA and RNA modification.