期刊文献+

Characterizing and annotating the genome using RNA-seq data 被引量:23

Characterizing and annotating the genome using RNA-seq data
原文传递
导出
摘要 Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technologies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic variants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts(especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Genome-guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses. Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technol- ogies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic vari- ants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Ge- nome-guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses.
出处 《Science China(Life Sciences)》 SCIE CAS CSCD 2017年第2期116-125,共10页 中国科学(生命科学英文版)
基金 supported by the National High Technology Research and Development Program of China(2015AA020104) the China Human Proteome Project(2014DFB30010) the National Science Foundation of China(31471239,to Leming Shi) the 111 Project(B13016)
关键词 RNA序列 基因组 数据表征 注释 转录组学 测序技术 生物信息学 数据分析 RNA-seq, genome-guided transcriptome reconstruction, de novo assembly, long noncoding RNA, genetic variants
  • 相关文献

参考文献1

二级参考文献1

共引文献15

同被引文献130

引证文献23

二级引证文献74

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部