
Study of De Bruijn Graph for DNA Sequence Assembly
Abstract: Genome sequencing is one of the most fundamental research directions in bioinformatics. However, the genome of most organisms cannot be obtained in a single pass, so sequence assembly techniques are needed to assemble the DNA fragments obtained in experiments. The fragments produced by sequencing have been getting shorter, and assembly algorithms based on the Euler path have an advantage in handling such short fragments. A key step in the Euler path algorithm is the construction of the de Bruijn graph. Conventionally, the graph is built so that each k-mer overlaps the preceding k-mer by k-1 bases, that is, two adjacent k-mers are offset from each other by one position. This paper finds, however, that if two k-mers joined by an edge overlap by k-2 or fewer bases, the structural complexity of the de Bruijn graph is strongly affected. These effects are analyzed in detail and an experiment is designed to verify the analysis. The experimental results show that changing the offset between k-mers has a significant effect on the structural complexity of the de Bruijn graph.
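The construction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `build_graph` and `branching_nodes`, the `shift` parameter, and the choice to link the k-mer at position i to the k-mer at position i + shift are all assumptions made for illustration. With `shift=1` the linked k-mers overlap by k-1 bases (the conventional construction); with `shift=2` they overlap by k-2 bases. Counting nodes with more than one successor is only a crude stand-in for "structural complexity", which the abstract does not define.

```python
from collections import defaultdict


def build_graph(reads, k, shift=1):
    """Build a k-mer graph in which linked k-mers overlap by k - shift bases.

    Assumption (not from the paper): the k-mer starting at position i is
    linked to the k-mer starting at position i + shift within the same read.
    """
    if not 1 <= shift < k:
        raise ValueError("shift must satisfy 1 <= shift < k")
    graph = defaultdict(set)
    for read in reads:
        # Last valid i is the one where the destination k-mer still fits.
        for i in range(len(read) - k - shift + 1):
            src = read[i:i + k]
            dst = read[i + shift:i + shift + k]
            graph[src].add(dst)
            graph[dst]  # touch dst so it exists as a node even without successors
    return dict(graph)


def branching_nodes(graph):
    """Count nodes with more than one distinct successor (a rough complexity proxy)."""
    return sum(1 for successors in graph.values() if len(successors) > 1)


if __name__ == "__main__":
    reads = ["ACGTACGA"]  # toy read, invented for this example
    for s in (1, 2):
        g = build_graph(reads, k=3, shift=s)
        print(f"shift={s}: nodes={len(g)}, branching={branching_nodes(g)}")
```

On a single toy read the numbers carry no biological meaning; the sketch only shows how the overlap length enters the construction as one parameter, so that graphs built with different offsets can be compared on the same input.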
Authors: WANG Dongyang, REN Shijun, WANG Yadong (School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China)
Source: Intelligent Computer and Applications (《智能计算机与应用》), 2011, No. 2X, pp. 20-25, 30 (7 pages)
Keywords: Bioinformatics; Genome Sequencing; DNA Assembly; Euler Path; De Bruijn Graph