The Survey of Sequence Alignment Methods Based on the Second Generation Sequencing
Cited by: 14
Abstract Illumina and 454, which use polymerase-based sequencing by synthesis, and SOLiD, which uses ligase-based sequencing by ligation, are currently the three mainstream second-generation sequencing platforms. Aligning the high-throughput reads these platforms produce generally takes two steps: (1) preprocessing and (2) sequence alignment. Preprocessing methods fall into two classes: hash-table-based methods and methods built on the Burrows-Wheeler transform, which derives from the suffix trie. Alignment methods also fall into two classes: spaced-seed indexing and the Smith-Waterman dynamic programming algorithm. This paper tests several commonly used aligners (SHRiMP, MAQ, BFAST, BWA, and BOWTIE) on a single machine, using data produced by the Illumina and SOLiD platforms. The results show that BOWTIE outperforms the other tools in memory usage, alignment speed, and accuracy on Illumina data, while BWA is better suited to data from the SOLiD platform. For high-throughput data from second-generation platforms, and from third-generation platforms typified by nanopore technology, second-generation alignment techniques still cannot fully keep up with the data volume. The paper argues that new sequence alignment methods based on cloud computing are an important direction for future research.
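Authors YANG Ye (杨烨), LIU Juan (刘娟)
Source Journal of Wuhan University: Natural Science Edition (《武汉大学学报(理学版)》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2012, No. 5, pp. 463-470 (8 pages)
Funding Supported by the National Natural Science Foundation of China (60970063), the Doctoral Program Foundation of the Ministry of Education (20090141110026), and the Program for New Century Excellent Talents (NCET-10-0644)
Keywords second-generation sequencing; read; sequence alignment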
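
The two-step scheme described in the abstract (preprocess the reference, then align reads) can be illustrated with a minimal Python sketch. It is not taken from the paper or from any of the tools it evaluates: the function names, the scoring values (match=2, mismatch=-1, gap=-2), the seed length k, and the window padding are all assumptions chosen for illustration. It uses a plain hash table of contiguous k-mers rather than spaced seeds or a Burrows-Wheeler index, and a linear-gap Smith-Waterman score for the alignment step.

```python
from collections import defaultdict


def build_kmer_index(reference, k):
    """Step 1 (preprocessing): hash every k-mer of the reference to its start positions."""
    index = defaultdict(list)
    for i in range(len(reference) - k + 1):
        index[reference[i:i + k]].append(i)
    return index


def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Step 2 (alignment): best local alignment score of a and b, linear gap penalty."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def map_read(read, reference, index, k, pad=2):
    """Seed with the read's first k-mer, then score a reference window around each hit."""
    best_pos, best_score = None, 0
    for pos in index.get(read[:k], []):
        start = max(0, pos - pad)
        window = reference[start:pos + len(read) + pad]
        score = smith_waterman(read, window)
        if score > best_score:
            best_pos, best_score = pos, score
    return best_pos, best_score


# Hypothetical toy data: the read matches the reference at position 8 with one mismatch.
reference = "ACGTACGTTAGCCGATAGGCTA"
read = "TAGCCGTTAGG"
index = build_kmer_index(reference, 4)
print(map_read(read, reference, index, 4))
```

On this toy input the seed "TAGC" occurs once in the reference, and the read aligns there with ten matches and one mismatch, so the call reports position 8 with score 19. Seeding on only the first k-mer is a simplification: a mismatch inside the seed would cause the read to be missed, which is one reason real hash-based aligners use multiple or spaced seeds.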