期刊文献+

面向新一代基因测序数据的拼接算法综述 被引量:2

Survey on assembly algorithms for next generation sequencing
下载PDF
导出
摘要 针对新一代DNA测序数据存在reads长度短、高覆盖度且存在错误数据等特点,研发满足实际应用的拼接软件,是序列拼接领域迫切的研究课题。探讨了全基因组序列拼接面临的挑战,研究了主流的几类拼接算法的拼接原理、操作流程,分析各种算法的优缺点和适用范围,其中包括基于贪心图算法、基于OLC图算法、基于De Bruijn图算法等,并根据不同的标准列举了几类拼接算法之间的差异性,最后对基因拼接算法在未来的研究给出了建议。 On condition that next genome sequencing data typically suffers shorter read lengths, high coverage, and different error profiles, development of the sequencing assembly software that could meet practical application has become the most important research topic. This paper analysed the challenges of whole genome assembly, the main strategies of assembly, the steps, the advantages and disadvantages of each algorithms as well as the scope of application, including the graph algorithms based on the greedy, OLC, De Bruijn and so on. On the basis of the principles of different algorithms, the paper gave the comparing results between the various strategies depending on the different standard. Finally, it discussed the feature research recommendations of genome assembly.
作者 颜珂 何威 徐勇 张健 Yan Ke;He Wei;Xu Yong;Zhang Jian(IntelliSense & Bioinformatics Innovation Team, Shenzhen Graduate School Harbin Institute of Technology, Shenzhen Guangdong 518055,China;School of Software Engineering, Shenzhen Institute of Information Technology, Shenzhen Guangdong 518172, China)
出处 《计算机应用研究》 CSCD 北大核心 2016年第9期2573-2578,共6页 Application Research of Computers
基金 2014年深圳市未来产业发展专项资金资助项目(CXZZ20140904154910774,JCYJ20140904154645958)
关键词 生物信息学 全基因组序列拼接 高通量基因测序 bioinformatics whole genome assembly high-throughput sequencing
  • 相关文献

参考文献4

二级参考文献91

  • 1Jian P, Han JW, Morta-zavi-Asl B, et al. Mining Sequential Patterns by Prefix-Projected Growth. ICDE, 2001. 215~224
  • 2Foster I,Kesselman C. The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, 1998
  • 3OGSA(Open Grid Services Architecture) Documents. http:∥www. globus. org/ogsa
  • 4Globus: Research in Resource Management. http:∥ www. globus. org/research/
  • 5Foster I, Kesselman C. The globus project: A status report. In:Proc. The Heterogeneous Computing Workshop, 1998. 4~18
  • 6Mullikin J C,Ning Z. The Phusion Assembler. Genome Research,2003,13(1) :81~90
  • 7Wang JY, Han JW. BIDE: Efficient Mining of Frequent Closed Sequences. In: 20 Intl. Conf. on Date Engineering
  • 8http:∥www. phrap. org
  • 9http:∥www. ncbi. nlm. nih. gov/blast/
  • 10Wang J, Wang J, Yang HM, et al. RePS A: Sequence Assembler That Masks Exact Repeats Identified from the shotgun Data. Genome Research, 2002,12 : 824~831

共引文献8

同被引文献39

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部