
DCA-ClustalW: A Divide-and-Conquer Algorithm for Multiple Sequence Alignment
Abstract: Multiple sequence alignment is one of the most fundamental tasks in bioinformatics. Computing an exact multiple sequence alignment is an NP-hard problem, so researchers generally focus on designing approximation algorithms, of which ClustalW is the most representative. Divide and conquer is an important algorithm design paradigm: it splits a complex problem into simpler subproblems and can effectively improve efficiency. This paper combines the divide-and-conquer approach (DCA) with ClustalW to design the DCA-ClustalW algorithm, which divides the multiple sequence alignment problem into simple, easily solved subproblems along both the vertical and horizontal directions and seeks a compromise between the two. Tests on the BaliBase benchmark dataset show that the algorithm is feasible.
Source: Computer & Digital Engineering (《计算机与数字工程》), 2010, No. 11, pp. 30-33, 80 (5 pages)
Funding: Supported by the Natural Science Foundation of Jiangsu Province (Grant No. BK2009393)
Keywords: multiple sequence alignment; divide and conquer; ClustalW

