期刊文献+

基于偏序图生成共识序列的改进星比对算法

An Improved Star Alignment Algorithm for Generating Consistent Sequences Based on Partial Order Graphs
下载PDF
导出
摘要 多序列比对是生物信息学中十分常用的序列分析方法,主要用于分析分子进化关系、基因组分析、找出DNA序列之间的共同结构特征,从而准确判断序列结构和功能之间的具体联系。现今广泛使用的多序列比对方法主要分为渐进式比对和迭代式比对两种方式,但这两种方式在进行多序列比对时的时间开销相对较大,且比对结果准确性和复杂度受限于待比对序列的质量和相似度。相对来说,星比对算法的时间复杂度更低,常用于高相似度序列之间的比对。但对低相似度序列进行比对时,其结果精度还有待提高。针对星比对算法在低相似度序列中精度较差问题,文章提出了利用偏序图生成的共识序列对星比对算法进行了优化,结合SIMD并行策略加快共识序列的产生,从而扩大了算法的应用范围,提高了比对结果准确度,该研究最后通过实验证明了算法优化的有效性。 Multi sequence alignment is a very common sequence analysis method in bioinformatics,which is mainly used to analyze the molecular evolutionary relationship,genome analysis,find out the common structural characteristics between DNA sequences,and accurately judge the specific relationship between sequence structure and function.The multiple sequence alignment methods widely used today are mainly divided into progressive alignment and iterative alignment.However,the time cost of these two methods in multiple sequence alignment is relatively large,and the accuracy and complexity of the alignment results are limited by the quality and similarity of the sequences to be compared.Relatively speaking,the time complexity of star alignment algorithm is lower,and it is often used for alignment between high similarity sequences.However,the accuracy of the results of low similarity sequence alignment needs to be improved.To solve the problem of poor accuracy of star alignment algorithm in low similarity sequences,the paper proposes to optimize the star alignment algorithm by using consensus sequences generated by partial sequence diagrams,and speed up the generation of consensus sequences by combining SIMD parallel strategy,thus expanding the application scope of the algorithm and improving the accuracy of the alignment results.Finally,the research proves the effectiveness of algorithm optimization through experiments.
作者 胡振铎 刘宇暄 朱晓 HU Zhenduo;LIU yuxuan;ZHU Xiao(School of Computer Science and Information Engineering,Harbin Normal University,Harbin Heilongjiang 150500,China)
出处 《长江信息通信》 2023年第1期1-4,共4页 Changjiang Information & Communications
基金 国家自然科学基金资助项目(61902094) 黑龙江省自然科学基金项目(QC2018082) 黑龙江省普通本科高等学校青年创新人才培养计划项目(UNPYSCT-2018183)。
关键词 多序列比对 星比对算法 偏序图 共识序列 SIMD Multiple sequence alignment Star alignment algorithm Partial sequence diagram Consensus sequence SIMD
  • 相关文献

参考文献2

二级参考文献10

  • 1L Wang, T Jiang. On the complexity of multiple sequence alignment[ J] .Journal of Computational Biology, 1994, 1 (4) : 337 - 34.
  • 2Robert C Edgar, Seraflm Batzoglou. Muitiple sequence alignment[ J]. Current opinion in slructural biology. 2006, 16( 3): 368 - 373.
  • 3Notredame C,Higgins D. G. SAGA:sequence alignment by genetic algorithm[ J]. Nucleic Acids Research. 1996,24 (8) : 1515 - 1524.
  • 4Serafim Batzoglou. The many faces of sequence alignment[ J]. Briefings in Bioinformatics. 2005,6( 1 ) : 6 - 22.
  • 5Aho A V, Corasick M J. Efficient string matching: an aid to bibliographic search [J]. Communications of ACM, 1975, 18 (6) :333 - 340.
  • 6D Gusfield. Algorithms on strings, trees and sequences: computer science and computational biology [ M ]. Cambridge: Cambridge University Press. 1997.
  • 7Quan Zou,Maozu Guo, Yang Liu, Taotao Zhang. An Improved Algorithm for Aligning DNA/RNA Sequences Quickly[ A]. International Conference on Computational Intelligence and Security[C]. Harbin: reEF, Press,2007. 825 - 828.
  • 8D Gusfield. Efficient methods for multiple sequence alignment with guaranteed error bounds[ J]. Bulletin of Mathematical Biology. 1993,55(1) : 141 - 154.
  • 9Brown J W. The Ribonuclease P database [ J ]. Nucleic Acids Research, 1999,27( 1 ) :314.
  • 10A Rambaut.Estimating the rate of molecular evolution:incorporating noncontemporaneous sequences into maximum likelihood phylogenies[ J]. Bioinformatics, 2000,16(4) : 395 - 399.

共引文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部