期刊文献+

LEDA:一种基于Levenshtein距离的DNA序列拼接算法 被引量:6

LEDA:A DNA Sequence Assembly Algorithm Based on Levenshtein Distance
原文传递
导出
摘要 针对DNA双端测序产生的两条序列Read1和Read2,提出了一种基于Levenshtein距离的DNA序列拼接算法。根据Read1与Read2末端重叠部分的编辑距离,寻找所有可能正确的序列片段,拼接成完整的DNA序列。该算法将通常用于字符串比对的编辑距离运用到DNA序列的拼接问题中,将DNA序列拼接问题转换成为可能发生插入、删除以及替换操作的字符串比对问题,算法简单,解决了其他拼接算法使用时有诸多限制条件的问题。拼接正确率高达99%,相比于其他拼接算法O(N^(2))的时间复杂度,时间复杂度仅为O(n·2x),其中N为reads长度,n为overlap长度,x为Read1与Read2末端重叠部分的最小编辑距离,拼接高效,具有良好的技术优势。 A DNA sequence assembly algorithm based on Levenshtein distance is proposed for two sequences Read1 and Read2 generated by DNA paired-end sequencing in this paper.According to the edit distance between the end overlap of Read1 and Read2,all possible correct sequences are found and the sequences are assembled into a complete DNA sequence.The algorithm is innovative in applying Levenshtein distance to DNA sequence assembly problem which transforms a DNA sequence assembly problem into a string alignment problem that insertion,deletion and substitution operations may occur.The algorithm is simple,but it solves the problem of many restrictions when other assembly algorithms are used with the correct rate of stitching as high as 99%.Meanwhile compared with the time complexity of other assembly algorithms O(N^(2)),the time complexity of this algorithm is only O(n·2^(x)),where N is reads length,n is overlap length,and x is the minimum editing distance of the overlap between Read1 and Read2 ends.So this algorithm is well advanced technically.
作者 崔竞松 薛慧 王兰兰 郭迟 CUI Jingsong;XUE Hui;WANG Lanlan;GUO Chi(Key Laboratory of Aerospace Information Security and Trusted Computing,Ministry of Education,School of Cyber Science and Engineering,Wuhan University,Wuhan 430072,Hubei,China;School of Science,Hohai University,Nanjing 211100,Jiangsu,China;Satellite Navigation and Positioning Research Center,Wuhan University,Wuhan 430079,Hubei,China)
出处 《武汉大学学报(理学版)》 CAS CSCD 北大核心 2022年第3期271-278,共8页 Journal of Wuhan University:Natural Science Edition
基金 国家“十三五”重点研发计划(2016YFB0501800)。
关键词 DNA测序技术 Levenshtein距离 拼接算法 DNA sequencing techniques Levenshtein distance assembly algorithm
  • 相关文献

参考文献1

二级参考文献12

共引文献15

同被引文献123

引证文献6

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部