Analysis method and algorithm design of biological sequence problem based on generalized k-mer vector

Abstract: K-mers can be used to describe biological sequences, and the k-mer distribution is a tool for solving sequence analysis problems in bioinformatics. A k-mer vector can serve as a representation of the k-mer distribution of a biological sequence. Problems such as similarity calculation or sequence assembly can then be described in the k-mer vector space. This helps to identify new features of long-standing sequence-based problems in bioinformatics and to develop new algorithms using concepts and methods from linear space theory. In this study, we define the k-mer vector space for generalized biological sequences and explain the meaning of the corresponding vector operations in the biological context. We present the vector/matrix form of several common sequence-based problems, including read quantification, sequence assembly, and pattern detection, and discuss the advantages and disadvantages of this formulation. We also implement a tool for the sequence assembly problem based on k-mer vector methods, which demonstrates the practicability and convenience of this algorithm design strategy.
Source: Applied Mathematics (A Journal of Chinese Universities) [高校应用数学学报(英文版)(B辑)], SCIE, CSCD, 2021, Issue 1, pp. 114-127 (14 pages).
Funding: the National Natural Science Foundation of China (11771393, 11632015); the Natural Science Foundation of Zhejiang Province, China (LZ14A010002).
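
The abstract describes representing a sequence by its k-mer vector and posing similarity calculation in that vector space. The following Python sketch is one plausible reading of that idea, not the authors' implementation: it assumes a DNA alphabet, uses raw k-mer counts as coordinates, and uses cosine similarity as the comparison. The names `kmer_index`, `kmer_vector`, and `cosine_similarity` are hypothetical and chosen only for illustration.

```python
from itertools import product
from math import sqrt

ALPHABET = "ACGT"  # assumption: plain DNA alphabet; the paper treats generalized sequences


def kmer_index(k, alphabet=ALPHABET):
    """Assign each possible k-mer over the alphabet one vector coordinate."""
    return {"".join(p): i for i, p in enumerate(product(alphabet, repeat=k))}


def kmer_vector(seq, k, alphabet=ALPHABET):
    """Count vector of length len(alphabet)**k for the k-mers occurring in seq."""
    index = kmer_index(k, alphabet)
    vec = [0] * len(index)
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in index:  # windows with symbols outside the alphabet are skipped
            vec[index[kmer]] += 1
    return vec


def cosine_similarity(u, v):
    """Cosine of the angle between two count vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    a = kmer_vector("ACGTACGT", 2)
    b = kmer_vector("ACGTTTTT", 2)
    print(cosine_similarity(a, b))
```

A dense vector of length 4^k is only practical for small k; for larger k a sparse mapping from k-mer to count would be the more natural choice under the same assumptions.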