摘要
在生物信息学中,传统的序列比对算法在理论基础和计算上都具有局限性,因此近二十年人们提出和发展了很多序列比较的数学方法.本文综述了较有代表性的几种:图形表示及其矩阵不变量方法;研究生物大分子二级结构的拓扑图论方法;基于字出现频率的统计学方法;Kolmogorov及Lempel-Ziv复杂度方法.
In the bioinformatics, sequence alignment algorithms still have the limitations of theoretic foundation, and the computational load escalates as a power function of the length of the sequences. Therefore, in recent twenty years, many mathematic categories of sequence comparison are outlined and developed. Four main mathematic categories of sequence comparison are reviewed: graphical representation and matrix invariant methods; topologic methods that applies to biological macromolecular secondary structure; statistics methods based on the word frequency and its distribution; kolmogorov complexity and L- Z Complexity methods.
出处
《渤海大学学报(自然科学版)》
CAS
2013年第1期1-7,70,共8页
Journal of Bohai University:Natural Science Edition
基金
浙江省自然科学基金资助项目(No:Y1110752)
关键词
DNA
RNA二级结构
图形表示
L—Z复杂度
序列比较
DNA
RNA secondary structure
graphical representation
L- Z Complexity
sequence com-parison