摘要
基于DNA序列4种核苷酸的物理化学性质,考虑相邻两个碱基组合形式,提出一种新的DNA序列4D表示。基于这种表示,可以把DNA序列简化成4D空间的一系列点,根据点坐标抽取序列数值特征,再根据数值特征给出方法对DNA序列进行相似性分析。以10个不同物种的β-球蛋白基因的第一个外显子碱基序列的为例子,说明基于4D表示的DNA序列分析方法是有效的。
Based on the physical and chemical characters of the four nucleotides of the DNA sequence, a new 4D representation of the DNA sequence by considering the two adjacent bases is proposed. Based on this representation, the DNA sequence can simplify into a 4-D space dots, and give an approach to make analysis of DNA sequence using the numerical properties. The similarity analysis of 10 different species of β-myosin gene in the first exon of the sequence are used to explain that DNA sequence analysis method based on 4D representation is effective.
出处
《科学技术与工程》
2008年第6期1405-1409,共5页
Science Technology and Engineering
关键词
DNA序列4D表示
序列分析
DNA sequence 4D representation sequence analysis