Abstract
Using the mapping from codons to amino acids and the stop signal, a pseudo-amino-acid sequence is defined for a DNA sequence. Then, by means of multisets, a 21-dimensional numerical vector representation of a DNA sequence is constructed, from which the similarity distance between any two DNA sequences can be calculated. Phylogenetic analyses of three datasets (complete S-segment sequences of hantaviruses, complete genome sequences of Tomato yellow leaf curl virus, and complete genome sequences of human rhinovirus) demonstrate the effectiveness of the proposed method.
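The pipeline outlined in the abstract (codons to pseudo-amino-acid sequence, then a multiset, then a 21-dimensional vector, then a distance) can be illustrated with a minimal Python sketch. The abstract does not say how the vector entries or the distance are defined, so the choices here are assumptions made only for illustration: the standard genetic code, non-overlapping codons read from the first base, relative frequencies of the 21 symbols (20 amino acids plus the stop signal) as vector entries, and Euclidean distance. The function names are hypothetical and not taken from the paper.

```python
from collections import Counter
from itertools import product
import math

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks the stop signal.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# The 21 symbols: 20 amino acids plus the stop signal.
SYMBOLS = "ACDEFGHIKLMNPQRSTVWY*"


def pseudo_amino_acids(dna):
    """Translate non-overlapping codons, starting at the first base (assumed frame).

    Codons containing ambiguous bases are skipped (an assumption).
    """
    dna = dna.upper().replace("U", "T")
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons if c in CODON_TABLE)


def feature_vector(dna):
    """21-dimensional vector of symbol frequencies (assumed construction)."""
    counts = Counter(pseudo_amino_acids(dna))  # multiset of the 21 symbols
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(SYMBOLS)
    return [counts[s] / total for s in SYMBOLS]


def distance(dna_a, dna_b):
    """Euclidean distance between the two feature vectors (assumed metric)."""
    va, vb = feature_vector(dna_a), feature_vector(dna_b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(va, vb)))
```

A pairwise distance matrix built with `distance` over a set of sequences could then be passed to a standard tree-building method; the abstract does not state which method the authors used for the phylogenetic analysis.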
Source
Acta Agriculturae Zhejiangensis (《浙江农业学报》)
CSCD
PKU Core Journal (北大核心)
2015, Issue 7, pp. 1244-1252 (9 pages)
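Funding
National Natural Science Foundation of China (11171042)
Liaoning Province "Hundred-Thousand-Ten Thousand Talents Project" (2012921060)
Innovation Team Program of Liaoning Higher Education Institutions (LT2014024)
Open Project of the Liaoning Key Laboratory of Food Safety (LNSAKF2011034)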