期刊文献+

DNA序列判别分类模型 被引量:1

DNA Sequence Identification Classification Model
下载PDF
导出
摘要 [目的]结合生物学知识和数学方法构建DNA序列判别分类模型。[方法]根据氨基酸分子中侧链基的极性性质,从不同序列中氨基酸含量不同提炼出能从碱基含量和碱基排列情况两方面代表序列特征的氨基酸类别信息,用一个四维向量来表征,用马氏距离法和FISHER判别法对给定序列进行分类。[结果]该模型中,2种分类方法所得的样本回代率均达100%,分类一致率为90%。[结论]该模型算法简单,分类结果精度较高,优于仅基于碱基含量的判别分类模型。 [Objective] The research aimed to construct DNA sequence identification classification model by combining with the biology knowledge and the mathematical method.[Method] According to the polarity nature of side chain radical in the amino acid,the amino acid class information which could represent the sequence characteristic from the base content and the base arrangement was extracted from the different amino acid contents in the different sequences.The four-dimensional vector was used to token,and Mahalanobis distance method,FISHER discriminance were used to classify the given sequence.[Result] In the model,the sample return rates of two kinds of classification methods were both 100%,and the consistent rate of classification was 90%.[Conclusion] In the model,the arithmetic method was simple,and the accuracy of classification result was higher.It was superior to the identification classification model which only based on the base content.
作者 王显金 阳军
机构地区 宁波大红鹰学院
出处 《安徽农业科学》 CAS 北大核心 2011年第23期13955-13957,13976,共4页 Journal of Anhui Agricultural Sciences
基金 宁波大红鹰学院2011年科研课题(CF102601)
关键词 DNA序列 密码子 判别分析 频率 DNA sequence Codon Discriminant analysis Frequency
  • 相关文献

参考文献29

二级参考文献115

共引文献71

同被引文献12

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部