Abstract
Recognizing human splice sites is an important problem in current research. Exploiting the sequence conservation of the regions flanking human splice sites, the authors use a position-correlation weight matrix (PCWM) scoring function and DNA structural (geometric) descriptors as input features for support vector machine (SVM) models that predict donor and acceptor splice sites in the human genome. Under 5-fold cross-validation, the overall prediction accuracy is 92.55% for donor sites and 90.70% for acceptor sites. Under a 3-way data split, the overall accuracy is 92.25% for donor sites and 89.87% for acceptor sites.
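The abstract does not define the PCWM, so the following is only a minimal sketch of the simpler idea it builds on: an independent-position weight matrix that scores a candidate window by summed log-odds. The function names, the uniform 0.25 background, the pseudocount, and the toy sequences are all assumptions made for illustration and are not taken from the paper; a score of this kind could serve as one input feature to an SVM.

```python
import math
from collections import Counter

BASES = "ACGT"

def position_weight_matrix(seqs, pseudocount=1.0):
    """Per-position log2-odds of each base against a uniform 0.25 background.

    Assumption: positions are treated as independent, unlike the
    position-correlation matrix named in the abstract.
    """
    length = len(seqs[0])
    total = len(seqs) + pseudocount * len(BASES)
    matrix = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        matrix.append({
            b: math.log2(((counts[b] + pseudocount) / total) / 0.25)
            for b in BASES
        })
    return matrix

def score(matrix, seq):
    """Sum the log-odds of the observed base at every position."""
    return sum(column[base] for column, base in zip(matrix, seq))

# Made-up donor-like windows (three exon bases followed by GT...), illustration only.
toy_donor_windows = ["CAGGTAAGT", "AAGGTGAGT", "CAGGTAAGA", "GAGGTAAGT"]
pwm = position_weight_matrix(toy_donor_windows)

consensus_like = score(pwm, "CAGGTAAGT")  # matches the toy training windows
decoy = score(pwm, "CAGCTTACG")           # lacks the GT dinucleotide
print(consensus_like > decoy)
```

The pseudocount keeps unseen bases from producing a log of zero; a correlation-aware matrix would additionally condition each position on its neighbours.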
Source
《内蒙古大学学报(自然科学版)》
CAS
CSCD
Peking University Core Journals
2010, No. 4, pp. 390-397 (8 pages)
Journal of Inner Mongolia University:Natural Science Edition
Funding
Supported by the Inner Mongolia Outstanding Academic Leaders Program (No. 20060702)
Keywords
splice site
position-correlation weight matrix (PCWM)
DNA structural parameters