Abstract
Correct identification of translation initiation sites (TIS) in eukaryotes is important for accurate gene annotation. In eukaryotes, translation does not always start at the first AUG codon of an mRNA; it also depends on the sequence context flanking the AUG. By combining a position weight matrix (PWM) with the length distribution of open reading frames (ORFs), a simple method for identifying translation initiation sites is presented, and it distinguishes the true TIS from upstream AUGs well. Applying the ribosome scanning model to vertebrate and human mRNA sequences yields high prediction accuracy.
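The approach summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the window width, the background model, the thresholds, or the rule that combines the PWM score with the ORF length, so everything concrete below is an assumption made for illustration: the function names (`build_pwm`, `pwm_score`, `orf_length`, `predict_tis`), a uniform background, fixed cutoffs `min_score` and `min_codons`, and the toy sequences. It is not the authors' implementation.

```python
import math

BASES = "ACGU"
STOPS = {"UAA", "UAG", "UGA"}


def build_pwm(windows, pseudocount=1.0):
    """Log-odds PWM from equal-length aligned windows around known TIS.

    Assumes a uniform background (0.25 per base); a real model would
    likely estimate the background from the data.
    """
    pwm = []
    for pos in range(len(windows[0])):
        counts = {b: pseudocount for b in BASES}
        for w in windows:
            counts[w[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2((counts[b] / total) / 0.25) for b in BASES})
    return pwm


def pwm_score(pwm, window):
    """Sum of per-position log-odds scores for one window."""
    return sum(col[base] for col, base in zip(pwm, window))


def orf_length(mrna, start):
    """Codons from the AUG at `start` up to, not including, the first
    in-frame stop codon (or to the end of the sequence if none)."""
    n = 0
    for i in range(start, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOPS:
            break
        n += 1
    return n


def predict_tis(mrna, pwm, upstream, min_score, min_codons):
    """Ribosome-scanning style search: walk 5'->3' and return the index
    of the first AUG whose context score and ORF length both pass the
    (hypothetical) cutoffs; return None if no AUG qualifies."""
    width = len(pwm)
    for i in range(len(mrna) - 2):
        if mrna[i:i + 3] != "AUG":
            continue
        lo, hi = i - upstream, i - upstream + width
        if lo < 0 or hi > len(mrna):
            continue  # not enough flanking sequence to score this AUG
        if (pwm_score(pwm, mrna[lo:hi]) >= min_score
                and orf_length(mrna, i) >= min_codons):
            return i
    return None


# Toy data (invented): 10-nt windows with 6 nt upstream of the AUG.
training = ["GCCACCAUGG", "GCCGCCAUGG", "ACCACCAUGG"]
pwm = build_pwm(training)

# An upstream AUG in weak context with a 1-codon ORF, followed by an
# AUG in strong context with a 5-codon ORF.
mrna = "C" + "UUUUUU" + "AUG" + "UAA" + "GCCACC" + "AUG" + "GCU" * 4 + "UAA"
print(predict_tis(mrna, pwm, upstream=6, min_score=5.0, min_codons=3))
```

In this toy sequence the upstream AUG at index 7 is rejected on both criteria, so scanning continues to the AUG at index 19. Using hard cutoffs keeps the sketch short; a combined probabilistic score over context and ORF length would be an equally plausible reading of the abstract.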
Source
Journal of Inner Mongolia University: Natural Science Edition (《内蒙古大学学报(自然科学版)》)
CAS
CSCD
Peking University Core Journals (北大核心)
2007, No. 2, pp. 173-180 (8 pages)
Funding
National Natural Science Foundation of China (Grant No. 30560039)
Keywords
translation initiation site
ribosome scanning model
position weight matrix
open reading frame