期刊文献+

基因剪切位点的统计分析研究

Research on Statistical Analysis of Gene Splicing Sites
下载PDF
导出
摘要 真核生物的基因由若干外显子和内含子交替组成,外显子序列在转录后保留,而内含子序列转录过程中被剪切掉。大量分子生物学实验验证基因的剪切位点遵从GT-AG规则,然而只有很少的含GT或AG序列是真剪切位点,目前预测的准确程度仍有待提高。本研究下载了HS3D剪切位点训练数据集,对启动子剪切位点附近的序列进行了统计分析研究。当真、假序列长度在剪切位点左旁和右旁均超出各七个位点时,序列呈现很高的特异性,可以使用这些特异性序列作为特征进行训练,从而准确地识别真假剪切位点。 The genes of eukaryotes are composed of several exons and introns. After transcript process, sequences of exons are retained, while sequences of introns are cleaved off. A large number of experiments of molecular biology validate that the splicing sites between exon and intron follow the rule of GT-AG, only a few GT or AG sequences are true splicing sites, and the accuracy of the prediction still needs to be improved. In this study, the training dataset of splicing site of HS3D was downloaded, and a statistical analysis of the sequence near the splicing site of the promoter was carried out. The sequence showed high specificity when the true and false sequence lengths of the left splicing site side and right splicing site side were both more than seven, which was helpful to train the sequences characters so as to accurately identify the true and false splicing sites.
出处 《计算生物学》 2016年第3期41-49,共9页 Hans Journal of Computational Biology
基金 陕西省科技厅社会发展科技攻关项目基金(2016SF-343)资助。
  • 相关文献

参考文献1

二级参考文献6

  • 1王化军,生物物理学报,1989年,5卷,422页
  • 2Qian N,J Mol Biol,1988年,202卷,865页
  • 3孙键,高技术通讯,1991年,1卷,8期,1页
  • 4王化军,生物物理学报,1991年,7卷,157页
  • 5陈润生,生物物理学报,1990年,6卷,267页
  • 6孙键,凌伦奖,陈润生.用神经网络法预测同源蛋白质的三级结构[J].高技术通讯,1991,1(4):1-4. 被引量:1

共引文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部