Identification method of splice sites using comprehensive information (cited by: 2)


Abstract: To improve the accuracy of splice site identification, a method based on comprehensive information is proposed. By analyzing the splicing signals, the splicing sequences, the secondary structure of the flanking sequences, and the mechanisms of action of splicing factors at donor and acceptor sites, a signal model and a sequence model are built for each of the two site types. The Mfold package in the Vienna software is used to predict the most stable secondary structure of the sequence flanking each splice site, and the traditional four-letter nucleotide alphabet is converted into an eight-letter alphabet, so that every sequence is described with eight letters. The resulting sequence-structure strings are used to train the signal and sequence models, and the trained models are then applied to recognize splice sites. Experimental results show that the method identifies splice sites well, with a recognition accuracy above 95%.
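The conversion from a four-letter to an eight-letter alphabet can be illustrated with a short sketch. The abstract does not state the actual mapping, so the encoding below is an assumption made only for illustration: each base is written in uppercase when it is unpaired in the predicted structure and in lowercase when it is paired, which yields eight symbols in total. The function name `encode_with_structure` and the dot-bracket input format are likewise hypothetical choices, not taken from the paper.

```python
# Illustrative sketch only: the eight-letter mapping used here
# (uppercase = unpaired, lowercase = paired) is an assumption,
# not the encoding defined in the paper.

VALID_BASES = set("ACGU")
PAIRED = set("()")      # dot-bracket symbols for a base inside a pair
UNPAIRED = "."          # dot-bracket symbol for an unpaired base


def encode_with_structure(seq: str, dot_bracket: str) -> str:
    """Merge a nucleotide sequence and its dot-bracket structure
    into a single string over an eight-letter alphabet."""
    if len(seq) != len(dot_bracket):
        raise ValueError("sequence and structure must have equal length")

    encoded = []
    for base, symbol in zip(seq.upper().replace("T", "U"), dot_bracket):
        if base not in VALID_BASES:
            raise ValueError(f"unexpected base: {base!r}")
        if symbol in PAIRED:
            encoded.append(base.lower())   # paired base -> a, c, g, u
        elif symbol == UNPAIRED:
            encoded.append(base)           # unpaired base -> A, C, G, U
        else:
            raise ValueError(f"unexpected structure symbol: {symbol!r}")
    return "".join(encoded)


if __name__ == "__main__":
    # A small hairpin: three paired bases, a three-base loop, three paired bases.
    print(encode_with_structure("GGGAAACCC", "(((...)))"))
```

Strings encoded this way carry both the base identity and its pairing state at every position, so any model trained on them sees the structural context along with the sequence.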

Authors: Wang Kejun, Lü Junjie, Feng Weixing, Wang Xin

Affiliations: College of Automation, Harbin Engineering University; Cancer Molecular Research Center, University of Cambridge

Source: Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, No. 3, pp. 111-114 (4 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

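Funding: National Natural Science Foundation of China (61071174); National High Technology Research and Development Program of China (2008AA01Z148); Heilongjiang Province Science Fund for Distinguished Young Scholars (JC200703)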

Keywords: bioinformatics; splice sites; splice signal; alternative splicing; secondary structure

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]


