MiRscreen: a prediction model for microRNA precursors using a genetic algorithm and support vector machines


Abstract

Objective: To construct a prediction model for microRNA precursors (pre-miRNAs) with high sensitivity and high specificity.

Methods: MiRscreen, a support vector machine (SVM) classifier that distinguishes pre-miRNAs from pseudo pre-miRNAs, was trained on 300 experimentally validated human pre-miRNAs (positive samples) and 300 negative samples randomly selected from 3′ UTR fragments that fold into stem-loop structures. To improve classifier performance, a genetic algorithm was used to search for C and γ, two parameters that strongly affect SVM performance.

Results and conclusion: On the training set, sensitivity was 99.33% and specificity was 100%. On the remaining 91 human pre-miRNAs and 91 pseudo pre-miRNAs from 3′ UTRs, sensitivity and specificity were 91.21% (83/91) and 93.41% (85/91), respectively. Of 1,353 experimentally validated pre-miRNAs from 20 animal and virus species other than human, MiRscreen correctly identified 1,192, a sensitivity of 88.10%; sensitivity reached 100% for eight species: Marek's disease virus, rhesus lymphocryptovirus, Epstein-Barr virus, simian virus 40, Xenopus laevis, Canis familiaris, Ovis aries, and Macaca mulatta. On a mixed negative set of 1,353 pseudo pre-miRNAs (556 folded from 100 randomly selected RefSeq genes and 797 randomly selected from those folded from human chromosome 19), specificity was 85.14% (1,152/1,353). Compared with six previously proposed methods of the same kind, MiRscreen performed well in both sensitivity and specificity on the independent test dataset and had the highest classification accuracy, 86.62%, more than 6% above the other methods. Its AUC of 0.938 was also clearly higher than that of each of the other six methods. MiRscreen can therefore facilitate the experimental identification of pre-miRNAs.
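The abstract states that a genetic algorithm searches for the SVM parameters C and γ but gives no details of the encoding or operators. The following is only a minimal sketch of that kind of search, not the authors' implementation: the log2 encoding of the two parameters, the search bounds, the tournament selection, the arithmetic crossover, the Gaussian mutation, and the names `genetic_search`, `toy_fitness`, and `BOUNDS` are all assumptions made for illustration. In real use the fitness function would be something like the cross-validated accuracy of an SVM trained with C = 2^x and γ = 2^y; here a synthetic surrogate with a known optimum stands in for it so the block runs without any data.

```python
import random
from typing import Callable, Sequence, Tuple

Genome = Tuple[float, float]  # assumed encoding: (log2 C, log2 gamma)


def genetic_search(
    fitness: Callable[[Genome], float],
    bounds: Sequence[Tuple[float, float]],
    pop_size: int = 20,
    generations: int = 40,
    sigma: float = 0.5,
    elite: int = 2,
    seed: int = 0,
) -> Genome:
    """Maximise `fitness` inside `bounds` with a simple real-coded GA."""
    rng = random.Random(seed)
    population = [
        tuple(rng.uniform(lo, hi) for lo, hi in bounds) for _ in range(pop_size)
    ]
    for _ in range(generations):
        # Score each individual once per generation; a real fitness
        # (cross-validation of an SVM) would be expensive to repeat.
        scores = {ind: fitness(ind) for ind in population}
        ranked = sorted(population, key=scores.__getitem__, reverse=True)
        next_gen = ranked[:elite]  # elitism: carry the best over unchanged
        while len(next_gen) < pop_size:
            # Tournament selection: best of three random individuals.
            p1 = max(rng.sample(ranked, 3), key=scores.__getitem__)
            p2 = max(rng.sample(ranked, 3), key=scores.__getitem__)
            # Arithmetic crossover, then Gaussian mutation clipped to bounds.
            w = rng.random()
            child = tuple(
                min(hi, max(lo, w * a + (1 - w) * b + rng.gauss(0.0, sigma)))
                for a, b, (lo, hi) in zip(p1, p2, bounds)
            )
            next_gen.append(child)
        population = next_gen
    return max(population, key=fitness)


def toy_fitness(genome: Genome) -> float:
    """Synthetic surrogate (NOT an SVM): peaks at log2 C = 3, log2 gamma = -5."""
    log2_c, log2_gamma = genome
    return -((log2_c - 3.0) ** 2 + (log2_gamma + 5.0) ** 2)


# Hypothetical search box for (log2 C, log2 gamma).
BOUNDS = [(-5.0, 15.0), (-15.0, 3.0)]

best = genetic_search(toy_fitness, BOUNDS)
print(f"C = {2 ** best[0]:.3g}, gamma = {2 ** best[1]:.3g}")
```

To apply the sketch to a real classifier, `toy_fitness` would be replaced by a function that trains and cross-validates an SVM for the given pair of parameters and returns its accuracy.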
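The percentages in the abstract can be checked directly from the counts it reports. The sketch below recomputes sensitivity, specificity, and accuracy for the held-out human set and for the cross-species test set; the class and variable names are illustrative, and treating the 1,353 positive and 1,353 negative samples as one pooled test set is an assumption, although the pooled accuracy it yields agrees with the reported 86.62%.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Confusion-matrix counts for a binary classifier."""
    tp: int  # true pre-miRNAs predicted as pre-miRNAs
    fn: int  # true pre-miRNAs missed
    tn: int  # pseudo pre-miRNAs correctly rejected
    fp: int  # pseudo pre-miRNAs predicted as pre-miRNAs

    def sensitivity(self) -> float:
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        return self.tn / (self.tn + self.fp)

    def accuracy(self) -> float:
        return (self.tp + self.tn) / (self.tp + self.fn + self.tn + self.fp)


# Held-out human set: 83 of 91 positives and 85 of 91 negatives correct.
human_test = Counts(tp=83, fn=91 - 83, tn=85, fp=91 - 85)

# Cross-species positives (1,192 of 1,353) pooled with the mixed
# negatives (1,152 of 1,353); pooling is an assumption of this sketch.
independent_test = Counts(tp=1192, fn=1353 - 1192, tn=1152, fp=1353 - 1152)

for name, c in [("human held-out", human_test), ("independent", independent_test)]:
    print(
        f"{name}: sensitivity {100 * c.sensitivity():.2f}%, "
        f"specificity {100 * c.specificity():.2f}%, "
        f"accuracy {100 * c.accuracy():.2f}%"
    )
```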

Authors: Hou Yanyan (侯妍妍), Li Hua (李华), Ying Xiaomin (应晓敏), Li Wuju (李伍举)

Affiliation: Center of Computational Biology, Institute of Basic Medical Sciences, Academy of Military Medical Sciences

Source: Bulletin of the Academy of Military Medical Sciences (军事医学科学院院刊), 2008, Issue 3, pp. 287-292 (6 pages). Indexed in CSCD and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (grants 30500105 and 30470411)

Keywords: microRNAs; classification; genetic algorithm; support vector machines

Classification code: Q75 [Biology: Molecular Biology]

