MicroRNA前体的特征选择方法

Feature Selection Method for Pre-miRNAs

下载PDF

导出

摘要 microRNA(miRNA)是一类长度约为21nt的非编码RNA,具有重要的调控功能。miRNA前体包含一级序列特征和二级结构特征,其中含有冗余和无用的特征,这些特征无益于前体分类模型的分类准确度。因此需要去除冗余特征,进而降低特征维数并提高分类性能。针对miRNA的前体序列数据,已有特征选取方法,仅考虑了特征之间的区分距离。全面考虑了每个特征属性对分类的增益和特征间冗余性,选取的特征有助于建立高效的分类模型。实验结果表明,选取的特征子集有效地提高了miRNA前体分类器的预测性能,取得了更好的分类结果。 MicroRNAs（miRNAs） are a set of short（about 21nt） non-coding RNAs that have important regulation function.Pre-miRNAs have a lot of features based on primary sequences and secondary structure,some of which are redundant and useless for classification of pre-miRNAs.Therefore,the redundant features should be eliminated to decrease the feature dimension and improve the classification accuracy. In terms of pre-miRNAs,almost all the previous methods only consider the distance between two features.This paper considers information gain and feature redundancy.The selected features are useful for constructing efficient classification model.The experimental result indicates the selected feature subset could improve the prediction performance of pre-miRNA classification model and achieve better classification result.

作者玄萍郭茂祖吴玲王姗姗张兆功李媛

机构地区哈尔滨工业大学计算机科学与技术学院黑龙江大学计算机科学技术学院

出处《智能计算机与应用》 2012年第6期1-3,10,共4页 Intelligent Computer and Applications

基金国家自然科学基金(60932008 61172098 61271346) 高等学校博士学科点专项科研基金(20112302110040) 中央高校基本科研业务费专项资金(HIT.ICRST.2010 022) 黑龙江省自然科学基金项目(F201119) 黑龙江省教育厅科学技术研究项目(12521392 12511401) 哈尔滨市青年科技创新人才项目(2012RFQXS094) 黑龙江大学青年科学基金项目(QL201029)

关键词 miRNA前体特征选择信息增益特征冗余性 Pre-miRNA Feature Selection Information Gain Feature Redundancy

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1BABTEL D P. MicroRNAs:genomics,biogenesis,mechanism,and function[J].Cell,2004,(02):281-297.
2BUSHATI N,COHEN S M. MicroRNA functions[J].Annual Review of Cell and Developmental Biology,2007.175-205.
3PFEFFER S,ZAVOLAN M,GRASSER F A. Identification of virus-encodod microRNAs[J].Science,2004,(5671):734-736.
4KLS N,MISHRA S K. De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures[J].Bioinformatics,2007,(11):1321-1330.
5BATUWTTA R,PALADE V. MicroProd:effective classification of pre-miBNAs for human miRNA gene prediction[J].Bioinformatics,2009,(08):989-995.
6XUE C,LI F,HE T. Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine[J].BMC Bioinformatics,2005.310.
7XUAN P,GUO M,LIU X. PlantMiRNAPred:efficient classification of real and pseudo plant pre-miRNAs[J].Bioinformatics,2011,(10):1368-1376.
8NAM J,SHIN K,HAN J. Human microRNA prediction through a probabilistic co-learning model of sequence and structure[J].Nucleic Acids Research,2005,(11):3570-3581.
9YOUSEF M,NEBOZHYN M,SHATKAY H. Combining multi-Species genomic data for microRNA identification using a nalve bayes classifier machine learning for identification of microrna genes[J].Bioinformatics,2006,(11):1325-1334.
10JIANG P,WU H,WANG W. MiPred:classification of real and pseudo microRNA precursors using random forest prediction model with combined features[J].Nucleic Acids Research,2007.W339-W344.

1刘嘉,李俊,苗涛.粒子群算法实现的防爆正压柜控制器设计[J].自动化仪表,2013,34(12):41-43. 被引量：2
2刘向宇.EPSON(爱普生)LQ-2520型彩色打印机的结构原理与解析(下)[J].家电检修技术（资料版）,2008(6):21-22.
3dream.鼠标变身音量控制器[J].电脑迷,2008,0(1):44-44.
4看完星星看月亮.挖掘鼠标的潜在价值驱动软件的应用解析[J].现代计算机（中旬刊）,2010(8):126-128.
5胡小燕,陈庆锋.miRNAs基因的结构保守模式挖掘[J].平顶山学院学报,2012,27(2):53-58.
6王金波,刘斌,郝亮,任珂珂.图像处理及镜头自动调控电路设计[J].激光与红外,2013,43(1):58-61.
7杜江峰,武永军,张银霞,吴磊,陶士珩.信息熵方法大批量分析成熟体及前体miRNA序列的重要位点[J].中国科学（C辑）,2009,39(4):420-428.
8史巧硕,马岱,米少华.基于蚁群和支持向量机的microRNA预测方法[J].河北工业大学学报,2012,41(1):5-8. 被引量：1
9赵美琳.自动控制原理与人体生理功能的自动调控[J].阜阳师范学院学报（自然科学版）,2005,22(2):31-33.
10邹权,郭茂祖,刘扬,王峻.类别不平衡的分类方法及在生物信息学中的应用[J].计算机研究与发展,2010,47(8):1407-1414. 被引量：26

智能计算机与应用

2012年第6期

浏览历史

内容加载中请稍等...

MicroRNA前体的特征选择方法

参考文献14

相关作者

相关机构

相关主题

浏览历史