Abstract
MicroRNAs (miRNAs) are a class of endogenous non-coding small RNAs that play an important regulatory role in organisms, and predicting miRNAs helps in studying and understanding their biological functions. A previously proposed dimensionality reduction algorithm based on pairwise constraints, local semi-supervised linear discriminant analysis (LSLDA), reduces the dimensionality of miRNA data while preserving the local structure information and discriminative power of the data, which can improve miRNA prediction. Building on LSLDA, this paper proposes an ensemble of local semi-supervised linear discriminant analysis (En-LSLDA) algorithm. The algorithm integrates the classification results obtained with different numbers of constraints and takes the integrated result as the final classification, further improving miRNA prediction. Experimental results on miRNA data sets show that the En-LSLDA algorithm is effective and feasible. Experimental results on UCI data sets further indicate that the proposed ensemble method also applies to other data sets.
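The integration step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which combination rule En-LSLDA uses, so plain majority voting is assumed here purely for illustration; the function name `majority_vote`, the label encoding (1 for miRNA, 0 otherwise), and the example predictions are all hypothetical and not taken from the paper.

```python
from collections import Counter


def majority_vote(predictions_per_setting):
    """Combine label predictions from several runs by majority vote.

    predictions_per_setting: list of lists. Each inner list holds the labels
    predicted for the same samples under one number of constraints.
    NOTE: majority voting is an assumed rule; the paper may combine differently.
    """
    if not predictions_per_setting:
        raise ValueError("need at least one set of predictions")
    n_samples = len(predictions_per_setting[0])
    if any(len(p) != n_samples for p in predictions_per_setting):
        raise ValueError("all prediction lists must cover the same samples")

    combined = []
    # zip(*...) groups the votes that every run cast for one sample.
    for votes in zip(*predictions_per_setting):
        label, _count = Counter(votes).most_common(1)[0]
        combined.append(label)
    return combined


# Hypothetical predictions for four samples under three constraint counts.
runs = [
    [1, 0, 1, 1],  # e.g. classifier trained after reduction with few constraints
    [1, 1, 0, 1],  # more constraints
    [0, 0, 1, 1],  # most constraints
]
final_labels = majority_vote(runs)
print(final_labels)
```

Using an odd number of runs avoids ties for a two-class problem such as miRNA versus non-miRNA.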
Source
Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》)
CSCD
2011, Issue 10, pp. 921-931 (11 pages)
Funding
National Natural Science Foundation of China, No. 60873176
Natural Science Foundation of Jiangsu Province, No. BK2008430
Keywords
miRNA
pairwise constraints
dimensionality reduction
ensemble
prediction