期刊文献+

化学计量学方法用于蛋白质组学质谱数据的特征筛选 被引量:1

Feature selection from proteomic mass spectrometric data using chemometric methods
原文传递
导出
摘要 提出了一种基于偏最小二乘判别分析和F-score的特征筛选方法,并将其用于蛋白质组学质谱数据分析。方法主要包含3个步骤:(1)用LIMPIC算法对原始数据进行预处理;(2)计算每个变量的F-score值并将所有变量按F-score值降底的顺序排列;(3)采用偏最小二乘判别分析交互检验按前向选择法选择最佳变量子集。用本方法对一组卵巢癌数据进行分析,最终从原始的15154个质荷比变量中选择了12个特征变量作为潜在生物标记物,它们在训练集上交叉检验的特异性和灵敏度分别为98.36%和98.15%,在独立测试集上的特异性和灵敏度分别为96.67%和100%。用筛选出的变量作PCA所得的结果显示这些变量能够较好地将样本分类,说明能够反映出样本的类别信息。所提出的方法可用于蛋白质组学质谱数据的特征筛选及样本分类。 A feature selection method based on F-score and partial least square discriminant analysis (PLS-DA) was proposed and used for proteomic mass spectrometric (MS) data analysis and potential biomarker discovery. The method mainly includes 3 steps : ( 1 ) spectra preprocessing with LIMPIC algorithm; (2) calculating the F- score values for each variable and sorting them according to their F-score values in descending order; and (3) determination of the optimum feature set with PLS-DA cross validation in a forward stepwise selection manner. An ovarian cancer dataset was analyzed with the proposed method. As results, 12 m/z locations were selected as potential biomarkers. The features could distinguish the disease samples from healthy controls on the independent test sets with sensitivity of 100% and specificity of 96. 67%. The results show that the method proposed in this study is available for classification feature selection from proteomic MS data.
出处 《分析试验室》 CAS CSCD 北大核心 2012年第10期106-109,共4页 Chinese Journal of Analysis Laboratory
关键词 蛋白质组学 质谱 特征选择 F-score 偏最小二乘判别分析 Proteomic Mass spectra Feature selection F-score Partial least square-discriminant analysis
  • 相关文献

参考文献14

  • 1Abeel T, Helleputte T, Van de Peer Y, et al. Bioinformatics, 2010, 26(3) : 392.
  • 2Saeys Y, Inza I, Larra aga P. Bioinformatics, 2007, 23(19) : 2507.
  • 3Roy P,Truntzer C, Maucort-Boulch D, et al. Briefings in bioinformatics, 2010 : doi: 10. 1093/bib/bbqlO19.
  • 4Liu H, Li J, Wong L. Genome Informaticsa Series, 2002, 13(1) : 51.
  • 5谢娟英,王春霞,蒋帅,张琰.基于改进的F-score与支持向量机的特征选择方法[J].计算机应用,2010,30(4):993-996. 被引量:29
  • 6Heinzmann S S, Brown I J, Chan Q, et al. Am J Clin Nutrition, 2010, 92(2):436.
  • 7Legido-Quigley C, Stella C, Perez-Jimenez F, et al. Biomed Chromatogr, 2010, 24(7): 737.
  • 8Zeng M, Liang Y, Li H, et al. Anal Methods, 2011, 3(2) : 438.
  • 9Yuan D, Liang Y, Yi L, et al. Chemom Intell Lab Syst, 2008, 93 ( 1 ) : 70.
  • 10Mantini D, Petrucci F, Pieragostino D, et al. BMC bioinformatics, 2007, 8(I): 101.

二级参考文献11

  • 1LEE M C.Using support vector machine with a hybrid feature selection method to the stock trend prediction[J].Expert Systems with Applications,2009,36(8):10896-10904.
  • 2MALDONADO S,WEBER R.A wrapper method for feature selection using support machines[J].Information Sciences,2009,179(13):2208-2217.
  • 3LIU Y,ZHENG Y F.FS_SFS:A novel feature selection method for support vector machines[J].Pattern Recognition,2006,39 (7):1333-1345.
  • 4HUA J P,TEMBE W D,DOUGHERTY E R.Performance of feature-selection methods in the classification of high-dimensian data[J].Pattern Recognition,2009,42(3):409-424.
  • 5GUNAL S,GEREK O N,ECE D G,et al.The search for optimal feature set in power quality event classification[J].Expert Systems with Applications,2009,36(7):10266-10273.
  • 6WIDODO A,YANG B S.Application of nonlinear feature extraction and support vector machines for fault diagnosis of induction motors[J].Expert Systems with Applications,2007,33(1):241-250.
  • 7GUYON I,ELISSEEFF A.An introduction to variable and feature selection[J].Machine Learning Research,2003,3:1157-1182.
  • 8TALAVERA L.An evaluation of filter and wrapper methods for feature selection in categorical clustering[C]// Proceedings of 6th International Symposium on Intelligent Data Analysis.Madrid:Springer,2005:440-451.
  • 9CHEN Y W,LIN C J Combining SVMs with various feature selection strategies[EB/OL].[2009-08-10].http://www.csie.ntu.edu.tw/-cjlin/papere/features.pdf.
  • 10VAPNIK V N.The nature of statistical learning theory[M].New York:Springer,1995.

共引文献28

同被引文献15

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部