期刊文献+

高维特征选择方法在近红外光谱分类中的应用 被引量:17

High dimensional feature selection in near infrared spectroscopy classification
下载PDF
导出
摘要 针对卷烟近红外光谱高噪和高冗余特点,提出了一种基于随机森林(RF)和主成分分析(PCA)的特征优选方法 RF-PCA,建立了5种不同质量级别卷烟的分类模型,并和其他方法进行了比较。该方法能够有效地对高维数据样本进行分类,用于甄别卷烟品质真伪。特征选择可以过滤与分类不相关的特征,而通过PCA方法可以消除冗余特征的不良影响,并可进一步降低特征维数。实验表明:RF-PCA方法能有效地剔除近红外光谱数据中的噪声特征和冗余特征,提高了分类效率。 With regard to the large number of irrelevant and redundant features exist in the near infrared spectra, a novel feature selection method based on random forest and principal component analysis (RF- PCA) was proposed in this paper. By using the RF-PCA, a classification model of cigarettes qualitative evaluation was developed and also compared with other methods. The result shows that RF-PCA effectively classifies the samples of high dimensional data and can be used to evaluate quality and authenticity of the cigarettes. RF feature selection removes irrelevant features of the classification, while PCA further eliminates the influence of redundant features and also reduces the feature dimensionalities. The experiments show that RF-PCA effectively removes noise and redundant features in the NIR spectra and the classification accuracy is improved as well.
出处 《红外与激光工程》 EI CSCD 北大核心 2013年第5期1355-1359,共5页 Infrared and Laser Engineering
基金 科技部创新基金(06C26213710334)
关键词 近红外光谱 特征选择 随机森林 主成分分析 卷烟 NIR spectra feature selection RF PCA cigarettes
  • 相关文献

参考文献12

  • 1刘旭,陈华才,刘太昂,李银玲,陆治荣,陆文聪.PCA-SVR联用算法在近红外光谱分析烟草成分中的应用[J].光谱学与光谱分析,2007,27(12):2460-2463. 被引量:34
  • 2Hana M, Mcclure W F, Whitaker T B. Applying artificial neural networks II. Using near infrared data to classify tobacco types and identify native grown tobacco [J]. Journal of Near Infrared Spectroscopy, 1997, 5: 19-25.
  • 3唐雪梅,张薇,李慧.卷烟真伪鉴别的近红外定性分析方法[J].烟草科技,2008,41(11):5-8. 被引量:23
  • 4Bylesjo M, Rantalainen M, Nicholson J K, et al. K-OPLS package: Kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space [J]. BMC Bioinformaties, 2008, 9(1): 106-112.
  • 5Boaz Nadler, Coifman Ronald R. The prediction error inCLS and PLS: the importance of feature selection prior to multivariate calibration [J]. Journal of Chemometrics, 2005, 19(2): 107-118.
  • 6Leo Breiman. Random forests [J]. Machine Learning, 2001, 45(1): 5-32.
  • 7Statnikov A, Wang L, Aliferis C F. A comprehensive comparison of random forests and support vector machines for microarray based cancer classification [J]. BMC Bioinformatics, 2008, 9: 319-323.
  • 8Menze B H, Petrich W, Hamprecht F A. Multivariate feature selection and hierarchical classification for infrared spectroscopy: serum-based detection of bovine spongiform encephalopathy [J]. Analytical and Bioanaytical Chemistry, 2007, 387(5): 1801-1807.
  • 9Efron B, Tibshirani R J. Bootstrap measures for standard errors, confidence interval and other measures of statistical accuracy[J]. Statistical Science, 1986, 1(1): 54-74.
  • 10Menze B H, Kelm B M, Masuch R, et al. A comparison of random forest ant its Gini importance with standard chemometric methods for the feature selection and classification of spectral data[J]. BMC Bioinformatics, 2009, 10:1-16.

二级参考文献19

  • 1陈鹰,丁映,乐俊明.烟草中三种主要成分的近红外光谱分析与化学分析方法比较[J].贵州农业科学,2004,32(5):72-72. 被引量:5
  • 2张录达,苏时光,王来生,李军会,杨丽明.支持向量机(SVM)在傅里叶变换近红外光谱分析中的应用研究[J].光谱学与光谱分析,2005,25(1):33-35. 被引量:47
  • 3乐俊明,陈鹰,丁映.近红外光谱分析法测定烟草化学成分[J].贵州农业科学,2005,33(3):62-63. 被引量:38
  • 4国家烟草专卖局.卷烟产品鉴别检验规程[G].2006.
  • 5赵龙莲 闵顺耕 严衍禄 等.傅里叶变换的红外光谱法测定烟草中九种品质参数[J].光谱学与光谱分析,1998,13(4):89-89.
  • 6严衍录,赵龙莲,杨曙明,等.近红外光谱分析基础与应用[M].北京:中国轻工业出版社,2005.
  • 7Fawky Abdallah,Translated by MIU Ming-ming(缪明明,译).Cigarette Product Development(卷烟产品开发).Kunming: Yunnan Science and Technology Press(昆明:云南科技出版社),2004..
  • 8LU Wan-zhen, YUAN Hong-fu, XU Guang-tong(陆婉珍,袁洪福,徐广通).The Modern Analysis Technique for Near Infrared Spectroscopy Technology(现代近红外光谱分析技术).Bejing: China Petrochemical Press(北京:中国石化出版社),2000.
  • 9Sielsler H W, Ozaki Y, Kawata S, et al. Near-Infrared Spectroscopy: Principles, Instruments, Applications. West Sussex: NIR Publications, 2002.
  • 10Blanco M, Coello J, Iturriaga H, et al. Anal. Chim. Acta, 1999, 384(2): 207.

共引文献51

同被引文献168

引证文献17

二级引证文献73

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部