期刊文献+

基于邻接区域交叠概率的特征选择方法 被引量:8

Feature Selection Method Based on Overlapped Probability of Intersection Area
下载PDF
导出
摘要 针对传统特征选择判据计算量大、需要先验知识以及应用效果不佳的缺点,根据分类错误通常发生在类别之间的邻接区域(贝叶斯决策分界面将穿过该邻接区域)的特点,提出基于邻接区域交叠概率的特征选择判据。该判据通过计算案例样本点落在类别邻接区域中的概率来选择特征,具有从样本中能直接计算并且选择出多个特征组合等优点。通过对标准机器学习数据集WINE的实际应用表明,该判据选择出的特征组合的聚类效果明显好于类内类间判据选择出的特征组合。对轴承故障数据进行特征选择时,该判据能提供多种多个特征组合供选择,其选择的垂直和水平振动特征组合符合工程应用的实际需要,远好于类内类间判据选择的特征组合。 Aiming at the shortcomings of large amount of calculation, needing prior knowledge and poor application effect of traditional feature selection criterion, and according to the trait of classification error usually occuring in intersection area between categories(Bayesian decision-making interface will pass through the intersection area), a feature selection criterion based on the overlapped probability of intersection area is put forward. The criterion selects features by calculating the probability of sample point falling into the category intersection area, and the advantages of it are calculating directly from the samples and choosing a number of features, etc. The practical application of standard machine learning data sets WINE shows that the clustering effect of feature combination selected by the criterion is better than within-category and between-category criterion. When selecting the beating failure data, the criterion can provide several feature combination, and the selected vertical and horizontal vibration feature combinations meet the actual needs of engineering application, which is better than the feature combination selected by within-category and between-category criterion.
出处 《机械工程学报》 EI CAS CSCD 北大核心 2009年第2期114-118,共5页 Journal of Mechanical Engineering
基金 国家自然科学基金(50335030) 国家高科技研究发展计划(863计划 2007AA04Z432) 苏州市工业科技攻关(SG0729)资助项目
关键词 特征选择 类别可分性 贝叶期错误概率 Feature selection Category separability Bayes error probability
  • 相关文献

参考文献4

  • 1JOHN G H, KOHAVI R, PFLEGER K. Irrelevanl features and the subset selection problem[C]//Machine Learning: Proceedings of the Eleventh International Conference, 1994: 121-129.
  • 2史东锋,屈梁生.遗传算法在故障特征选择中的应用研究[J].振动.测试与诊断,2000,20(3):171-176. 被引量:31
  • 3BLAKE C L, MERZ C J. UCI repository of machine learning databases[EB/OL]. [2007-05-17]. http: //www. ics.uci, edu/-mleam/MLRepository.html.
  • 4NOWICKI R, SLOWINSKI R, STEFANOWSKI J. Evaluation of vibroacoustic diagnostic symptoms by means of the rough sets theory[J]. Knowledge Engineering: Elsevier Computers in Industry, 1992, 20: 141-152.

二级参考文献1

  • 1孟建.回转机械故障诊断特征提取的若干前沿技术研究:博士论文[M].西安:西安交通大学,1996..

共引文献30

同被引文献84

引证文献8

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部