期刊文献+

基于filter+wrapper模式的特征选择算法 被引量:20

Feature selection algorithm based on filter + wrapper pattern
下载PDF
导出
摘要 特征选择是数据挖掘、机器学习和模式识别中始终面临的一个重要问题。针对类和特征分布不均时,传统信息增益在特征选择中存在的选择偏好问题,提出了一种基于信息增益率与随机森林的特征选择算法。该算法结合filter和wrapper模式的优点,首先从信息相关性和分类能力两个方面对特征进行综合度量,然后采用序列前向选择(sequential forward selection,SFS)策略对特征进行选择,并以分类精度作为评价指标对特征子集进行度量,从而获取最优特征子集。实验结果表明,该算法不仅能够达到特征空间降维的效果,而且能够有效提高分类算法的分类性能和查全率。 Feature selection is one of the most important issues in data mining,machine learning and pattern recognition.Aiming at the problem of preference of traditional information gain algorithm in feature selection when the class and feature are unevenly distributed,this paper proposed a new feature selection algorithm based on information gain ratio and random forest.The proposed algorithm combined with the advantages of filter and wrapper modes.Firstly,it carried out a comprehensive mea- surement of features from two aspects of information correlation and classification ability.Secondly,it used sequential forward selection (SFS) strategy to select the features,and used the classification accuracy as the evaluation index to measure the feature subset.Finally,it obtained the optimal feature subset.The experimental results show that the proposed algorithm can not only achieve the effect of dimension reduction in feature space,but also effectively improve the classification performance and recall rate of classification algorithm.
作者 周传华 柳智才 丁敬安 周家亿 Zhou Chuanhua;Liu Zhicai;Ding Jing’an;Zhou Jiayi(School of Management Science & Engineering,Anhui University of Technology,Maanshan Anhui 243002,China;School of Computer Science & Technology,University of Science & Technology of China,Hefei 230026,China;Graduate School of Information,Production & Systems,Waseda University,Tokyo,Japan)
出处 《计算机应用研究》 CSCD 北大核心 2019年第7期1975-1979,2010,共6页 Application Research of Computers
基金 国家自然科学基金资助项目(71371013,71772002) 安徽省留学人员创新项目择优资助计划(2016)
关键词 信息增益率 随机森林 特征选择 filter模式 wrapper模式 information gain ratio random forest feature selection filter mode wrapper mode
  • 相关文献

参考文献7

二级参考文献44

共引文献335

同被引文献254

引证文献20

二级引证文献70

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部