期刊文献+

特征选择方法研究综述 被引量:26

A review of feature selection methods
下载PDF
导出
摘要 在大数据时代,特征选择是对数据进行预处理的必要环节.特征选择作为一种数据降维技术,其主要目的是从原始数据中选择出对算法最有益的相关特征,降低数据的维度和学习任务的难度,提升模型的效率.现阶段,有关特征选择算法方面的研究已取得阶段性成效,但也面临着重大挑战,其中维度灾难就是特征选择与分类问题所面临的重大挑战.首先,介绍了特征选择算法的基本架构,依次描述了子集的生成、子集的评估、终止条件、结果验证四个过程;其次,综述了特征选择领域的研究方法及研究成果,对特征选择方法分别依据评价策略、搜索策略、监督信息进行分类阐述,并对这些传统方法进行比较,指出它们的优势和不足;最后对特征选择进行了总结,并对其未来的研究方向进行了展望. In the era of big data,feature selection is a necessary part of data preprocessing.Feature selection is a data dimensionality reduction technology.Its main purpose is to select the most beneficial relevant features for the algorithm from the original data,reduce the dimensionality of the data and the difficulty of learning tasks,and improve the efficiency of the model.At this stage,the research on feature selection algorithms has achieved initial results,but it is also facing major challenges.Among them,the disaster of dimensionality is the major challenge faced by the feature selection and classification problem.First,the basic architecture of the feature selection algorithm is introduced,and the four processes of the generation of the subset,the evaluation of the subset,the termination condition,and the result verification are described in sequence Second,the methods are classified according to evaluation strategy,search strategy,and supervision information respectively,and compare these traditional methods,and point out their advantages and disadvantages.Finally,the feature selection is summarized and its future research direction is prospected.
作者 施启军 潘峰 龙福海 李娜娜 苟辉朋 苏浩辀 谢雨寒 SHI Qijun;PAN Feng;LONG Fuhai;LI Nana;GOU Huipeng;SU Haozhou;XIE Yuhan(School of Data Science and Information Engineering,Guizhou Minzu University,Guiyang 550025,China)
出处 《微电子学与计算机》 2022年第3期1-8,共8页 Microelectronics & Computer
关键词 机器学习 特征选择 评价策略 搜索策略 监督信息 machine learning feature selection,evaluation strategy search strategy supervision information
  • 相关文献

参考文献7

二级参考文献58

  • 1赵云,刘惟一.基于遗传算法的特征选择方法[J].计算机工程与应用,2004,40(15):52-54. 被引量:16
  • 2PANLi,ZHENGHong,ZHANGZuxun,ZHANGJianqing.Genetic Feature Selection for Texture Classification[J].Geo-Spatial Information Science,2004,7(3):162-166. 被引量:6
  • 3Zhang M L,Zhou Z-H.M3MIML:A maximum margin methodfor multi-instance multi-label learning [C]. Pisa, Italy: Procee- dings of the 8th IEEE International Conference on Data Mining, 2008:688-697.
  • 4Sun Y-Y, Zhang Y, Zhou Z-H. Multi-label learning with weak label[C].Atlanta,GA:Proceedings of the 24th AAAI Conference on Artificial Intelligence,2010:593-598.
  • 5Tsousmakas G,Zhang M L,Zhou Z H.Leaming from multi-labeldata[C]. Bled,S Iovenia:ECML/PKDD'09,2009.
  • 6Zhang M-L,Zhou Z-H.Multi-label learning by instance differen- tiation [C]. Vancouver, Canada: Proceedings of the 22nd AAAI Conference on Artificial Intelligence,2007:669-674.
  • 7Zhou Z-H,Zhang M-L.Multi-instance multi-label learning with application to scene classification[C].Sch61kopfB,Platt J C,Ho- fmann T, et al.Vancouver, Canada:Advances in Neural lnforma- tJon Processing Systems. Cambridge, MA: MIT Press, 2007: 1609-1616.
  • 8Zhang M-L,Zhou Z-H.A k-nearest neighbor based algorithm for multi-label classification[C].Beijing, China: Proceedings of the I st IEEE International Conference on Granular Computing, 2005:718-721.
  • 9Zhang M-L,Zhou Z-H.Multilabel neural networks with applica- tions to functional genomics and text categorization [J]. IEEE Transactions on Knowledge and Data Engineering,2006,18(10): 1338-1351.
  • 10Jin R,Wang S,Zhou Z-H.Learning a distance metric from multi- instance multi-label data[C].Miami,FL:Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2009:896-gl)2.

共引文献89

同被引文献190

引证文献26

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部