期刊文献+

高维小样本分类问题中特征选择研究综述 被引量:36

Overview on feature selection in high-dimensional and small-sample-size classification
下载PDF
导出
摘要 随着生物信息学、基因表达谱微阵列、图像识别等技术的发展,高维小样本分类问题成为数据挖掘(包括机器学习、模式识别)中的一项挑战性任务,容易引发"维数灾难"和过拟合问题。针对这个问题,特征选择可以有效避免维数灾难,提升分类模型泛化能力,成为研究的热点,有必要对国内外高维小样本特征选择主要研究情况进行综述。首先分析了高维小样本特征选择问题的本质;其次,根据其算法的本质区别,重点对高维小样本数据的特征选择方法进行分类剖析和比较;最后对高维小样本特征选择研究面临的挑战以及研究方向作了展望。 With the development of bioinformatics, gene expression microarray and image recognition, classification on high-dimensional and small-sample-size data has become a challenging task in data ming, machine learning and pattern recognition as well. High-dimensional and small-sample-size data may cause the problem of "curse of dimensionality" and overfitting. Feature selection can prevent the "curse of dimensionality" effectively and promote the generalization ability of classification mode, and thus become a hot research topic. Accordingly, some recent development of world-wide research on feature selection in high-dimensional and small-sample-size classification was briefly reviewed. Firstly, the nature of high-dimensional and small-sample feature selection was analyzed. Secondly, according to their essential difference, feature selection algorithms for high-dimensional and small-sample-size classification were divided into four categories and compared to summarize their advantages and disadvantages. Finally, challenges and prospects for future trends of feature selection in high-dimensional small-sample-size data were proposed.
作者 王翔 胡学钢
出处 《计算机应用》 CSCD 北大核心 2017年第9期2433-2438,2448,共7页 journal of Computer Applications
基金 国家973计划项目(2016YFC0801406) 国家自然科学基金资助项目(61673152) 安徽省自然科学基金资助项目(1408085QF136)~~
关键词 特征选择 高维数据 小样本学习 信息过滤 支持向量机 feature selection high-dimensional data small-sample-size learning information filtering Support Vector Machine (SVM)
  • 相关文献

参考文献9

二级参考文献225

共引文献215

同被引文献307

引证文献36

二级引证文献120

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部