期刊文献+

基于GA-SVM封装算法的高光谱数据特征选择 被引量:8

A genetic algorithm based wrapper feature selection method for classification of hyper spectral data using support vector maching
下载PDF
导出
摘要 封装型的特征选择算法相对于过滤算法而言更有助于提高分类精度,因此在当前计算技术及效率快速发展的背景下必将成为未来之趋势。本文以支持向量机(SVM)为分类器,遗传算法(GA)为特征子集的搜索算法,构建了封装型的特征选择算法GA-SVM,并用EN-VI/IDL语言编程实现,最后以HYPERION高光谱数据为例对算法予以应用。结果表明,GA-SVM算法可从196个波段中选择出13个波段,同时分类精度较不做特征选择时提高了约4%。由此可见,GA-SVM封装型特征选择算法具有较好的同时优化特征子集和SVM核函数的性能,可为当前高光谱数据的特征选择提供一个较好的算法。 The high-dimensional feature vectors of hyper spectral data often impose a high computational cost as well as the risk of "over fitting" when classification is performed. Therefore it is necessary to reduce the dimensionality through ways like feature selection. Currently, there are two kinds of feature selection methods: filter methods and wrapper methods. The former kind requires no feedback from classifiers and estimates the classification performance indirectly. The latter kind evaluates the "goodness" of selected feature subset directly based on the classification accuracy. Many experimental results have proved that the wrapper methods can yield better performance, although they have the dis advantage of high computational cost. In this paper, we present a Genetic Algorithm (GA) based wrapper method for classification of hyper spectral data using Support Vector Machine (SVM), a state-of-art classifier that has found to be success in a variety of areas. The genetic algorithm (GA), which seeks to solve optimization problems using the methods of evolution, specifically survival of the fittest, was used to optimize both the feature subset, i.e. band subset, of hyper spectral data and SVM kernel parameters simultaneously. A special strategy was adopted to reduce computation cost caused by the high-dimensional feature vectors of hyper spectral data when the feature subset part of chromo- some was designed. The GA-SVM method was realized using the ENVI/IDL language, and was then tested by applying a HYPERION hyper spectral image. Comparison of the optimized results and the un-optimized results showed that the GA-SVM method could significantly reduce the computation cost while improving the classification accuracy. The number of bands used for classification was reduced from 198 to 13, while the classification accuracy increased from 88.81% to 92.51%. The optimized values of the two SVM kernel parameters were 95. 0297 and 0. 2021, respectively, which were different from the default values as used in the ENVI software. In conclusion, the proposed wrapper feature selection method GA-SVM can optimize feature subsets and SVM kernel parameters at the same time, therefore can be applied in feature selection of the hyper spectral data.
出处 《地理研究》 CSCD 北大核心 2008年第3期493-501,726,共10页 Geographical Research
基金 国家自然科学基金(40601010) 中国博士后基金(20060390208) “985工程”GIS与遥感的地学应用科技创新平台资助(105203200400006) 国家杰出青年科学基金资助项目(40525002)
关键词 特征选择 高光谱 遗传算法(GA) 支持向量机(SVM) feature selection hyperspectral genetic algorithm supported vector machine
  • 相关文献

参考文献32

  • 1陈蜜,易尧华,刘志刚,李德仁,秦前清.基于分块特性的高光谱影像波段选取方法的研究[J].测绘通报,2006(3):10-13. 被引量:3
  • 2杨哲海,张雅争,宫大鹏,李之歆,韩建峰.基于Tabu搜索的高光谱影像特征选择[J].海洋测绘,2006,26(4):11-14. 被引量:10
  • 3王长耀,刘正军,颜春燕.成像光谱数据特征选择及小麦品种识别实验研究[J].遥感学报,2006,10(2):249-255. 被引量:31
  • 4Liu Y, Zheng Y F. FS_SFS:A novel feature selection method for support vector machines. Pattern Recognition, 2006, 39: 1333-1345.
  • 5Kohavi R, John G H. Wrappers for feature subset selection. Artif. Intell.,1997, 97: 273-324.
  • 6Huang Ch L, Wang Ch J. A GA-based feature selection and parameters optimization for support vector machines. Expert Systems with Applications, 2006, 31: 231-240.
  • 7Mao K Z. Feature subset selection for support vector machines through discriminative function pruning analysis. IEEE Transactions on Systems, Man, and Cybernetics, 2004, 34(1): 60-67.
  • 8Yu E, Cho S. Ensemble based on GA wrapper feature selection. Computers & Industrial Engineering, 2006, 51 (1): 111-116.
  • 9Bi J, Bennett K P, Embrechts M, et al. Dimensionality reduction via sparse support vector machines. J. Mach. Learn. Res. 2003, 3: 1229-1243.
  • 10Sikora R, Piramuthu S. Framework for efficient feature selection in genetic algorithm based data mining. European Journal of Operational Research, 2007, 180.(2): 723-737.

二级参考文献47

共引文献87

同被引文献57

引证文献8

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部