
Feature selection algorithm based on Hellinger distance (cited by: 3)
Abstract: To address the feature selection problem in data mining, this paper studies two definitions of the Hellinger distance based on the distance's properties, proposes a feature selection method based on the Hellinger distance, and designs two corresponding algorithms. Experiments on different data sets show that the features selected by the new algorithms are effective. Compared with other feature selection algorithms, the two algorithms select fewer features, and those features give good C4.5 classification accuracy, improving the average accuracy on the data sets studied.
Source: Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core Journal, 2010, No. 6, pp. 1530-1532, 1634 (4 pages)
Funding: Natural Science Foundation of Jiangsu Province (BK2009233)
Keywords: feature selection; Hellinger distance; data mining
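The abstract does not reproduce either of the paper's two Hellinger distance definitions or its algorithms, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the standard discrete form H(P, Q) = sqrt(1/2 · Σ (√p_i − √q_i)²), a binary class label, and discrete feature values; the function names (`hellinger`, `class_conditional`, `rank_features`) and the toy data are hypothetical. Each feature is scored by the distance between its two class-conditional value distributions, and features are ranked by that score.

```python
import math
from collections import Counter


def hellinger(p, q):
    """Standard Hellinger distance between two discrete distributions.

    p and q map each value to its probability. The result lies in [0, 1].
    Assumption: this is the textbook definition, not necessarily either
    of the two definitions studied in the paper.
    """
    keys = set(p) | set(q)
    total = sum(
        (math.sqrt(p.get(k, 0.0)) - math.sqrt(q.get(k, 0.0))) ** 2 for k in keys
    )
    return math.sqrt(total / 2.0)


def class_conditional(values, labels, target):
    """Empirical distribution of a feature's values within one class."""
    subset = [v for v, y in zip(values, labels) if y == target]
    counts = Counter(subset)
    return {v: c / len(subset) for v, c in counts.items()}


def rank_features(columns, labels):
    """Rank discrete features by the Hellinger distance between their
    class-conditional distributions (binary labels only, by assumption)."""
    classes = sorted(set(labels))
    if len(classes) != 2:
        raise ValueError("this sketch handles exactly two classes")
    first, second = classes
    scores = {
        name: hellinger(
            class_conditional(col, labels, first),
            class_conditional(col, labels, second),
        )
        for name, col in columns.items()
    }
    # Larger distance means the feature separates the classes better.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: one feature that separates the classes, one that does not.
labels = [0, 0, 0, 1, 1, 1]
columns = {
    "informative": ["a", "a", "a", "b", "b", "b"],
    "noise": ["x", "y", "x", "x", "y", "x"],
}
ranking = rank_features(columns, labels)
print(ranking)
```

In this toy case the feature whose value distributions differ completely between the classes gets the maximum score, and the feature whose distributions are identical in both classes scores zero. A selection step would then keep the top-ranked features; how many to keep, and how the paper's algorithms actually decide this, is not stated in the abstract.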

References (10)

  • 1. GUYON I, ELISSEEFF A. An introduction to variable and feature selection[J]. Journal of Machine Learning Research, 2003, 3: 1157-1182.
  • 2. XUAN G R, CHAI P Q, WU M H. Bhattacharyya distance feature selection[C]//Proceedings of the 13th International Conference on Pattern Recognition. Washington, DC: IEEE Computer Society, 1996, 2: 195-199.
  • 3. PIRAMUTHU S. The Hausdorff distance measure for feature selection in learning applications[C]//Proceedings of the 32nd Hawaii International Conference on System Sciences. Washington, DC: IEEE Computer Society, 1999.
  • 4. PAPANTONI-KAZAKOS P. Some distance measures and their use in feature selection, #7611[R]. Houston: Rice University, Electrical Engineering Department, 1976.
  • 5. CIESLAK D A, CHAWLA N V. Learning decision trees for unbalanced data[C]//ECML/PKDD. Berlin: Springer-Verlag, 2008, 1: 241-256.
  • 6. LEE C H, SHIN D G. Using Hellinger distance in a nearest neighbor classifier for relational databases[J]. Knowledge-Based Systems, 1999, 12(7): 363-370.
  • 7. RAO C. A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance[J]. Qüestiió, 1995, 19(1/3): 23-63.
  • 8. Weka 3: Data mining software in Java[EB/OL]. [2009-12-20]. http://www.cs.waikato.ac.nz/ml/weka/.
  • 9. CHANG C. LIBSVM: A library for support vector machines[EB/OL]. [2009-12-20]. http://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/.
  • 10. UCI: Machine learning repository[EB/OL]. [2009-12-20]. http://archive.ics.uci.edu/ml/.

Co-cited literature: 24

Citing literature: 3

Second-level citing literature: 2
