期刊文献+

基于支持向量机的不平衡数据分类的改进欠采样方法 被引量:16

An Improved SVM Based Under-Sampling Method for Classifying Imbalanced Data
下载PDF
导出
摘要 支持向量机作为一种有监督分类算法,具有小样本,非线性等独特优势,但其在处理不平衡数据分类时效果不够理想。欠采样是一类常用的数据重构方法,它被广泛用于解决不平衡数据的分类问题,然而,传统的随机欠采样方法受随机性影响,稳定性较差。提出一种改进的欠采样方法,并应用在支持向量机上进行分类对比实验。实验结果表明,相比传统随机欠采样方法,该方法的稳定性更好,且在许多情况下可以提高支持向量机对不平衡数据的分类性能。 As a supervised classifier, Support Vector Machine (SVM) has prominent advantages in solving some problems on petty and nonlinear datasets, but it is unsatisfying in tackling with imbalanced datasets. Random under-sampling has been a widely used method to improve SVM's performance on imbalanced data, but its stability is easily influenced by the nature of randomness. A modified SVM based on under-sampling method is presented to classify imbalanced data. Compared with the random undersampling technique, it is shown through experiments on natural datasets that the new proposed undersampling method is more stable in classifying imbalanced data, and exhibits improved SVM performance in classifying imbalanced data for many cases.
出处 《中山大学学报(自然科学版)》 CAS CSCD 北大核心 2012年第6期10-16,共7页 Acta Scientiarum Naturalium Universitatis Sunyatseni
基金 国家自然科学基金资助项目(U1135005)
关键词 支持向量机 不平衡数据 欠采样 稳定性 support vector machine imbalanced data under-sampling stability
  • 相关文献

参考文献17

  • 1WEISS G M. Mining with rarity:A unifying framework[J].ACM SIGKDD Explorations Newsletter-Special issue on learning from imbalanced datasets,2004,(01):7-19.
  • 2HE H B,GARCIA. Learning from imbalanced data[J].IEEE Transactions on Knowledge and Data Engineering,2009,(09):1263-1284.
  • 3CHAN P K,STOLFO S J. Toward scalable learning with non-uniform class and cost distributions:A case study in credit card fraud detection[A].1998.164-168.
  • 4JAPKOWICZ N,STEPHEN S. The class imbalance problem:A systematic study[J].Intelligent Data Analysis,2002,(05):429-449.
  • 5PRATI R C,BATISTA G,MONARD M C. Class imbalances versus class overlapping:an analysis of a learning system behavior[A].2004.312-321.
  • 6CORTES C,VAPNIK V. Support-vector networks[J].Machine Learning,1995,(03):273-297.
  • 7QUINLAN J R. C4.5 programs for machine learning[M].San Mateo,Calif:Morgan Kaufmann Publishers,1993.
  • 8TANG Y C,ZHANG Y Q,CHAWLA N V. SVMs modeling for highly imbalanced classification[J].IEEE Transactions on Systems Man and Cybernetics,2009,(01):281-288.
  • 9WANG X H,SHU P,CAO L. A ROC curve method for performance evaluation of support vector machine with optimization strategy[A].2009.117-120.
  • 10PRATI R C,BATISTA G,MONARD M C. A study of the behavior of several methods for balancing machine learning training data[J].ACM SIGKDD Explorations Newsletter-Special issue on learning from imbalanced datasets,2004,(01):20-29.

二级参考文献9

  • 1COVER T M,HART P E. Nearest neighbor pattern classification [J]. In Trans IEEE Inform Theory, 1967,IT- 13:21 - 27.?A
  • 2CHO T H,CONNERS R W,ARAMAN P A. A comparison of rule-based, K-nearest neighbor, and neural net classifiers for automation [ C ]. Proceedings, Developing and Managing Expert System Programs, 1991, 202 - 209.?A
  • 3DUDANI S A. The distance-weighted k-nearest-neighbor rule [J]. IEEE Trans Syst Man Cyber, 1976, 6:325-327.?A
  • 4VAPNIK V N. The nature of statistical learningtheory[M].NewYork:Springer-Verlag,1995.张学工,译.统计学习理论的本质[M].北京:清华大学出版社,1999.?A
  • 5BURGES J C. A tutorial on support vector machines for pattern recognition [ M ]. Bell Laboratories, Lucent Technologies, Boston, 1997.?A
  • 6KEERTHI S S, SHEVADE S K, BHATTACHARYYA C, et al. Improvements to Platt's SMO algorithm for SVM classifier design[J]. Neural Computation,2001,13(3):637 - 649.?A
  • 7LIN C J. A formal analysis of stopping criteria of decomposition methods for support vector machines[J]. IEEE Transaction on Neural Networks 2002, 13 (5): 1045 - 1052.?A
  • 8LEE J H, LIN C J. Automatic model selection for support vector machines[ EB/OL]. Available from http:∥www. csie.ntu. edu. tw/~ cjlin/papers. html, 2000.?A
  • 9田盛丰,黄厚宽.基于支持向量机的数据库学习算法[J].计算机研究与发展,2000,37(1):17-22. 被引量:53

共引文献50

同被引文献122

引证文献16

二级引证文献138

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部