期刊文献+

一种新型欠采样的支持向量机非平衡数据故障诊断研究 被引量:6

Classification Research of SVM with Imbalanced Data Based on a New Type of Undersampling Samples
下载PDF
导出
摘要 支持向量机对于非平衡数据的分类效果不是十分理想;某些新型装备的故障数据较少且难于采集,正常数据则能够较为容易获得足够数量;这就使得支持向量机的诊断效果受到较大影响,如诊断精度降低,漏报、虚警概率的提高;文章借鉴距离最大熵样本欠采样原理,并引入条件熵的概念,提出了距离条件最大熵欠采样策略,用以改善支持向量机对于非平衡样本的诊断性能,实验表明该方法可行有效。 The classification performance of SVM is not very good to the imbalanced data. For new equipment, the failure data are difficult to find and acquire, the enough normal data are much easier to acquire than failure data. There are some bad influences on classification effect of SVM to this problem, such as degrading diagnosis accuracy, increasing failure missing report and false alarm. In this paper, the con ditional entropy is introduced based on the distance maximum entropy undersarnpling', the distance conditional maximum entropy is utilized to improve the diagnosis performance of SVM. The simulation is done to testify its validity.
出处 《计算机测量与控制》 CSCD 北大核心 2012年第5期1203-1204,1235,共3页 Computer Measurement &Control
关键词 支持向量机 距离最大熵 条件熵 非平衡样本 SVM distance maximum entropy conditional entropy imbalanced data sets
  • 相关文献

参考文献5

  • 1Vapnik V. The Nature of Statistical Learning Theory [M]. New York: Spring Verlag, 1995.
  • 2邹淑雪,刘桂霞,时小虎,周春光.基于距离最大熵值的蛋白质结构域边界检测系统[J].吉林大学学报(理学版),2009,47(6):1237-1240. 被引量:1
  • 3林洪桦.现代测量误差分析及数据处理(六)[J].计量技术,1997(6):41-45. 被引量:3
  • 4Kotsiantis S, Kanellopoulos D, Pintelas P. Handling Imbalanced Datasets: a Review [J].GESTS International Transaction on Computer Science and Engineering, 2006, 30 (1): 25-36.
  • 5Akbani R, Kwek S, Japkowicz N. Applying Support Vector Ma chines to Imbalanced Datasets [A]. Proceedings of 15th European Conf on Machine Learning (ECML) [C]. Berlin.. Springer, 2004: 39-50.

二级参考文献8

  • 1Tong S, Chang E. Support Vector Machine Active Learning for Image Retrieval [ C ]//Proceedings of the 9th ACM International Conference on Multimedia. New York: ACM, 2001: 107-118.
  • 2Joachims T. Text Categorization with SVM: Learning with Many Relevant Features [ C ]//Proceedings of ECML-98,10th European Conference on Machine Learning. Berlin: Springer, 1998: 137-142.
  • 3Wu G, Chang E Y. Class-boundary Alignment for Imbalanced Dataset Learning [ C ]//ICML 2003 Workshop on Learning from Imbalanced Data Sets II. Washington D C : [ s. n. ] , 2003 : 49-56.
  • 4Cristianini N, Shawe-Taylor J. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods [ M ]. Cambridge : Cambridge University Press, 2000.
  • 5Akbani R, Kwek S, Japkowicz N. Applying Support Vector Machines to Imbalanced Datasets [ C]//Proceedings of 15th European Conf on Machine Learning (ECML). Berlin: Springer, 2004: 39-50.
  • 6Veropoulos K, Campbell C, Cristianini N. Controlling the Sensitivity of Support Vector Machines [C]//Proceedings of the International Joint Conference on Artificial Intelligence. Stockholm, Sweden: [ s. n. ], 1999: 55-60.
  • 7Kotsiantis S, Kanellopoulos D, Pintelas P. Handling Imbalanced Datasets: a Review [ J ]. GESTS International Transactions on Computer Science and Engineering, 2006, 30( 1 ) : 25-36.
  • 8邹淑雪,黄艳新,李艳文,周春光.一种基于支持向量机的蛋白质结构域边界预测方法[J].吉林大学学报(理学版),2008,46(5):930-934. 被引量:2

共引文献2

同被引文献61

引证文献6

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部