期刊文献+

面向不平衡数据集的线性分类方法研究

Linear Classification Methods for Imbalanced Datasets
下载PDF
导出
摘要 近年来,面向不平衡数据集的分类器学习与推广问题越来越受到人们的关注,在此以机器学习数据库、美国邮政编码、2维元音等国际上典型的分类问题为应用背景,重点研究如何用线性分类器解决样本数不平衡的问题;对Fisher、伪逆和单层感知器等3种典型的线性分类器做了深入的研究,并将这3种线性分类方法应用到不平衡数据集的分类中;通过实验及分析,这些新方法对平衡数据集的线性分类起到了良好的分类效果。 In recent years,much attention is paid to the learning and generalization problems of classifiers for imbalanced datasets.For the typical classification applications such as machine learning datasets,the US postal service,and 2-dimensional vowels,this paper focuses on the design and learning algorithms of linear classifiers in order to tackle the imbalanced datasets and makes deep studies on Fisher,Pseudo-inverse and single-layer perceptrons and applies these three linear classifiers to imbalanced datasets.Through experiments and analysis,these new methods play a good classification role in linear classification of imbalance datasets.
作者 殷士勇
出处 《重庆工商大学学报(自然科学版)》 2010年第5期467-475,共9页 Journal of Chongqing Technology and Business University:Natural Science Edition
关键词 不平衡数据集 FISHER分类器 伪逆法 单层感知器 线性分类方法 imbalanced datasets Fisher classifier pseudo-inverse algorithm single-layer perceptrons linear classification methods
  • 相关文献

参考文献24

  • 1KUBAT M, HOLTE R, MATWIN S. Machine Learning for the Detection of oil Spills in Satellite Radar Images [ J ]. Machine Learning, 1998, 30(23): 195-215.
  • 2PPUA C, ALAHAKOON D. Minority Report in Fraud Detection: classification of Skewed Data[J]. Sigkdd Explorations,2004,6 ( 1 ) :50-59.
  • 3PeREZ J, MUCUERZA J, ARBELAITZ O. ConSolidated tree Classifier learning in a car insurance fraud detection domain with class imbalance[ C]. Proc of the 3^rd International Conference on Advances in Pattern Recognition,2005:381-389.
  • 4CASTILL M, SERRANO J. A multistrategy approach for digital text categoryation from imbalanced documents [ J ]. SIGKDD Explorations, 2004, 6 ( 1 ) : 70-79.
  • 5ZHANG Z, WU X, SRIHARI R. Feature selection for text categoryzation on imbalanced data[ J]. SIGKDD Explorations,2004,6 ( 1 ) : 80-89.
  • 6COHEN G, HILARIO M, SAX H. Data imbalance in surveillance of nosocomial infections [ C ]. Proc of the 4th International Symposium on Medical Data Analysis, Berlin : ~ s. n. ] , 2003 : 109-117.
  • 7CHEN J, CHENG T, CHAN A. An application of classification analysis for skewed class distribution in therapeutic drug monitoring the case of vancomycin[ C ]. Proc of Workshop on Medical Information Systems, Beijing: [ s. n. ] ,2004:35-39.
  • 8YOON K, KWEK S. An Unsupervised Learning Approach to Resolving The Data Imbalanced Issue in Supervised Learning Problems in Functional Genomics [ C ]. Proc of the 5^th International Conference on Hybrid Intelligent Systems (HISO5), Rio de Janeiro : [ s, n. ], 2005:303-308.
  • 9RADIVOJAC P, KORAD U, SIVALINGAM K. Learning from Class Imbalanced Data in Wireless Sensor Networks [ J 1. Pros of Vehicular Technology Conference, Orlando: [ s, n. ] ,2003:3030-3034.
  • 10MARTY F, OSCAR F. Readings in Computer Vision : Issuer, Problems, Principles and Paradigms [ J ]. Morgan Kaufmann, San Mateo, CA, 1987.

二级参考文献30

  • 1邵森木,权建峰.三角波调频测距引信系统仿真研究[J].探测与控制学报,2005,27(2):13-17. 被引量:6
  • 2朱莉,娄国伟,李兴国.非大气窗口毫米波FMCW近程雷达[J].制导与引信,2005,26(4):13-15. 被引量:1
  • 3崔占忠.调频测距信号分析[J].探测与控制学报,2006,28(5):1-3. 被引量:15
  • 4叶文,朱爱红,刘博,范洪达.飞机低空突防技术研究[J].电光与控制,2007,14(4):87-91. 被引量:17
  • 5李金宗,模式识别导论,1994年
  • 6边肇祺 张学工 等.模式识别[M].北京:清华大学出版社,2001..
  • 7Chan P K, Stolfo S J. Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection[C]//In. Proc of the Fourth International Conference on Knowledge Discovery and Data Mining(KDD-98). New York, 1998: 164- 168.
  • 8Weiss G M, Hirsh H. Learning to Predict Rare Events in Event Sequences[ C]// In. Proc of the Fourth International Conference on Knowledge Discovery and Data Mining(KDD-98). New York: 1998:359- 363.
  • 9Atiya A F. Bankruptcy Prediction for Credit Risk Using Neural Network: a Survey and New Results [J ]. IEEE Trans. Neural Networks, 2001, 12(4) : 929 - 935.
  • 10Kubat M, Holte R C, Matwin S. Machine Learning for the Detection of Oil Spills in Satellite Radar Images[J ].Machine Learning, 1998, 30(2): 195-215.

共引文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部