期刊文献+

用于不平衡数据分类的代价敏感超网络算法 被引量:7

Cost-sensitive hypernetworks for imbalanced data classification
下载PDF
导出
摘要 传统的超网络模型在处理不平衡数据分类问题时,具有很大的偏向性,正类的识别率远远高于负类。为此,提出了一种代价敏感超网络Boosting集成算法。首先,将代价敏感学习引入超网络模型,提出了代价敏感的超网络模型;同时,为了使算法能够自适应正类的错分代价,采用Boosting算法对代价敏感超网络进行集成。代价敏感超网络能很好地修正传统的超网络在处理不平衡数据分类问题时过分偏向正类的缺陷,提高对负类的分类准确性。实验结果表明,代价敏感超网络Boosting集成算法具有处理不平衡数据分类问题的优势。 Traditional hypernetwork model is biased towards the majority class, which leads to much higher accuracy on majority class than the minority when being tackled on imbalanced data classification problem. In this paper, a Boosting ensemble of cost-sensitive hypernetworks was proposed. Firstly, the cost-sensitive learning was introduced to hypernetwork model, to propose cost-sensitive hyperenetwork model. Meanwhile, to make the algorithm adapt to the cost of misclassification on positive class, cost-sensitive hypernetworks were integrated by Boosting. The proposed model revised the bias towards the majority class when traditional hypernetwork model was tackled on imbalanced data classification, and improved the classification accuracy on minority class. The experimental results show that the proposed scheme has advantages in imbalanced data classification.
出处 《计算机应用》 CSCD 北大核心 2014年第5期1336-1340,1377,共6页 journal of Computer Applications
基金 重庆市教育委员会2010年度科学技术研究资助项目(渝教科[2013]4号)
关键词 不平衡数据分类 超网络 代价敏感学习 自适应学习 imbalanced data classification hypernetwork cost-sensitive learning adaptive learning Boosting
  • 相关文献

参考文献18

  • 1ZHANG B T.Hypernetworks:a molecular evolutionary architecture for cognitive learning and memory[J].IEEE Computational Intelligence Magazine,2008,3(3):49-63.
  • 2ZHANG B T,JANG H Y.Molecular programming:evolving genetic programs in a test tube[C]//Proceedings of the 2005 Conference on Genetic and Evolutionary Computation.New York:ACM,2005:1761-1768.
  • 3KIM S,HEO M O,ZHANG B T.Text classifiers evolved on a simulated DNA computer[C]//Proceedings of the 2006 IEEE Congress on Evolutionary Computation.Piscataway:IEEE Press,2006:2646-2652.
  • 4KIM E S,HA J W,JUNG W H,et al.Mutual information-based evolution of hypernetworks for brain data analysis[C]//Proceedings of the 2011 IEEE Congress on Evolutionary Computation.Piscataway:IEEE Press,2011:2611-2617.
  • 5SUN Y,WONG A K C,KAMEL M S.Classification of imbalanced data:a review[J].International Journal of Pattern Recognition and Artificial Intelligence,2009,23(4):687-719.
  • 6LING C X,YANG Q,WANG J,et al.Decision trees with minimal costs[C]//Proceedings of the 21 st International Conference on Machine Learning.New York:ACM,2004:69-76.
  • 7ZHOU Z H,LIU X Y.Training cost-sensitive neural networks with methods addressing the class imbalance problem[J].IEEE Transactions on Knowledge and Data Engineering,2006,18(1):63-77.
  • 8雷治军,张素玲,薛贞霞.基于球边界的不平衡数据分类方法[J].计算机应用,2008,28(4):866-868. 被引量:1
  • 9CHAWLA N V,LAZAREVIC A,HALL L O,et al.SMOTEBoost:improving prediction of the minority class in boosting[C]//Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Database.Berlin:Springer-Verlag,2003:107-119.
  • 10GUO H,VIKTOR H L.Learning from imbalanced data sets with boosting and data generation:the databoost-IM approach[J].ACM SIGKDD Explorations Newsletter,2004,6(1):30-39.

二级参考文献32

  • 1陆从德,张太镒,胡金燕.基于乘性规则的支持向量域分类器[J].计算机学报,2004,27(5):690-694. 被引量:21
  • 2姜桂艳,温慧敏,杨兆升.高速公路交通事件自动检测系统与算法设计[J].交通运输工程学报,2001,1(1):77-81. 被引量:67
  • 3凌晓峰,SHENG Victor S..代价敏感分类器的比较研究(英文)[J].计算机学报,2007,30(8):1203-1212. 被引量:35
  • 4Bartlett P L, Traskin M. AdaBoost is consistent. Journal of Machine Learning Research, 2007, 8:2347-2368.
  • 5Schapire R E. The convergence rate of AdaBoost [open prob lem]//Proceedings of the 23rd Conference on Learning Theo ry. Haifa, Israel, 2010.
  • 6Japkowicz N. Learning from imbalanced data sets: A com parison of various strategies/ /Proceedings of the AAAI 2000 Workshop, 2000:10-15.
  • 7Chawla N V, Japkowicz N, Kotcz A. Workshop on learning from imbalanced data sets//Proceedings of the ICML' 2003. Washington, DC, USA, 2003.
  • 8Chawla N V, Japkowicz N, Kolez A. Editorial: Special issue on learning from imbalanced data sets. ACM SIGKDD Ex- plorations Newsletter, 2004, 6 (1) : 1-6.
  • 9He Hai-Bo, Garcia E A. Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(9): 1263-1284.
  • 10Liu X Y, Zhou Z H. The influence of class imbalance on cost-sensitive learning: An empirical study//Proeeedings of the 6th International Conference on Data Mining(ICDM'06). Hong Kong, China, 2006 : 970-974.

共引文献66

同被引文献60

  • 1聂佩林,陈晓翔,佘锡伟,戴秀斌.基于代价敏感神经网络的交通状态判别[J].公路交通科技(应用技术版),2011,7(3):220-223. 被引量:3
  • 2朱兴统,熊建斌.基于PCA和BP神经网络的故障诊断仿真系统[J].自动化与仪器仪表,2015(12):47-48. 被引量:5
  • 3赵凤英,王崇骏,陈世福.用于不均衡数据集的挖掘方法[J].计算机科学,2007,34(9):139-141. 被引量:5
  • 4BATISTA G E, PRATI R C, MONARD M C. A study of the behavior of several methods for balancing machine learning training data[J]. ACM Sigkdd Explorations Newsletter, 2004, 6(1):20-29.
  • 5KOTSIANTIS S B, PINTELAS P E. Mixture of expert agents for handling imbalanced data sets[J]. Annals of Mathematics, Computing & Teleinformatics, 2003, 1(1):46-55.
  • 6CHAWLA N V, BOWYER K W, HALL L O, et al. SMOTE: synthetic minority over-sampling technique[J]. Journal of Artificial Intelligence Research, 2002, 16(1):321-357.
  • 7HAN H, WANG W Y, MAO B H. Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning[J]. Computer Science, 2005, 3644:878-887.
  • 8GARCIA S, HERRERA F. Evolutionary under sampling for classification with imbalanced data sets: proposals and taxonomy[J]. Evolutionary Computation, 2009, 17(3):275-306.
  • 9YEN S J, LEE Y S. Cluster-based under-sampling approaches for imbalanced data distributions[J]. Expert Systems with Applications, 2009, 36(3):5718-5727.
  • 10WU J, XIONG H, WU P, et al. Local decomposition for rare class analysis[J]. Kdd, 2007, 20(2):191-220.

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部