
An effective framework for characterizing rare categories (cited by: 1)

Abstract: Rare categories are becoming increasingly abundant, yet their characterization has received little attention thus far. Fraudulent banking transactions, network intrusions, and rare diseases are examples of rare classes whose detection and characterization are of high value. However, accurate characterization is challenging because of high skewness and non-separability from the majority classes; for example, fraudulent transactions masquerade as legitimate ones. This paper proposes the RACH algorithm, which exploits the compactness property of the rare categories. The algorithm is semi-supervised in nature, since it uses both labeled and unlabeled data. It is based on an optimization framework that encloses the rare examples in a minimum-radius hyperball. The framework is then converted into a convex optimization problem, which is in turn solved effectively in its dual form by the projected subgradient method. RACH can be naturally kernelized. Experimental results validate the effectiveness of RACH.
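The abstract names three ingredients: a minimum-radius hyperball, a convex dual problem, and a projected (sub)gradient solver. The sketch below illustrates those ingredients on a much simpler problem and should not be read as the RACH algorithm itself. It assumes a hard-margin minimum enclosing ball over labeled points only, with no unlabeled data and no kernel. Its dual maximizes sum_i a_i ||x_i||^2 - ||sum_i a_i x_i||^2 over the probability simplex; this objective is smooth, so the subgradient is simply the gradient. The names project_to_simplex and min_enclosing_ball, the step size, and the iteration count are all illustrative assumptions.

```python
import math


def project_to_simplex(v):
    """Euclidean projection of v onto {a : a_i >= 0, sum(a) = 1} (sort-based)."""
    u = sorted(v, reverse=True)
    cumsum = 0.0
    theta = 0.0
    for k, u_k in enumerate(u, start=1):
        cumsum += u_k
        t = (cumsum - 1.0) / k
        if u_k - t > 0:
            theta = t  # keep the threshold of the last index that stays positive
    return [max(x - theta, 0.0) for x in v]


def min_enclosing_ball(points, step=0.1, iters=2000):
    """Projected gradient ascent on the dual of the minimum enclosing ball.

    Hypothetical helper for illustration only; it is NOT the RACH objective.
    Returns (center, radius, alpha), where alpha are the dual weights.
    """
    n = len(points)
    dim = len(points[0])
    alpha = [1.0 / n] * n

    def center_of(weights):
        return [sum(a * p[d] for a, p in zip(weights, points)) for d in range(dim)]

    def sq_dists(c):
        return [sum((p[d] - c[d]) ** 2 for d in range(dim)) for p in points]

    for _ in range(iters):
        c = center_of(alpha)
        # The true gradient is ||x_i - c||^2 - ||c||^2; the shared constant
        # ||c||^2 is dropped because the simplex projection ignores it.
        grad = sq_dists(c)
        alpha = project_to_simplex([a + step * g for a, g in zip(alpha, grad)])

    c = center_of(alpha)
    radius = math.sqrt(max(sq_dists(c)))  # radius that encloses every point
    return c, radius, alpha


if __name__ == "__main__":
    center, radius, alpha = min_enclosing_ball([(-1.0, 0.0), (1.0, 0.0), (0.0, 0.5)])
    print(center, radius, alpha)
```

For the three points in the usage example, the dual weights concentrate on the two extreme points, which act as the support of the ball. A kernelized variant would replace the inner products with kernel evaluations, in the spirit of the abstract's remark that RACH can be kernelized, but that extension is not shown here.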
Source: Frontiers of Computer Science (中国计算机科学前沿, English edition; indexed in SCIE, EI, CSCD), 2012, Issue 2, pp. 154-165 (12 pages)
Keywords: rare category, minority class, characterization, compactness, optimization, hyperball, subgradient
