2 articles found
1. Studying cost-sensitive learning for multi-class imbalance in Internet traffic classification (Cited by: 1)
Authors: LIU Zhen, LIU Qiong. The Journal of China Universities of Posts and Telecommunications (EI, CSCD), 2012, Issue 6, pp. 63-72 (10 pages).
Abstract: Cost-sensitive learning has been applied to the multi-class imbalance problem in Internet traffic classification and has achieved considerable results. However, classification performance on minority classes with few bytes remains unsatisfactory, because existing research focuses only on classes with a large number of bytes. Therefore, the class-dependent misclassification cost is studied. First, the flow rate based cost matrix (FCM) is investigated. Second, a new cost matrix named the weighted cost matrix (WCM) is proposed, which calculates a reasonable weight for each cost of FCM by taking into account the data imbalance degree and the classification accuracy of each class. WCM is able to further improve classification performance on the difficult minority class (the class with more flows but worse classification accuracy). Experimental results on twelve real traffic datasets show that FCM and WCM obtain more than 92% flow g-mean and 80% byte g-mean on average; on the test set collected one year later, WCM outperforms FCM in terms of stability.
Keywords: Internet traffic classification; minority class; cost matrix; machine learning
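The flow g-mean and byte g-mean reported in the abstract above can be illustrated with a small sketch. It assumes the common definition of g-mean as the geometric mean of per-class recall, and it assumes that the byte variant weights each flow by its byte count; the abstract does not spell out either definition, so the function `g_mean`, its `weights` parameter, and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def g_mean(y_true, y_pred, weights=None):
    """Geometric mean of per-class recall.

    With weights=None every flow counts once (flow g-mean).
    Passing per-flow byte counts as weights gives a byte-weighted
    variant (assumed here to correspond to byte g-mean).
    Weights are assumed to be positive.
    """
    if weights is None:
        weights = [1.0] * len(y_true)

    total = defaultdict(float)    # total weight per true class
    correct = defaultdict(float)  # correctly classified weight per true class
    for t, p, w in zip(y_true, y_pred, weights):
        total[t] += w
        if t == p:
            correct[t] += w

    recalls = [correct[c] / total[c] for c in total]
    return math.prod(recalls) ** (1.0 / len(recalls))


# Toy data (invented for illustration): four small "web" flows and
# two "p2p" flows, one of which is large and one of which is misclassified.
y_true = ["web", "web", "web", "web", "p2p", "p2p"]
y_pred = ["web", "web", "web", "web", "p2p", "web"]
flow_bytes = [100, 100, 100, 100, 9000, 1000]

flow_g = g_mean(y_true, y_pred)              # recalls: web 1.0, p2p 0.5
byte_g = g_mean(y_true, y_pred, flow_bytes)  # recalls: web 1.0, p2p 0.9
```

Because the geometric mean collapses to zero when any single class has zero recall, it penalizes a classifier that ignores a minority class far more strongly than overall accuracy does.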
2. An effective framework for characterizing rare categories (Cited by: 1)
Authors: Jingrui HE, Hanghang TONG, Jaime CARBONELL. Frontiers of Computer Science (SCIE, EI, CSCD), 2012, Issue 2, pp. 154-165 (12 pages).
Abstract: Rare categories are becoming more and more abundant, yet their characterization has received little attention thus far. Fraudulent banking transactions, network intrusions, and rare diseases are examples of rare classes whose detection and characterization are of high value. However, accurate characterization is challenging due to high skewness and non-separability from the majority classes; for example, fraudulent transactions masquerade as legitimate ones. This paper proposes the RACH algorithm, which exploits the compactness property of the rare categories. The algorithm is semi-supervised in nature, since it uses both labeled and unlabeled data. It is based on an optimization framework that encloses the rare examples in a minimum-radius hyperball. The framework is then converted into a convex optimization problem, which is in turn effectively solved in its dual form by the projected subgradient method. RACH can be naturally kernelized. Experimental results validate the effectiveness of RACH.
Keywords: rare category; minority class; characterization; compactness; optimization; hyperball; subgradient