Classification of Imbalanced Dataset (非平衡数据集的分类)
Abstract Imbalanced datasets arise widely in fields such as finance, business, and scientific research, so their study has both theoretical and practical significance. This paper addresses the processing and classification of imbalanced datasets. First, the SMOTE (synthetic minority over-sampling technique) algorithm is used to balance the imbalanced dataset. Next, classification models are built with the classification algorithms provided in the Weka software. Finally, these models are analyzed and compared against models built without any preprocessing. The comparison confirms that SMOTE is necessary for classifying imbalanced datasets, and the paper also notes that the approach needs further improvement.
Author Fu You (付优)
Source Journal of Electric Power (《电力学报》), 2010, No. 4, pp. 349-352 (4 pages)
Keywords imbalanced dataset; SMOTE algorithm; Weka software; classification algorithm; classification model
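
The balancing step named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation and does not use Weka; it is a simplified pure-Python variant of SMOTE in which the function name `smote`, its parameters, and the toy data are all assumptions made for illustration. Each synthetic point is placed at a random position on the line segment between a randomly chosen minority sample and one of its k nearest minority-class neighbours.

```python
import random
from math import dist


def smote(minority, n_synthetic, k=5, seed=0):
    """Simplified SMOTE sketch: interpolate between minority samples
    and their k nearest minority-class neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbours of base, excluding base itself
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: 20 majority samples versus 4 minority samples.
majority = [(float(x), float(y)) for x in range(5) for y in range(4)]
minority = [(10.0, 10.0), (11.0, 10.5), (10.5, 11.5), (12.0, 11.0)]

# Oversample the minority class until both classes have the same size.
new_points = smote(minority, n_synthetic=len(majority) - len(minority), k=3)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

The balanced data would then be passed to a classifier and its results compared with those of a model trained on the original, unbalanced data, which is the comparison the abstract describes. Because the synthetic points are interpolated, they always lie inside the region spanned by the existing minority samples.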