期刊文献+

基于信息熵的概念漂移检测

Concept drift detection based on information entropy
下载PDF
导出
摘要 文章针对概念漂移检测分类器很难维持较高的分类性能,存在错误检测和延迟检测等问题,提出了一种基于信息熵的概念漂移检测算法。首先,使用信息熵对动态数据流中的概念漂移进行检测;然后,将检测到的概念漂移信息,在概念池中进行汇总和统计;最后,使用了两种公开的真实数据和一种人造概念漂移数据进行实验,并对实验结果进行分析,验证了模型的有效性和正确性。实验结果表明,该算法可以有效地检测概念漂移和更新分类器,同时表现出较好的分类性能。 Due to problems such as error detection and delay detection,the classifier of concept drift detection can not maintain a higher classification performance.In this study,a concept drift detection algorithm based on information entropy was proposed.Firstly,the concept drift of dynamic data stream is detected with information entropy.Secondly,the detected concept drift information will be collected and counted in the concept pool.Finally,two publicly available real data and an artificial conceptual drift data are used to make experiments and the experimental results are analyzed to verify the validity and correctness of the model.The results indicate that the proposed algorithm can effectively detect concept drifts and update classifiers,which show a good classification performance.
作者 张大伟 ZHANG Da-wei(College of Information Engineering,Eastern Liaoning University,Dandong 118003,China)
出处 《辽东学院学报(自然科学版)》 CAS 2019年第1期59-64,共6页 Journal of Eastern Liaoning University:Natural Science Edition
关键词 信息熵 概念漂移检测 概念池 数据流分类 information entropy concept drift detection concept pool data stream classification
  • 相关文献

参考文献2

二级参考文献34

  • 1Masud M M, Gao J, Khan L. Mining concept drifting data stream to detect peer to peer botnet traffic [C] //Proe of the 4th Annual Workshop on Cyber Security and Information Intelligence Research. New York: ACM, 2008:56-68.
  • 2Delany S J, Cunningham P, Tsymbal A. A comparison of ensemble and case-base maintenance techniques for handing concept drift in spare filtering [C] //Proc of the 19th Int Conf on Artificial Intelligence. Menlo Park: AAAI, 2006: 340- 345.
  • 3Masud M M, Gao J, Khan L, et al. A practical approach to classify evolving data streams: Training with limited amount of labeled data [C] //Proc of the 8th IEEE Int Conf on Data Mining. Piscataway, NJ: IEEE, 2008:929-934.
  • 4Widmer G, Kubat M. Learning in the presence of concept drift and hidden contexts [J]. Machine Learning, 1996, 23 (1) : 69-101.
  • 5李南,郭躬德.面向高速数据流的集成分类器算法[J].计机应用,2012,32(3):629-633.
  • 6Klinkerberg R. Learning drifting concepts: Examples selection vs. example weighing [J]. Intelligent Data Analysis, 2004, 8(3): 281-300.
  • 7Zhang P, Zhu X Q, Shi Y. Categorizing and mining concept drifting data streams [C] //Proc of the 14th Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2008:812-820.
  • 8Street W, Kim Y. A streaming ensemble algorithm (SEA) for large-scale classification [C] //Proc of the 7th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2001: 77-382.
  • 9Kolter J Z, Maloof M A. Dynamic weighted majority; An ensemble method for drifting concepts [J]. Journal of Machine Research, 2007, 8(12): 2755-2790.
  • 10Dietterich T G, Barkiri G. Solving multiclass learning problems via error-correcting output codes [J]. Artificial Intelligence Research, 1995, 2(1): 263-286.

共引文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部