期刊文献+

基于多分类器集成的数据流分类方法 被引量:1

Ensemble Classifier Based Data Stream Classifying
下载PDF
导出
摘要 概念漂移给数据流挖掘工作带来了很大阻碍。经典的SEA算法通过动态裁剪集成分类器的方式有效地捕获到概念漂移。其裁剪集成分类器的策略是直接删除掉一个权值最低的基础分类器,这意味着算法抛弃了一个已经学习了的概念,当该概念再出现时还需再学习,导致算法效率的降低。现提出了一种能够提取旧概念的算法(ECRRC),并给出了存储和提取概念的具体方法。面对概念的重复出现,ECRRC不用再学习就能够完成数据流分类。实验结果表明,ECRRC能够提高数据流分类效率。 Concept drift is a big obstacle in the field of mining stream data. By dynamic modifying the ensemble classifier,SEA can effectively catch concept drift for mining stream data. The method of SEA modifying the ensemble classifier is direct dropping a base classifier of the lowest weight. That means the algorithm abandon a learned concept,but the algorithm will waste time to learn the abandoned concept,as a result this leads to a low-level effective algorithm. A new algorithm ECRRC(Ensemble Classifiers Retrieving Repeated Concept ) with the ability of retrieving the old concept is proposed to reuse the old classifier. Facing the concept repeating,ECRRC need not learn again for mining stream data. Besides the method of storing and retrieving the concept is presented. The experimental results show that the algorithm raises classifying data stream efficiency.
出处 《科学技术与工程》 2010年第18期4521-4524,4529,共5页 Science Technology and Engineering
关键词 数据流分类 集成分类器 概念漂移 classify stream data ensemble classifier concept drift
  • 相关文献

参考文献9

  • 1Han J,Kamber M.Data mining:concept and techniques.2ed.San Fransisco,CA.Higher Education Press,2001:1-7.
  • 2Tan Pang ning,Sreinbach M,Kumar V.数据挖掘导论.范明,范宏建,译.北京:北京大学出版社,2006.
  • 3王涛,李舟军,颜跃进,陈火旺.数据流挖掘分类技术综述[J].计算机研究与发展,2007,44(11):1809-1815. 被引量:40
  • 4Widmer G,Kubat M.Learning in the presence of concept drift and hidden contexts.Machine Learning,1996;23(1):69-101.
  • 5金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 6史金成,胡学钢.数据流挖掘研究[J].计算机技术与发展,2007,17(11):11-14. 被引量:6
  • 7Domingos P,Hulten G.Mining high-speed data streams.Proc of ACM SIGKDD Inter Conference Knowledge Discovery in Databases (KDD'00),2000:71-80.
  • 8Wang H,Yin J,Pei J.Suppressing model over-fitting in mining concept-drifting data streams.SIGKDD'06.Philadelphia.[s.n.] ,2006:736-741.
  • 9Street W H,Kim Y S.A streaming ensemble algorithm for large-scale classification.In:Proceeding of the 2005 ACM Symposium on Applied Computing.New Mexico,USA:2005:537-577.

二级参考文献92

共引文献202

同被引文献28

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部