期刊文献+

噪声数据流的分类方法 被引量:2

Classification method of noise data stream
下载PDF
导出
摘要 数据流中噪声数据的处理是当前数据流分类挖掘中重要的研究分支,近些年来得到了广泛的关注.本文提出了一种称为FDBCA的数据流分类算法.它使用基于密度的带有噪声的空间聚类(DBSCAN)的改进算法Fast-DB-SCAN(FDBSCAN)处理噪声数据,并利用错误率方差(MSE)来检测概念漂移.同已有的数据流分类算法相比,实验结果表明了FDBCA算法可以提高噪声数据流的分类精度. Processing noise data from data stream is one of the most important fields in data stream classification.In this paper,a new classification algorithm called FDBCA is proposed,which deals with noise data by Fast-DBSCAN that improved density based spatial clustering of applications with noise algorithm,and use mean square error to detect concept drifting.The experimental results show that FDBCA can improve classification accuracy on classifying noise data stream compared with existed data stream classification algorithm.
出处 《天津理工大学学报》 2011年第3期37-41,共5页 Journal of Tianjin University of Technology
关键词 数据流 概念漂移 FDBSCAN 分类算法 data stream concept drifting FDBSCAN classification algorithm
  • 相关文献

参考文献6

  • 1Wang H, Fan Wei, Yu P, et al. Mining concept-drifting data streams using Ensemble classifiers [ C ]//ACM Spe- cial Interest Group on Management of Data, SIGMOD. KDD' 03:Proceedings of 9th ACM SIGKDD International Conference Knowledge Discovery and Data Mining. United States : Association for Computing Machinery, 2003.
  • 2Xing quan Zhu, Xing dong Wu, Ying Yang. Dynamic Classifier Selection for Effective Mining from Noisy Data Streams[ C ]//IEEE Computer Society TCCL. ICDM : Pro- ceeding of the 4th IEEE International Conference Data Mining. Brighton UK : IEEE Computer Society,20(M.
  • 3王涛,李舟军,颜跃进,陈火旺.数据流挖掘分类技术综述[J].计算机研究与发展,2007,44(11):1809-1815. 被引量:40
  • 4Widmer G, Kubat M. Learning in the presence of concept drift and hidden contexts[ J]. Machine Learning, 1996,23 (1) :69-101.
  • 5Bing Liu. A fast density-based clustering algorithm for large databases [ C ]//Horrocks I, Hendler J A. Machine Learning and Cybernetics : Proceedings of the Fifth Interna- tional Conference on Machine Learning and Cybernetics. United States:Institute of Electronic and Electronic Engi- neering Engineering Computer Society,2006.
  • 6Ester M, Kriegel H, Sander J, et al, A Density-Based Al- gorithm for discovering clusters in large spatial databases with noise [ C ]//ACM Special Interest Group on Manage- ment o f Data, SIGMOD. KDD' 96: Proceedings of 2th ACM SIGKDD International Conference Knowledge Dis- covery and Data Mining. United States: Association for Computing Machinery, 1996.

二级参考文献28

  • 1杨宜东,孙志挥,张净.基于核密度估计的分布数据流离群点检测[J].计算机研究与发展,2005,42(9):1498-1504. 被引量:8
  • 2钱江波,徐宏炳,董逸生,王永利,刘学军,杨雪梅.基于最小生成树的数据流窗口连接优化算法[J].计算机研究与发展,2007,44(6):1000-1007. 被引量:3
  • 3S Muthukrishnan.Data streams:Algorithms and applications[C].The 14th Annual ACM-SIAM Symp on Discrete Algorithms,Baltimore,MD,USA,2003
  • 4H Wang,W Fan,P Yu,et al.Mining concept-drifting data streams using ensemble classifiers[C].The 9th ACM Int'l Conf on Knowledge Discovery and Data Mining (SIGKDD),Washington,2003
  • 5Q H Xie.An efficient approach for mining concept-drifting data streams:[Master dissertation][D].Tainan,China:National University of Tainan,2004
  • 6M Guetova,Holldobter,H V Storr.Incremental fuzzy decision trees[C].The 25th German Conf on Artificial Intelligence(KI2002),Aachen,Germany,2002
  • 7V Ganti,J Gehrke,R Ramakrishnan.Mining data streams under block evolution[J].SIGMOD Explorations,2002,3(2):1-10
  • 8J Han,M Kamber.Data Mining:Concepts and Techniques[M].San Francisco:Morgan Kaufmann,2001
  • 9D Kifer,S Ben-David,J Gehrke.Detecting change in data streams[G].In:Proc of VLDB 2004.San Francisco:Morgan Kaufmann,2004
  • 10W Hoeffding.Probability inequalities for sums of bounded random variables[J].American Statistical Association,1963,58:13-30

共引文献39

同被引文献20

  • 1GOMES J B, MENASALVAS E, SOUSA P A. Learning recurring concepts from data streams with a context-a- ware ensemble [ C ] //Symposium on Applied Compu- ting, [ S. 1. ] :ACM, 2011 : 994 - 999.
  • 2WANG H, FAN W, YU P, et al. Mining concept-drif- ting data streams using ensemble classifiers [ C ]//Inter- national Conference on Knowledge Discovery and Data Mining. IS. 1. ] : ACM, 2003:128 -137.
  • 3CHU F, WANG Y Z, ZANIOLO C. An adaptive learn- ing approach for noisy data streams [ C ]//International Conference on Data Mining. Brighton UK: IEEE, 2004: 351 - 354.
  • 4WU Xindong, LI Peipei, HU Xuegang. Learning from concept drifting data streams with unlabeled data [ J ]. Neurocomputing, 2012,92 ( 3 ) : 145 - 155.
  • 5KATAKIS I, TSOUMAKAS G, VLAHAVAS I. Track- ing recurring contexts using ensemble classifiers: an application to email filtering [ J ]. Knowledge and Infor- mation Systems, 2010,22 ( 3 ) :371 - 391.
  • 6KOTSIANTIS S. Increasing the accuracy of incremen- tal naive bayes classifier using instance based learning [ J ]. International Journal of Control, Automation and Systems, 2013, 11 ( 1 ) : 159 - 166.
  • 7OZA N C, RUSSELL S. Online bagging and boosting [ C JJ International Workshop on Artificial Intelligence and Statistics. San Francisco: Morgan Kaufmann, 2001 : 105 - 112.
  • 8杨静,张楠男,李建,刘延明,梁美红.决策树算法的研究与应用[J].计算机技术与发展,2010,20(2):114-116. 被引量:49
  • 9李燕,张玉红,胡学钢.基于C4.5和NB混合模型的数据流分类算法[J].计算机科学,2010,37(12):138-142. 被引量:8
  • 10甄田甜,张玉红,李燕,王海平,胡学钢.一种基于UFFT的数据流分类器[J].合肥工业大学学报(自然科学版),2011,34(1):65-70. 被引量:1

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部