期刊文献+

流数据挖掘综述 被引量:36

An Overview of Stream Data Mining
下载PDF
导出
摘要 作为一种新的数据形态,流数据对数据挖掘提出了诸多挑战。学者们已提出大量处理流数据的挖掘算法。本文对这些算法进行了综述。首先介绍了多个不同的数据流模型,这些模型对算法设计有着不同的要求。然后,总结了流数据挖掘算法的特点,并给出了算法中常用的技术。最后,分析了各个流数据挖掘任务中的代表性算法。 Data streams pose great challenges to data mining. Many stream data mining algorithms have been proposed. In this paper, we give an overview of these algorithms. Firstly, the data stream models are introduced. Then the characters of stream data mining algorithms are summarized and several techniques that are used in these algorithms are introduced. At last, the representative algorithms of every mining task are analyzed.
出处 《计算机科学》 CSCD 北大核心 2007年第1期1-5,11,共6页 Computer Science
基金 湖北省自然科学基金项目"时空数据库的关键技术研究与实验"(ABA048)的资助
关键词 数据流 数据挖掘 时空复杂度 滑动窗口 Data stream, Data mining, Time-space complexity, Sliding window
  • 相关文献

参考文献51

  • 1Henzinger M R,Raghavan P,Rajagopalan S.Computing on data streams.SRC Technical Note 1998-011.Digital systems research center:Palo Alto,California,1998
  • 2Papadimitriou S,Sun J,Faloutsos C.Streaming Pattern Discovery in Multiple Time-Series.In:Proc of the 31st VLDB Conf,2005.697~708
  • 3Babcock B,et al.Models and issues in data stream systems.In:Proc of 21st ACM Symposium on Principles of Database Systems(PODS 2002),2002.1~16
  • 4Chandrasekaran S,et al.TelegraphCQ:Continuous Dataflow Processing for an Uncertain World.Proc of The Conf on Innovative Data Systems Research (CIDR),2003
  • 5Abadi D J,et al.Aurora:A New Model and Architecture for Data Stream Management.The Intl Journal on Very Large Data Bases,2003,12(2):120~139
  • 6Cai Y D,et al.MAIDS:Mining Alarming Incidents from Data Streams.In:Proc of the 2004 ACM SIGMOD Intl Conf on Management of data,2004.919~920
  • 7O'Callaghan L.Approximation algorithms for clustering streams and large data sets:[Ph D Thesis].The Department of Computer Science,Stanford University,2003
  • 8Muthukrishnan S.Data streams:Algorithms and applications.In:Proc of the fourteenth annual ACM-SIAM symposium on discrete algorithms,2003.413~413
  • 9金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 10Manku G S,Motwani R.Approximate frequency counts over data streams.In:Proc of the 28th VLDB Conf,2002.346~357

二级参考文献52

  • 1Babcock B, Babu S, Datar M, Motwani R, Widom J. Models and issues in data streams. In: Popa L, ed. Proc. of the 21st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems. Madison: ACM Press, 2002. 1~16.
  • 2Terry D, Goldberg D, Nichols D, Oki B. Continuous queries over append-only databases. SIGMOD Record, 1992,21(2):321-330.
  • 3Avnur R, Hellerstein J. Eddies: Continuously adaptive query processing. In: Chen W, Naughton JF, Bernstein PA, eds. Proc. of the 2000 ACM SIGMOD Int'l Conf. on Management of Data. Dallas: ACM Press, 2000. 261~272.
  • 4Hellerstein J, Franklin M, Chandrasekaran S, Deshpande A, Hildrum K, Madden S, Raman V, Shah MA. Adaptive query processing: Technology in evolution. IEEE Data Engineering Bulletin, 2000,23(2):7-18.
  • 5Carney D, Cetinternel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S. Monitoring streams?A new class of DBMS applications. Technical Report, CS-02-01, Providence: Department of Computer Science, Brown University, 2002.
  • 6Guha S, Mishra N, Motwani R, O'Callaghan L. Clustering data streams. In: Blum A, ed. The 41st Annual Symp. on Foundations of Computer Science, FOCS 2000. Redondo Beach: IEEE Computer Society, 2000. 359-366.
  • 7Domingos P, Hulten G. Mining high-speed data streams. In: Ramakrishnan R, Stolfo S, Pregibon D, eds. Proc. of the 6th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. Boston: ACM Press, 2000. 71-80.
  • 8Domingos P, Hulten G, Spencer L. Mining time-changing data streams. In: Provost F, Srikant R, eds. Proc. of the 7th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. San Francisco: ACM Press, 2001. 97~106.
  • 9Zhou A, Cai Z, Wei L, Qian W. M-Kernel merging: Towards density estimation over data streams. In: Cha SK, Yoshikawa M, eds. The 8th Int'l Conf. on Database Systems for Advanced Applications (DASFAA 2003). Kyoto: IEEE Computer Society, 2003. 285~292.
  • 10Gibbons PB, Matias Y. Synopsis data structures for massive data sets. In: Tarjan RE, Warnow T, eds. Proc. of the 10th Annual ACM-SIAM Symp. on Discrete Algorithms. Baltimore: ACM/SIAM, 1999. 909-910.

共引文献160

同被引文献256

引证文献36

二级引证文献100

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部