期刊文献+

数据流分析与技术研究 被引量:6

Data stream analysis and research
下载PDF
导出
摘要 数据流作为一种新的数据形态,不同于传统的静态数据,具有连续快速、短暂易逝和不可预测的特点,对其进行有效地分析和挖掘遇到了极大的挑战。介绍了数据流的基本概念、数据流模型、数据流处理模型和目前一些数据流管理系统,并对数据流技术及其挖掘算法进行归纳和分类论述。 Data stream is a new data form, which is continuous, fast, transitory and unpredictable, as compared to traditional staticdata.So it challenges to analyze and mine it effectively.The purpose of this paper is to review recent work in this field.The maincontents include data stream models, data stream processing models and current Data Stream Management System.Furthermore, datastream technology and its mining algorithm are also summarized comprehensively.
出处 《计算机工程与应用》 CSCD 北大核心 2008年第15期8-11,共4页 Computer Engineering and Applications
基金 国家自然科学基金(the National Natural Science Foundation of China under Grant No.70371007 No.70771004) 北京市自然科学基金(the Natural Science Foundation of Beijing City of China under Grant No.9052006)
关键词 数据流 DSMS 概要数据结构 滑动窗口 data stream,DSMS,synopsis data structure,sliding window
  • 相关文献

参考文献16

  • 1Muthukrishnan S.Data streams:algorithms and applications[C]//Proc of the 14th Annual ACM-SIAM Symposium on Discrete Algorithms ,2003:413-413.
  • 2金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 3Henzinger M R, Raghavan P, Rajagopalan S.Computing on data streams,SRC Technical Note 1998-011[R].Palo Alto,California:Digital Systems Research Center, 1998.
  • 4Babcock B,Babu S,Datar M,et al.Models and issues in data stream systems[C]//Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems,Madison, USA :ACM Press,2002:1-16.
  • 5Arasu A,Babcock B.STREAM:the stanford stream data manager[J]. IEEE Data Engineering Bulletin,2003,26( 1 ) : 19-26.
  • 6Carney D,Cetintemel U.Monitoring streams:a new class of data management applications[C]//Proceedings of the 28th International Conference on Very Large Data Bases,2002:215-226.
  • 7何增有,徐晓飞,邓胜春.Squeezer:An Efficient Algorithm for Clustering Categorical Data[J].Journal of Computer Science & Technology,2002,17(5):611-624. 被引量:32
  • 8Zhang T,Ramakrishnan R.BIRCH:a efficient data clustering method for very large databases[C]//Proc of ACM SIGMOD Conference on Management of Data, Montreal, Canada 1996.
  • 9Guha S,Meyerson A,MiShra N,et al.Clustering data streams:theory and practice[J]JEEE Transactions on Knowledge and Data Engineering,2003,15(3 ) :515-528.
  • 10Aggarwal C,Han J,Wang J,et al.A framework for clustering evolving data streams[C]//Proc of Int Conf on Very Large Data Bases (VLDB'03 ), Berlin, Germany, 2003.

二级参考文献69

  • 1Babcock B, Babu S, Datar M, Motwani R, Widom J. Models and issues in data streams. In: Popa L, ed. Proc. of the 21st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems. Madison: ACM Press, 2002. 1~16.
  • 2Terry D, Goldberg D, Nichols D, Oki B. Continuous queries over append-only databases. SIGMOD Record, 1992,21(2):321-330.
  • 3Avnur R, Hellerstein J. Eddies: Continuously adaptive query processing. In: Chen W, Naughton JF, Bernstein PA, eds. Proc. of the 2000 ACM SIGMOD Int'l Conf. on Management of Data. Dallas: ACM Press, 2000. 261~272.
  • 4Hellerstein J, Franklin M, Chandrasekaran S, Deshpande A, Hildrum K, Madden S, Raman V, Shah MA. Adaptive query processing: Technology in evolution. IEEE Data Engineering Bulletin, 2000,23(2):7-18.
  • 5Carney D, Cetinternel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S. Monitoring streams?A new class of DBMS applications. Technical Report, CS-02-01, Providence: Department of Computer Science, Brown University, 2002.
  • 6Guha S, Mishra N, Motwani R, O'Callaghan L. Clustering data streams. In: Blum A, ed. The 41st Annual Symp. on Foundations of Computer Science, FOCS 2000. Redondo Beach: IEEE Computer Society, 2000. 359-366.
  • 7Domingos P, Hulten G. Mining high-speed data streams. In: Ramakrishnan R, Stolfo S, Pregibon D, eds. Proc. of the 6th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. Boston: ACM Press, 2000. 71-80.
  • 8Domingos P, Hulten G, Spencer L. Mining time-changing data streams. In: Provost F, Srikant R, eds. Proc. of the 7th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. San Francisco: ACM Press, 2001. 97~106.
  • 9Zhou A, Cai Z, Wei L, Qian W. M-Kernel merging: Towards density estimation over data streams. In: Cha SK, Yoshikawa M, eds. The 8th Int'l Conf. on Database Systems for Advanced Applications (DASFAA 2003). Kyoto: IEEE Computer Society, 2003. 285~292.
  • 10Gibbons PB, Matias Y. Synopsis data structures for massive data sets. In: Tarjan RE, Warnow T, eds. Proc. of the 10th Annual ACM-SIAM Symp. on Discrete Algorithms. Baltimore: ACM/SIAM, 1999. 909-910.

共引文献191

同被引文献40

  • 1施晓莺.利用WORD模板功能快速实现成果地质资料文字部分的制作[J].中国煤田地质,2005,17(B06):112-115. 被引量:2
  • 2汪小飞,赵克佳,田祖伟.数据流分析的关键技术研究[J].计算机科学,2005,32(12):91-93. 被引量:10
  • 3闫新珠,王秀芹.在VC中利用Word生成测量报告[J].地矿测绘,2006,22(1):32-33. 被引量:7
  • 4Melek WW, Lu Z, Kapps A, et al. Comparison of trend detection algorithms in the analysis of physiological time-series data. IEEE Trans. on Biomed Engineering, 2005,52(4):639 - 65 1.
  • 5Bill. Java2Word 1. 1 [ EB/OL ]. http://dev. heavenlake. com: 81/developer/viewthread? thread = 24,2008-07-29.
  • 6武森,高学东,M.巴斯蒂安[德].数据仓库与数据挖掘[M].北京:冶金工业出版社.2009.
  • 7楚红涛 寒枫 张燕 王婷.基于数据流的挖掘研究.计算机技术与应用进展,2007,:215-219.
  • 8Guha S, Mishra N, Motwani R, O'Callaghan L. Clustering data streams[C]. In:Proceedings of the 41st Annual IEEE Symposium on Foundations of Computer Science (FOCS). 2000, 359-366.
  • 9O,Callaghan L,Mishia N, Meyerson A, et al.Streaming-data algorithms forhigh-quality clustering[C]. In: Proceedings of IEEE International Conference on Data Engineering.San Jose, California, USA: 2002, 685-704.
  • 10Charu C, Aggarwal, Jiawei Han,Jianyong Wang. A framework for clusteringevolving data streams[C]. In: Proceedings of the 29th VLDB Conference.Berlin, Germany: 2003, 81-92.

引证文献6

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部