
Complex Data Stream Aggregation Query Processing Algorithms
Abstract  Continuous queries over data streams, and continuous aggregate queries in particular, are a difficult problem in data stream processing research. Sketch algorithms can compute high-precision approximations of equi-join sizes over data streams, while histogram algorithms can capture the distribution of a data stream fairly accurately. Combining the strengths of the two techniques, this paper proposes an algorithm that efficiently processes complex aggregate queries over data streams and provides approximate answers to a certain class of such queries. Theoretical analysis and experimental results show that the algorithm achieves high precision with small space complexity.
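Source  Journal of Nanjing University of Aeronautics & Astronautics, 2009, No. 6, pp. 762-766 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding  National High Technology Research and Development Program of China ("863" Program) (2007AA01Z404); Jiangsu Province Science and Technology Support Program (BE2008135); National Natural Science Foundation of China (60673127).
Keywords  data streams; continuous queries; aggregation query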
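
The abstract notes that sketches can approximate the equi-join size of two streams. As a rough illustration of that general idea only, the following is a minimal AMS-style ("tug-of-war") sketch in Python. It is not the paper's algorithm, which also combines histograms; the class name `AmsSketch`, the function `estimate_join_size`, the hash construction, and all parameters are hypothetical choices made for this example.

```python
import random
from statistics import median

PRIME = 2_147_483_647  # Mersenne prime 2**31 - 1, used as the hash field size


class AmsSketch:
    """Grid of +/-1 "tug-of-war" counters (hypothetical helper, not from the paper)."""

    def __init__(self, rows, cols, seed=0):
        rng = random.Random(seed)
        self.rows, self.cols = rows, cols
        # One random cubic polynomial per counter; its low bit yields
        # approximately 4-wise independent +/-1 signs.
        self.coeffs = [
            [tuple(rng.randrange(PRIME) for _ in range(4)) for _ in range(cols)]
            for _ in range(rows)
        ]
        self.counters = [[0] * cols for _ in range(rows)]

    @staticmethod
    def _sign(coeff, value):
        a, b, c, d = coeff
        h = (((a * value + b) * value + c) * value + d) % PRIME
        return 1 if h & 1 else -1

    def update(self, value, count=1):
        """Record `count` arrivals of an integer join-attribute value."""
        for i in range(self.rows):
            for j in range(self.cols):
                self.counters[i][j] += count * self._sign(self.coeffs[i][j], value)


def estimate_join_size(r, s):
    """Median over rows of the mean counter product.

    Assumes r and s were built with the same shape and the same seed,
    so that both streams are hashed with identical sign functions.
    """
    row_means = [
        sum(x * y for x, y in zip(r_row, s_row)) / r.cols
        for r_row, s_row in zip(r.counters, s.counters)
    ]
    return median(row_means)
```

Each stream keeps only `rows * cols` counters regardless of its length, which is where the small space cost comes from. Averaging within a row reduces variance, and taking the median across rows guards against outlier rows. In the degenerate case where both streams contain a single shared value, the signs cancel and the estimate equals the exact join size.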
