期刊文献+

Dynamically Computing Approximate Frequency Counts in Sliding Window over Data Stream 被引量:1

Dynamically Computing Approximate Frequency Counts in Sliding Window over Data Stream
下载PDF
导出
摘要 This paper presents two one-pass algorithms for dynamically computing frequency counts in sliding window over a data stream-computing frequency counts exceeding user-specified threshold ε. The first algorithm constructs subwindows and deletes expired sub-windows periodically in sliding window, and each sub-window maintains a summary data structure. The first algorithm outputs at most 1/ε + 1 elements for frequency queries over the most recent N elements. The second algorithm adapts multiple levels method to deal with data stream. Once the sketch of the most recent N elements has been constructed, the second algorithm can provides the answers to the frequency queries over the most recent n ( n≤N) elements. The second algorithm outputs at most 1/ε + 2 elements. The analytical and experimental results show that our algorithms are accurate and effective. This paper presents two one-pass algorithms for dynamically computing frequency counts in sliding window over a data stream-computing frequency counts exceeding user-specified threshold ε. The first algorithm constructs subwindows and deletes expired sub-windows periodically in sliding window, and each sub-window maintains a summary data structure. The first algorithm outputs at most 1/ε + 1 elements for frequency queries over the most recent N elements. The second algorithm adapts multiple levels method to deal with data stream. Once the sketch of the most recent N elements has been constructed, the second algorithm can provides the answers to the frequency queries over the most recent n ( n≤N) elements. The second algorithm outputs at most 1/ε + 2 elements. The analytical and experimental results show that our algorithms are accurate and effective.
出处 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期283-288,共6页 武汉大学学报(自然科学英文版)
基金 Supported by the National Natural Science Foun-dation of China (60403027)
关键词 data stream sliding window approximation algorithms frequency counts data stream sliding window approximation algorithms frequency counts
  • 相关文献

参考文献3

  • 1Fischer M,Salzberg S.Finding a Majority amongnVotes : Solution to Problem81-5[].Journal of Algorithms.1982
  • 2Misra J,Gries D.Finding Repeated Elements[].Sci Comput Programming.1982
  • 3Karp R M,Shenker S,Papadi mitriou C H.ASi mple Algo- rithmfor Finding Frequent Elements in Streams and Bags[].ACMTrans on Database Systems.2003

同被引文献12

  • 1Manku G S, Motwani R. Approximate frequency counts over data stream[C:]. In: Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China: Morgan Kanfmann, 2002, 346-357.
  • 2Giannella C, Han J, Pei J, et al. Mining frequent patterns in data streams at multiple time granularities[J]. Next Generation Data Mining, 2003, 191-212.
  • 3Chedy Raissi, Pascal Poncelet, Maguelonne Teisseire. Towards a new approach for mining frequent itemsets on data stream[J]. Journal of Intelligent Information Systems, 2006, 28 (1) : 23- 36.
  • 4Ahmed Metwally, Divyakant Agrawal, Amr EI Abbadi. An integrated efficient solution for computing frequent and top-k elements in data streams[J]. ACM Transactions on Database Systems, 2006, 31 (3) : 1095-1133.
  • 5Barouni-Ebarhimi M, Ali A Ghorbani. A novel approach for frequent phrase mining in web search engine query streams[C]. In: The Fifth Annual Conference on Communication Networks and Services Research (CNSR' 07), 2007, 125-132.
  • 6Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation[C]. In: Proceedings of the 2000 ACM SIG- MOD International Conference on Management of Data, Dallas, USA: ACM Press, 2000, 1-12.
  • 7Arasu A, Manku G S. Approximate counts and quantiles over sliding windows[C]. In: Proceedings of the 23rd ACM SIG- MOD- SIGACT-SIGART Symposium on Principles of Database Systems, Paris, France: ACM Press, 2004, 286-296.
  • 8Joong Hyuk Chang. estWin: online data stream mining of recent frequent itemsets by sliding window method[J]. Journal of Information Science, 2005,31 (2) : 76-90.
  • 9Yishan, Jiao. Maintaining stream statistis over multiscale sliding windows [J], ACM Transactions on Database Systems, 2006, 31(4):1305-1334.
  • 10Yun Chi, Haixun Wang, Philip S Yu, et al. Catch the moment: maintaining closed frequent itemsets over a data stream sliding window[J]. Knowledge and Information Systems, 2006, 10 (3):265-294.

引证文献1

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部