期刊文献+

TWCT-Stream:数据流上的频繁模式挖掘算法 被引量:1

TWCT-Stream:Algorithm for mining frequent patterns in data streams
下载PDF
导出
摘要 提出一种结合倾斜时间窗的TWCT树结构,可以保存不同时间粒度下频繁模式的完全集,并设计了其顺序更新和删除算法,使其能够存储在外存,从而有效地降低算法的内存空间需求。结合TWCT树结构特点,提出了数据流上的频繁模式挖掘算法TWCT-Stream,其模式生长的TWCT-Growth算法按字典顺序生成频繁模式,以配合TWCT结构的顺序更新。实验证实算法的内存需求低于FP-Stream等同类算法。 A TWCT tree structure is proposed with tilted-time window framework embedded,which can maintain the complete set of frequent patterns at multiple time granularities.And this paper designs the sequential update and delete algorithms for the structure,which makes it can be saved in auxiliary storage in order to reduce the algorithms’ requirements of the main memory effectively.Taking advantage of this characteristic,TWCT-Stream,a frequent pattern mining algorithm in data stream,is proposed.And its pattern growth algorithm,TWCT-Growth,generates frequent patterns in lexical order suitable for the sequential updating of TWCT structure.Experiments have proved its memory requirements lower than the same kind of algorithms like FP-Stream and so on.
出处 《计算机工程与应用》 CSCD 北大核心 2009年第20期147-150,161,共5页 Computer Engineering and Applications
基金 山东省自然科学基金No.Z2007G03 "泰山学者"建设工程专项经费资助~~
关键词 数据流挖掘 频繁模式 倾斜时间窗口 data stream mining frequent pattern tilted-time window
  • 相关文献

参考文献16

  • 1Han J,Cheng H,Xin D,et al.Frequent pattern mining:Current status and future directions[J].Data Mining and Knowledge Discovery, 2007,15(1 ) : 55-86.
  • 2金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 3潘云鹤,王金龙,徐从富.数据流频繁模式挖掘研究进展[J].自动化学报,2006,32(4):594-602. 被引量:34
  • 4Manku G, Motwani R.Approximate frequency counts over data streams[C]//Proceedings of the 28th International Conference on Very Large Data Bases.Hong Kong, China: Morgan Kanfmann, 2002: 346-357.
  • 5Giannella C,Han J,Pei J,et al.Mining frequent patterns in data streams at muhiple time granularities[C]//Next Generation Data Mining, 2003 : 191-212.
  • 6Han J,Pei J,Yin Y.Mining frequent patterns without candidate generation[C]//Proc 2000 ACM-SIGMOD Int Conf Management of Data(SIGMOD'00), Dallas,TX,2000: 1-12.
  • 7Chen Y,Dong G,Han J,et al.Multi-dimensional regression analysis of time-series data streams[C]//Proc 2002 Int Conf Very Large Data Bases (VLDB' 02 ), Hang Kong, China, Aug 2002 : 323-334.
  • 8Hidber C.Online association rule mining[C]//Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data (SIGMOD 1999).Philadelphia:ACM Press,1999: 145-156.
  • 9Chang J,Lee W.Finding recent frequent itemsets adaptively over online data streams[C]//Proceeding of the ACM International Conference on Knowledge Discovery and Data Mining,Washington, DC, 2003:487-492.
  • 10Jin R,Agrawal G.An algorithm for in-core frequent itemset mining on streaming data[C]//Fifth IEEE International Conference on DataMining(ICDM' 05 ), 2005 : 210-217.

二级参考文献53

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2Babcock B, Babu S, Datar M, Motwani R, Widom J. Models and issues in data streams. In: Popa L, ed. Proc. of the 21st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems. Madison: ACM Press, 2002. 1~16.
  • 3Terry D, Goldberg D, Nichols D, Oki B. Continuous queries over append-only databases. SIGMOD Record, 1992,21(2):321-330.
  • 4Avnur R, Hellerstein J. Eddies: Continuously adaptive query processing. In: Chen W, Naughton JF, Bernstein PA, eds. Proc. of the 2000 ACM SIGMOD Int'l Conf. on Management of Data. Dallas: ACM Press, 2000. 261~272.
  • 5Hellerstein J, Franklin M, Chandrasekaran S, Deshpande A, Hildrum K, Madden S, Raman V, Shah MA. Adaptive query processing: Technology in evolution. IEEE Data Engineering Bulletin, 2000,23(2):7-18.
  • 6Carney D, Cetinternel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S. Monitoring streams?A new class of DBMS applications. Technical Report, CS-02-01, Providence: Department of Computer Science, Brown University, 2002.
  • 7Guha S, Mishra N, Motwani R, O'Callaghan L. Clustering data streams. In: Blum A, ed. The 41st Annual Symp. on Foundations of Computer Science, FOCS 2000. Redondo Beach: IEEE Computer Society, 2000. 359-366.
  • 8Domingos P, Hulten G. Mining high-speed data streams. In: Ramakrishnan R, Stolfo S, Pregibon D, eds. Proc. of the 6th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. Boston: ACM Press, 2000. 71-80.
  • 9Domingos P, Hulten G, Spencer L. Mining time-changing data streams. In: Provost F, Srikant R, eds. Proc. of the 7th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. San Francisco: ACM Press, 2001. 97~106.
  • 10Zhou A, Cai Z, Wei L, Qian W. M-Kernel merging: Towards density estimation over data streams. In: Cha SK, Yoshikawa M, eds. The 8th Int'l Conf. on Database Systems for Advanced Applications (DASFAA 2003). Kyoto: IEEE Computer Society, 2003. 285~292.

共引文献187

同被引文献13

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部