期刊文献+

界标窗口中数据流频繁模式挖掘算法研究 被引量:1

Research on Data Stream Frequent Pattern Mining Algorithm in Landmark Window
下载PDF
导出
摘要 数据流的流量太大会无法被整个存储,或被多次扫描。为此,在研究已有挖掘算法的基础上,提出一种界标窗口中数据流频繁模式挖掘算法DSMFP_LW。利用扩展前缀模式树存储全局临界频繁模式,实现单遍扫描数据流和数据增量更新。实验结果表明,与Lossy Counting算法相比,DSMFP_LW算法具有更好的时空效率。 For data traffic flow is too large to store the entire data stream or on its scan times and other issues,through the research of algorithms on mining frequent patterns that are proposed,this paper proposes an algorithm on mining frequent patterns over data stream based on Landmark window,named DSMFP_LW.DSMFP_LW has major features as follows: namely single streaming data scan for counting pattern's information,extended prefix-tree-based compact pattern representation,and incremental update of data.Experimental results show that DSMFP_LW algorithm has better utilization of time and space efficiency.In addition,it outperforms the well-known algorithm Lossy Counting in the same streaming environment.
出处 《计算机工程》 CAS CSCD 2012年第1期55-58,61,共5页 Computer Engineering
基金 海南省自然科学基金资助项目(610221 109002 808155) 海南师范大学青年科研基金资助项目(QN0923)
关键词 界标窗口 频繁模式 数据流 DSMFP_LW算法 滑动窗口 landmark window frequent pattern data stream DSMFP_LW algorithm sliding window
  • 相关文献

参考文献7

二级参考文献49

  • 1NIE Guo-liang LU Zheng-ding.Dynamically Computing Approximate Frequency Counts in Sliding Window over Data Stream[J].Wuhan University Journal of Natural Sciences,2006,11(1):283-288. 被引量:1
  • 2王伟平,李建中,张冬冬,郭龙江.基于滑动窗口的数据流连续J-A查询的处理方法[J].软件学报,2006,17(4):740-749. 被引量:18
  • 3刘学军,徐宏炳,董逸生,钱江波,王永利.基于滑动窗口的数据流闭合频繁模式的挖掘[J].计算机研究与发展,2006,43(10):1738-1743. 被引量:26
  • 4Pasquier N,Bastide Y,LakhalL T R.Discovering Frequent Closed Itemsets for Association Rules[C]//Proc.of the 7th International Conference on Database Theory.Jerusalem,Israel:Springer,1999.
  • 5Pei Jian,Han Jiawei,Mao Runying.CLOSET:An Efficient Algorithm for Mining Frequent Closed Itemsets[C]//Proc.of the ACM SIGMOD Int'l Workshop on Data Mining and Knowledge Discovery.New York,USA:ACM Press,2001.
  • 6Chi Yun,Wang Haixun,Yu Philip,et al.Catch the Moment:Maintaining Closed Frequent Itemsets over a Data Stream Sliding Window[Z].IBM,2004.
  • 7Han Jiawei,Wang Jianyong,Lu Ying,et al.Mining Top K Frequent Closed Patterns Without Minimum Support[C]//Proc.of Int'l Conf.on Data Mining.Maebashi,Japan:IEEE Press,2002.
  • 8Motwani R, Widom J, Arasu A, et al. Query Processing, Approximation, and Resource Management in a Data Stream Management System[C]//Proc. of the 1st Biennial Conf. on Innovative Data Systems Research. Asilomar, USA: [s. n.], 2003.
  • 9Tatbul N, Cetintemel U, Zdonik S, et al. Load Shedding in a Data Stream Manager[C]//Proc. of the 29th Int'l Conf. on Very Large Data Bases. Berlin, Germany: [s. n.], 2003.
  • 10Babcock B, Datar M, Motwani R. Load Shedding for Aggregation Queries over Data Streams[C]//Proc. of the 20th InternationalConference on Data Engineering. Boston, USA: [s. n.], 2004.

共引文献53

同被引文献15

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部