期刊文献+

滑动窗口中近期数据流频繁项集挖掘 被引量:3

Frequent itmsets ming of recent data in sliding window
下载PDF
导出
摘要 基于嵌套滑动窗口和遗传算法的NSWGA(nested sliding window genetic algorithm)算法对快速挖掘数据流频繁项集进行了研究。NSWGA算法利用遗传算法的并行性来搜索嵌套子窗口内最新数据的频繁项集,合并形成滑动窗口内待选频繁项集,然后扫描获得滑动窗口内的近期频繁项集。NSWGA算法及时准确捕获数据流上最新频繁项集,周期性地删除过期的流数据,并通过嵌套窗口以及遗传算法的并行处理,降低了计算的时间复杂度。 A new approach-NSWGA(nested sliding window genetic algorithm) for mining frequent itemsets over data stream based on nested sliding window and genetic algorithm is proposed.NSWGA uses the parallelism of genetic algorithm to search for the frequent itemsets of the latest data in the nested sub-window.This series of frequent itemsets are merged together.The final frequent itemsets of the sliding window is obtained by scanning the merged itemsets.NSWGA captures the latest frequent itemsets accurately and timely over data stream.At the same time the expired data is deleted periodically.As the use of nested windows and the parallel processing capability of genetic algorithm,this method also reduced the time complexity.
出处 《计算机工程与设计》 CSCD 北大核心 2011年第4期1307-1310,1346,共5页 Computer Engineering and Design
基金 国家自然科学基金项目(60703101)
关键词 数据流 频繁项集 遗传算法 嵌套滑动窗口 并行计算 data stream frequent itemsets genetic algorithm nested sliding window parallel computation
  • 相关文献

参考文献8

  • 1Carson Kai-Sang Leung,Quamrul I Khan.DSTree:A tree struc-ture for the mining of frequent sets from data streams[C].Hong-Kong:Proceedings of the Sixth International Conference on DataMining,2006:928-932.
  • 2Tzung Pei Hong,Chun Wei Lin,Yu Lung Wu.Maintenance of fastupdated frequent pattern trees for record deletion[J].Computa-tional Statistics and Data Analysis,2009,53(7):2485-2499.
  • 3谈克林,孙志挥.一种FP树的并行挖掘算法[J].计算机工程与应用,2006,42(13):155-157. 被引量:10
  • 4冯志新,钟诚.基于FP-tree的最大频繁模式挖掘算法[J].计算机工程,2004,30(11):123-124. 被引量:18
  • 5Joong Hyuk Chang, Won Suk Lee. Finding recent frequentitemsets adaptively over online data streams[C].Washington:Proceedings of the ninth ACM SIGKDD International Con-ference on Knowledge Discovery and Data Mining, 2003:487-492.
  • 6李国徽,陈辉.挖掘数据流任意滑动时间窗口内频繁模式[J].软件学报,2008,19(10):2585-2596. 被引量:45
  • 7Syed Khakuzzaman Tanbeer, Chowdhury Farhan Ahmed,Byeong-Soo Jeong.Sliding window-based frequent pattern mining overdata streams[J].lnformation Sciences:An International Journal,2009,179(22) :3843-3865.
  • 8Joong Hyuk Chang,Won Suk Lee.A sliding window method forfinding recently frequent itemsets over online data streams[J].Journal of Information Science and Engineering, 2004,20 (4):753-762.

二级参考文献21

  • 1[1]Han J, Kambe M. Data Mining: Concepts and Techniques. Morgaan Kaufmann Publishers, San Francisco, CA, 2001
  • 2[2]Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. Proc. 20th Int'l Conf. Very Large Databases, Santiago, Chile, 1994-09: 497-499
  • 3[3]Han J, Pei J, Yin Y. Mining Frequent Patterns Without Candidate Generation. In Proc. 2000 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD'00), Dalas, TX, 2000-05: 1-12
  • 4[4]Pei J. Pattern Growth Methods for Frequent Pattern Mining [doctor thesis]. Simon Fraser University, 2002-06-13
  • 5[5]Burdick D, Calimlim M, Gehrke J. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases. In Int'l Conf. on Data Engineering, 2001-04
  • 6Jiawei Han.DATA MINING Concepts and Techniques[M].Morgan Kaufmann Publishers.2002-04
  • 7Agrawal Srikant R.Fast Algorithms for Mining Associations Rules[C].In:Proc 1994 Int Conf Very Large Data Bases(VLDB'94),Santiago,1994:487~499
  • 8Gaber MM, Zaslavsky A, Krishnaswamy S. Mining data streams: A review. ACM SIGMOD Record, 2005,34(2): 18-26.
  • 9Jiang N, Gruenwald L. Research issues in data stream association rule mining. ACM SIGMOD Record, 2006,35(1):14-19.
  • 10Garofalakis MN, Gehrke J. Querying and mining data streams: You only get one look a tutorial. In: Franklin MJ, Moon B, Ailamaki A, eds. Proc. of the 2002 ACM SIGMOD Int'l Conf. on Management of Data. Madison: ACM Press, 2002. 635-635.

共引文献70

同被引文献14

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部