期刊文献+

GTK:A Hybrid-Search Algorithm of Top-Rank-k Frequent Patterns Based on Greedy Strategy 被引量:1

下载PDF
导出
摘要 Currently,the top-rank-k has been widely applied to mine frequent patterns with a rank not exceeding k.In the existing algorithms,although a level-wise-search could fully mine the target patterns,it usually leads to the delay of high rank patterns generation,resulting in the slow growth of the support threshold and the mining efficiency.Aiming at this problem,a greedy-strategy-based top-rank-k frequent patterns hybrid mining algorithm(GTK)is proposed in this paper.In this algorithm,top-rank-k patterns are stored in a static doubly linked list called RSL,and the patterns are divided into short patterns and long patterns.The short patterns generated by a rank-first-search always joins the two patterns of the highest rank in RSL that have not yet been joined.On the basis of the short patterns satisfying specific conditions,the long patterns are extracted through level-wise-search.To reduce redundancy,GTK improves the generation method of subsume index and designs the new pruning strategies of candidates.This algorithm also takes the use of reasonable pruning strategies to reduce the amount of computation to improve the computational speed.Real datasets and synthetic datasets are adopted in experiments to evaluate the proposed algorithm.The experimental results show the obvious advantages in both time efficiency and space efficiency of GTK.
出处 《Computers, Materials & Continua》 SCIE EI 2020年第6期1445-1469,共25页 计算机、材料和连续体(英文)
基金 This research was supported in part by the Hunan Province’s Strategic and Emerging Industrial Projects under Grant 2018GK4035 in part by the Hunan Province’s Changsha Zhuzhou Xiangtan National Independent Innovation Demonstration Zone projects under Grant 2017XK2058 in part by the National Natural Science Foundation of China under Grant 61602171 in part by the Scientific Research Fund of Hunan Provincial Education Department under Grant 17C0960 and 18B037.
  • 相关文献

参考文献2

二级参考文献32

  • 1HaHan J W, Pei J, Yin Y W. Mining frequent itemsets without candidate generation. In: The 2000 ACM SIGMOD International Conference on Management of data (SIGMOD’00), New York, 2000. 1-12.
  • 2AgAgrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: The 1993 ACM SIGMOD International Conference on Management of Data (SIGMOD’93), Washington, 1993. 207-216.
  • 3HaHan J, Cheng H, Xin D, et al. Frequent itemset mining: current status and future directions. Data Min Knowl Discov,2007, 15: 55-86.
  • 4BaBaralis E, Cerquitelli T, Chiusano S. IMine: index support for item set mining. IEEE TKDE J, 2009, 21: 493-506.
  • 5ZaZaki M J, Gouda K. Fast vertical mining using diffsets, In: The 9th ACM SIGKDD International Conference on. Knowledge Discovery and Data Mining (SIGKDD’03), Washington, 2003. 326-335.
  • 6DeDeng Z H, Wang Z H. A new fast vertical method for mining frequent itemsets. Int J Comput Intell Syst, 2010, 3:733-744.
  • 7AgAgrawal R, Srikant R. Fast algorithm for mining Association rules. In: The 20th International Conference on Very Large Data Bases (VLDB’94), Santiago de Chile, 1994. 487-499.
  • 8SaSavasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules in large databases. In: The21th International Conference on Very Large Data Bases (VLDB’95), Zurich, 1995. 432-443.
  • 9ShShenoy P, Haritsa J R, Sundarshan S, et al. Turbo-charging vertical mining of large databases. In: ACM International Conference on Management of Data and Symposium on Principles of Database Systems (SIGMOD’00), Dallas, 2000.22-33.
  • 10ZZaki M J. Scalable algorithms for association mining. IEEE TKDE J, 2000, 12: 372-390.

共引文献24

同被引文献1

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部