期刊文献+

一种基于位运算的频繁闭项集挖掘算法 被引量:3

Algorithm based on bit operation for mining frequent closed itemsets
下载PDF
导出
摘要 针对相关算法在挖掘频繁闭项集时所存在的问题,提出了一种基于位运算的频繁闭项集挖掘算法。该算法首先将数据集转换成布尔矩阵,只需扫描数据集一次;通过位运算计算支持度,利用矩阵和数组存储辅助信息,减少时间和空间消耗;深度优先搜索产生频繁闭项集时利用剪枝策略进一步减少挖掘时间;利用同生项集性质进行闭合性检测,无须检查超集或子集。理论分析和实验结果验证了该算法的有效性。 Aiming at the problems of mining frequent closed itemsets, this paper proposed an algorithm based on bit operation for mining frequent closed itemsets (MFCIS). Firstly, the algorithm used the vector to express items in database and scaned the database for only one time. Secondly it computed the support of itemsets through the bit operation and used the matrice and the array to store the ancillary information to reduce the time and memeory, and used pruning technology to improve the mining efficiency during creating the frequent closed itemsets by depth-first search. Finally, it used the nature of syngenetic itemsets to test frequent closed itemsets so as not to test superset or subset. Theoretical analysis and experimental results show that the algorithm is efficient.
出处 《计算机应用研究》 CSCD 北大核心 2013年第11期3280-3282,3286,共4页 Application Research of Computers
基金 四川省科技厅资助项目(2011JY0141) 四川省教育厅资助项目(12ZB171)
关键词 数据挖掘 频繁闭项集 矩阵 位运算 同生项集 data mining frequent closed itemsets matrix bit operation syngenetic itemsets
  • 相关文献

参考文献9

  • 1PASQUIER N, BASTIDE Y, TAOUIL R, et al. Discovering frequent closed itemsets for association rules[ C ]//Proc of the 7 th International Conference on Database Theory. [ S. 1. ] :Springer,1999:398-416.
  • 2PEI Jian, HAN Jia-wei, MAO Run-ying. CLOSET: an efficient algo- rithm for mining frequent closed itemsets[ C ]//Proc of ACM SIGMOD International Workshop on Data Mining and Knowledge Discovery.2000 : 11 - 20.
  • 3WANG Jian-yong, HAN Jia-wei, PEI Jian. CLOSET + :searching for the best strategies for mining frequent closed itemsets [ C ]//Proc of the 9th ACM SIGKDD Intemational Conference on Knowledge Disco- very and Data Mining. 2003:236-245.
  • 4ZAKI M J, HSIAO C J. CHARM:an efficient algorithm for closed itemset mining[ C ]//Proc of the 2nd SIAM International Conference on Data Mining. 2002:457-473.
  • 5ZHANG Tie-jun, YANG Jun-rni, WANG Xiu-qin. An algorithm for mining frequent closed itemsets [ C ]//Proc of the 3rd International Conference on Intelligent System and Knowledge Engineering. 2008 : 240- 245.
  • 6QIU Yong, LAN Yong-jie. Mining frequent closed itemsets with one database scanning[ C ]//Proc of the 5th International Conference on Machine Learning and Cybernetics. 2006 : 1326-1331.
  • 7牛小飞,石冰,卢军,吴科.挖掘关联规则的高效ABM算法[J].计算机工程,2004,30(11):118-120. 被引量:16
  • 8徐嘉莉,陈佳,胡庆,黄波,郭红霞.基于向量的数据流滑动窗口中最大频繁项集挖掘[J].计算机应用研究,2012,29(3):837-840. 被引量:7
  • 9吴春旭,陈家耀,刘博文.一种挖掘频繁闭项集的改进算法[J].计算机系统应用,2008,17(10):32-35. 被引量:1

二级参考文献21

  • 1牛小飞,石冰,卢军,吴科.挖掘关联规则的高效ABM算法[J].计算机工程,2004,30(11):118-120. 被引量:16
  • 2[1]Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. In Proceeding of the 20th International Conference on Very Large Data Bases, 1994-09: 487-499
  • 3[2]Park J S, Chen Mingsyan, Yu P S. An Effective Hash-based Algorithm for Mining Association Rules. In Proceedings of ACM SIGMOD, 1995 24(2):175-186
  • 4[3]Han Jiawei, Kamber K. Data Mining Concepts and Techniques. 2000- 08
  • 5[1]Pasquier N,Bastide Y,Taouil R,Lakhal L.Discovering frequent closed itemsets for association rules.In:Proceeding of the 7th International Conference on Database Theory (ICDT99).Jerusalem,1999.398-416.
  • 6[2]Pei J,Han J.CLOSET:An efficient algorithm for mining frequent closed itemsets.In:Proceeding of the 2000 ACM-SIGMOD Interna-tional Workshop Data Mining and Knowledge Discovery (DMKD00).Dallas,2000.11 -20.
  • 7[3]Zaki,Hsiao.CHARM:An efficient algorithm for closed itemset mining.In:Proceeding of the 2002 SIAM International Conference on Data Mining (SDM 02).Arlington,2002.457 -473.
  • 8[4]J Wang,J Han,J Pei.CLOSET +:Searching for the best strategies for mining frequent closed Itemsets.In:Proceeding of the 2003 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD03).Washington,2003.236-245.
  • 9[5]Wang J,Han J,Lu Y,Tzvetkov P.TFP:An Efficient Algorithm for Mining Top-K Frequent Closed Itemsets.IEEE Trans Knowl Data Eng,2005,17 (5):652-664.
  • 10BABCOCK B,BABU S,DATAR M, et al. Models and issues in data stream systems [ C ]//Proc of the 21 st ACM SIGMOD-SIGART Sympo- sium on Principles of Database System. New York:ACM Press,2002: 1-16.

共引文献21

同被引文献28

引证文献3

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部