MLFI:新的最大长度频繁项集挖掘方法被引量：1

MLFI:New method for maximum length frequent itemsets mining

下载PDF

导出

摘要在理解现有的最大长度频繁项集挖掘问题的定义,探索最大长度频繁项集的几个具体应用后,提出了一种新的基于FP-tree(Frequent Pattern tree)结构的最大长度频繁项集挖掘方法——MLFI算法。该算法仅对初始的FP-tree实现遍历操作,从而完成对最大长度频繁项集的挖掘。在算法整个执行过程中,仅用到了一棵初始的FP-tree。理论分析和实验证明,该算法加快了挖掘速度,提高了挖掘效率。 After the current definition of the maximum length frequent itemsets mining problem is understood and its many practical applications are explored,an FP-tree-based algorithm is proposed for the mining problem.Maximum length frequent itemsets are mined while traversing the FP-tree in the algorithm.There is only an initial FP-tree.Theoretic analysis and experiments show that the algorithm accelerates the speed to traverse the tree and improves the mining efficiency.

作者张忠平郭静韩丽霞

机构地区燕山大学信息科学与工程学院

出处《计算机工程与应用》 CSCD 北大核心 2010年第16期140-142,共3页 Computer Engineering and Applications

基金国家自然科学基金(No.60773100) 河北省教育厅科研计划项目(No.2006143)~~

关键词数据挖掘频繁项集最大长度频繁项集频繁模式树 data mining frequent itemsets maximum length frequent itemsets FP-tree

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献7

1Hipp J,Guntzer U,Nakaeizadeh G.Algorithrns for association rule mining-A genera] survey and comparison[C] //ACM SIGKDD International Conference on Management of Data,2000:58-64.
2Pei J,Han J,Pinto H,et al.PrefixSpan:Mining sequential patterns efficiently by prefix-projocted pattern growth[C] //IEEE International Conference on Data Engineering,2001:215-224.
3Xiong I I,Tan P N,Kumar V.Mining strong affinity association patterns in data sets with skewed support distribution[C] //IEEE International Conference on Data Mining,2003:387-394.
4Hu Tian-ming,Fu Qian,Wang Xiao-nan,et al.Mining maximum length frequent itemsets:A summary of results[C] //The 18th IEEE International Conference on Tools with Artificial Intelligence,2006:505-512.
5Hart J,Pei J,Yin Y.Mining frequent patterns without candidate generation[C] //Proc ACM SIGMOD,2000,29(2):1-12.
6王静红,刘教民,郭盛,孙亚非.一种新型快速建立频繁模式树的方法[J].计算机应用,2008,28(3):735-737. 被引量：2
7UCI machine learning repository[EB/OL].Univ of CA,Irvine.http://archive.ies.uei.edu/ml/.

二级参考文献10

1AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[ C]//Proceedings of the 20th International Conference on Very Large Data Bases. San Francisco, CA: Morgan Kaufmann Publishers, 1994:487 -499.
2AGRAWAL R, SRIKANT R. Mining sequential patterns[ C]//Proceedings of the llth International Conference on Data Engineering. Los Alamitos, CA: IEEE Computer Society Press, 1995:3 - 14.
3SRIKANT R, AGRAWAL R. Mining sequential patterns: Generalizations and performance improvements[ C]// Proceedings of the 5th International Conference on Extending Database Technology. London: Springer-Verlag, 1996:3 - 17.
4MASSEGLIA F, CATHALA F, PONCELET P. The PSP approach for mining sequential patterns[ C]//Proceedings of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery. Nantes, France: [s. n. ], 1998:176 - 184.
5HAN J, PEI J, MORTAZAVI- ASL B, et al. FreeSpan: Frequent pattern-projected sequential pattern mining[ C]//Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM Press, 2000:355 -359.
6PEI J, HAN J, PINTO H, et al. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth[ C] // Proceedings of the 17th International Conference on Data Engineering. Los Alamitos, CA: IEEE Computer Society Press, 2001:215 -224.
7HAN J, PEN J, YIN Y. Mining frequent patterns without candidate generation[ C]// Proceedings of 2000 ACM-SIMOD. New York: ACM Press, 2000:1 - 12.
8GRAHNE G, ZHU J F. High performance mining of maximal frequent itemsets[ C]// Proceedings of the 6th SIAM International Workshop on High Performance Data Mining. [S. l.]: SIAM, 2003:135 - 143.
9PASQUIER N, BASTIDE Y, TAOUIL R, et al. Discovering frequent closed itemsets for association rules[ C]// Proceedings of International Conference on Database Theory, LNCS 1540. London, UK: Springer, 1999:398-416.
10IBM Almaden Research Center . Quest synthetic data generation [ EB/OL]. [ 2007 - 10 - 01]. http://www. almaden. ibm. com/ software/quest/Resources/datasets/syndata.htnl.

共引文献1

1刘敏娴,马强.基于混合型的Web实时推荐模型研究[J].计算机工程与设计,2011,32(10):3518-3521. 被引量：3

同被引文献4

1Hu Tianming, Fu Qian, Wang Xiaonan, et al.Mining maxi- mum length frequent itemsets:a summary of results[C]// The 18th IEEE International Conference on Tools with Artificial Intelligence, 2006: 505-512.
2Pei J, Han J, Pinto H, et al.PrefixSpan:mining sequential patterns efficiently by prefix-projected pattern growth[C]// IEEE International Conference on Data Engineering,2001: 215-224.
3UCI machine learning repository[EB/OL].Univ of CA, Ir- vine.http://archive.ics.uci.edu/ml/.
4陈晨,鞠时光.基于改进FP-tree的最大频繁项集挖掘算法[J].计算机工程与设计,2008,29(24):6236-6239. 被引量：14

引证文献1

1廖福蓉,王成良.基于有序FP-tree的最大长度频繁项集挖掘算法[J].计算机工程与应用,2012,48(30):147-150. 被引量：4

二级引证文献4

1田庆,朱俊岭,刘永梅.FP-Growth算法在电子商务中的应用[J].科技与企业,2014(14):148-149.
2彭卫平,蒋瑞,雷金,陈磊,张秋华,胡向阳,窦俊豪.面向DFMC的广义模块间包含性关系分析[J].中南大学学报（自然科学版）,2018,49(6):1414-1423.
3邱云飞,赵彬,林明明,王伟.结合语义改进的K-means短文本聚类算法[J].计算机工程与应用,2016,52(19):78-83. 被引量：14
4王利军,唐立.基于有序FP-tree结构和二维表的最大频繁模式挖掘算法[J].韶关学院学报,2019,40(9):21-25.

1廖福蓉,王成良.基于有序FP-tree的最大长度频繁项集挖掘算法[J].计算机工程与应用,2012,48(30):147-150. 被引量：4
2战立强,刘大昕.频繁项集快速挖掘算法研究[J].哈尔滨工程大学学报,2008,29(3):266-271. 被引量：11
3李希春.二叉树的一种新存储结构[J].计算机学报,1996,19(7):554-557. 被引量：5
4刘胤杰,周家超.树型数据结构的探讨[J].江南学院学报,1999,14(4):41-44. 被引量：5
5张宁.基于FP-tree的Apriori算法的改进[J].信息通信,2015,28(2):94-95. 被引量：4
6邱京伟.粒关联规则挖掘的一种改进算法[J].漳州师范学院学报（自然科学版）,2013,26(2):18-22. 被引量：2
7董旭初,欧阳丹彤,刘大有.Bayesian网推理中的化简方法[J].吉林大学学报（理学版）,2004,42(1):77-83. 被引量：9
8张丽丽,杜鹃,贾亮.改进的支持向量机SMO算法说话人识别系统研究[J].长春理工大学学报（自然科学版）,2009,32(2):279-281. 被引量：1
9白川平.多数据流的频繁模式挖掘算法研究[J].宁夏师范学院学报,2014,35(3):86-89. 被引量：1
10敬茂华,李国瑞,史闻博,才书训.一种改进的从NFA到DFA的转换算法[J].东北大学学报（自然科学版）,2012,33(4):482-485. 被引量：1

计算机工程与应用

2010年第16期

浏览历史

内容加载中请稍等...

MLFI:新的最大长度频繁项集挖掘方法被引量：1

参考文献7

二级参考文献10

共引文献1

同被引文献4

引证文献1

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

MLFI:新的最大长度频繁项集挖掘方法 被引量：1

参考文献7

二级参考文献10

共引文献1

同被引文献4

引证文献1

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

MLFI:新的最大长度频繁项集挖掘方法被引量：1