An Improved Algorithm Based on Maximum Clique and FP-Tree for Mining Association Rules

Cited by: 30
Abstract: This paper combines the strengths of the FP-Tree algorithm for association rule mining with the maximum clique theory of graph theory. The main contributions are: (1) an improved method that uses an adjacency matrix to generate frequent 2-itemsets; (2) the concept of the maximum ordered frequent itemset, together with proofs of the equivalence of the Head relation, the partition theorem, the local complexity theorem, and the merge convergence range theorem; (3) the MaxCFPTree algorithm, based on maximum-clique partitioning, proposed and implemented with a scan time complexity below O(n^2); (4) experiments that validate the correctness of the algorithm. The new method eases the conflict between a very large number of items and limited memory, and improves system efficiency and scalability.
Source: Journal of Software (《软件学报》; indexed in EI, CSCD, and the PKU Core Journals list), 2004, No. 8, pp. 1198-1207 (10 pages)
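Funding: National Natural Science Foundation of China; Doctoral Program Special Fund of the Ministry of Education of China; Guangxi Natural Science Foundation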
Keywords: association rules; FP-tree; maximum clique; adjacency matrix; merge convergence
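
The abstract names two building blocks: generating frequent 2-itemsets from an adjacency matrix, and partitioning items by cliques. The record does not give the paper's actual procedure (the Head relation, the ordered itemsets, and MaxCFPTree itself are not described here), so the following is only a minimal sketch of the general idea under stated assumptions. The function names `frequent_pairs` and `maximal_cliques` are hypothetical, the toy transactions are invented for illustration, and the clique step uses the standard Bron-Kerbosch enumeration rather than whatever method the authors use.

```python
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Count item co-occurrences in one pass over the data with an adjacency matrix."""
    items = sorted({item for t in transactions for item in t})
    index = {item: i for i, item in enumerate(items)}
    n = len(items)
    # matrix[i][j] with i < j counts the transactions that contain both items.
    matrix = [[0] * n for _ in range(n)]
    for t in transactions:
        ids = sorted({index[item] for item in t})
        for i, j in combinations(ids, 2):
            matrix[i][j] += 1
    # Keep only the pairs whose count reaches the support threshold.
    return {
        (items[i], items[j]): matrix[i][j]
        for i in range(n)
        for j in range(i + 1, n)
        if matrix[i][j] >= min_support
    }


def maximal_cliques(pairs):
    """Enumerate maximal cliques of the graph whose edges are the frequent pairs.

    Assumption: plain Bron-Kerbosch without pivoting, chosen for brevity.
    """
    neighbours = {}
    for a, b in pairs:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    cliques = []

    def expand(current, candidates, excluded):
        if not candidates and not excluded:
            cliques.append(sorted(current))  # nothing can extend this clique
            return
        for v in sorted(candidates):
            expand(current | {v}, candidates & neighbours[v], excluded & neighbours[v])
            candidates = candidates - {v}
            excluded = excluded | {v}

    expand(set(), set(neighbours), set())
    return sorted(cliques)


# Invented toy data, not taken from the paper.
transactions = [
    {"a", "b", "c"},
    {"a", "b", "c"},
    {"c", "d"},
    {"c", "d", "e"},
    {"d", "e"},
]
pairs = frequent_pairs(transactions, min_support=2)
groups = maximal_cliques(pairs)
```

Each resulting group is a set of items that are pairwise frequent, so a frequent itemset can only draw its items from within one group. That is the intuition behind mining each group separately with a smaller FP-tree; how the paper actually splits and merges the groups is not stated in this record.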