基于项头表节点的Fp-growth改进算法

An improved FP-growth algorithm based on item head table node

下载PDF

导出

摘要关联规则中的Fp-growth算法是不产生候选集的代表,将原算法FP-tree和项头表的Node_link字段删除,把Ln当作项头表。对任意频繁项ai,首先找到所有FP-tree节点的item-name与ai的项名相同的节点,对每个树节点寻找它的频繁模式,找到频繁项ai的所有频繁模式可节省1/5树的空间,把Ln当作项头表,省去项头表的空间,从而提高算法效率。实验结果表明,改进后的算法性能优于原算法性能。 In the association rule＇ s mining, FP-growth algorithm is one of the most high effective algorithm, and it doesn＇ t produce the candidate sets. It deletes the original FP-tree algorithm and the node_link field in the item header table, makes Ln as the item header table. For any frequent item ai, at first, it finds all the FP-tree nodes whose item-name has the same item name with ai, looking for the frequent pattern for every node, it can save 1/5 tree space when we found the frequent pattern for every frequent item, makes Ln as the item header table, saves head table space in order to improve the efficiency of the algorithm. The experimental results show that the improved algorithm is better than the original algorithm in performance.

作者陈君葛莉

机构地区渭南师范学院数学与信息科学学院

出处《信息技术》 2012年第12期34-35,40,共3页 Information Technology

基金国家统计局课题项目(2011LY092) 渭南师范学院科研计划项目(12YKZ044)

关键词数据挖掘频繁项目集关联规则 FP-TREE data mining frequent item sets association rules FP-tree

分类号 TP301.6 [自动化与计算机技术—计算机系统结构] TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献7

1Hart J, Pei J, Yin Y. Mining Frequent Patterns Without Candidate Generation[ C]. Proc of the 2000 ACM SIGMOD Int' 1 Conf on Managementof Data,2000 : 1 - 12.
2Pei Jian, Hart Jiawei. Mining sequential patterns by pattern - growth : the pmfixSpan approach [ J ]. IEEE Transactions on Knowledge and Data Engineering,2004,6 (10) : 1 - 17.
3吉根林,杨明,宋余庆,孙志挥.最大频繁项目集的快速更新[J].计算机学报,2005,28(1):128-135. 被引量：47
4朱玉全,汪晓刚.一种新的关联规则增量式更新算法[J].计算机工程,2002,28(4):25-27. 被引量：12
5刘大有,刘亚波,尹治东.关联规则最大频繁项目集的快速发现算法[J].吉林大学学报（理学版）,2004,42(2):212-215. 被引量：10
6Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation[ C]//Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data Dallas, Texas, United States, 2000:1 - 12.
7庹文利,姚勇.基于FP_tree的最大频繁项目集增量式更新算法[J].计算机工程与应用,2009,45(19):117-119. 被引量：2

二级参考文献23

1钟勇发,吕红兵.基于FP-growth的关联规则增量更新算法[J].计算机工程与应用,2004,40(26):174-175. 被引量：5
2Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large databases[C]//Proc of the 1993 ACM SIGMOD Int' 1 Conf on Management of Data(SIGMOD' 93), 1993 : 207-216.
3Agrawal R,Srikant R.Fast algorithms for mining association rules[C]// Proc of the 20th Intel'l Conf on Very Large Data Bases(VLDB' 94), 1994 : 487-499.
4Han J.Mining frequent patterns without candidate generation[C]// Proceedings of the 2000 ACM SIGMOD Conference on Management of Data,Dallas,TX,2000:1-12.
5Cheung D W.Maintenance of discovered association rules in large databases:An incremental updating technique[C]//Proceedings of the 12th International Conference on Data Engineering, 1996:106- 114.
6Han J.W.,Kamber M..Data Mining:Concepts and Techniques.Beijing:Higher Education Press,2001.
7Agrawal R.,ImielinSki T.,Swami A..Mining association rules between sets of items in large database.In:Proceedings of the ACM SIGMOD International Conference on Managementof Data,Washington,DC,1993,2:207-216.
8Srikant A.R..Fast algorithms for mining association rules.In:Proceedings of the 20th International Conference Very Large Data Bases(VLDB’94).Santiago,Chile,1994,487-499.
9Han J.W.,Pei J.,Yin Y..Mining partial periodicity using frequent pattern tree.Simon Fraser University:Technical Report TR-99-10,1999.
10Cheung D.,Han J.W.,Ng V.,Wong V..Maintenance of discovered association rules in large databases:An incremental updating technique.In:Proceedings of the 12th International Conference on Data Engineering(ICDE),New Orleans,Louisiana.1996.106-114.

共引文献64

1谢志强,朱孟杰,杨静.基于改进FP-树的最大项目集挖掘算法[J].计算机应用研究,2009,26(2):502-505. 被引量：1
2胡陈勇,刘大有,刘亚波.一种扩展的关联规则挖掘算法[J].吉林大学学报（理学版）,2005,43(2):153-156. 被引量：1
3孙沛涛,孙俊清.最大频繁项目集的增量式更新算法[J].计算机工程与设计,2005,26(12):3213-3215. 被引量：4
4李红,胡学钢.基于CIE-树的关联规则最大频繁项集的求解[J].计算机工程与应用,2006,42(3):180-182. 被引量：3
5汤亚玲,崔志明.遗传算法在Web关联挖掘中的应用研究[J].微电子学与计算机,2006,23(6):126-129. 被引量：4
6蒙韧,苏毅娟,朱晓峰,张继连.数据挖掘中的增量式关联规则更新算法[J].广西科学院学报,2006,22(2):125-128. 被引量：4
7武坤,李乃雄,魏庆,姜保庆.基于集合枚举树的关联规则生成算法[J].计算机工程与应用,2006,42(26):152-155. 被引量：4
8王涛伟.基于Web日志的频繁访问页面挖掘研究[J].计算机系统应用,2006,15(10):30-34. 被引量：1
9卓月明,覃遵跃,胡斌.基于Rough集的单维布尔关联规则的挖掘算法[J].吉首大学学报（自然科学版）,2006,27(4):64-67.
10胡斌,蒋外文,黄天强,陈生萍,施渊.一种最大频繁项集快速更新算法[J].计算机应用研究,2006,23(12):81-83.

1ARM推出全新IP工具套件[J].单片机与嵌入式系统应用,2015,15(8):86-86.
2吉顺如,万锋,杨泽平.PE-Link与TCP/IP协议转换网关的研究[J].电气自动化,2007,29(2):29-31.
3张绍军,刘辉,孙君强,郑自发.码垛机器人控制系统应用与改造[J].化工管理,2014(17):167-167.
4谭作亘,李光辉.E-link在智能小区建设中的应用[J].低压电器,2004(1):21-23.
5彭坡.在家中享受大影院的效果体验HT 300 E-LINK DLP 投影机[J].家电大视野,2006(2):31-33.
6吴楠,郭培源.基于E-LINK网络数据传输技术的远程机器人通信系统[J].北京工商大学学报（自然科学版）,2007,25(1):52-54. 被引量：1
7免费500MB的免费空间[J].网友世界,2009(3):57-57.
8周斌,李文印.利用E-link扩展单片机的网络接口功能[J].电子技术（上海）,2003,30(5):46-47. 被引量：1
9SIM2 HT500 E-LINK DLP投影机[J].音响世界,2005(7):16-16.
10季铎,郑伟,蔡东风.潜在语义索引中特征优化技术的研究[J].中文信息学报,2009,23(2):69-76. 被引量：7

信息技术

2012年第12期

浏览历史

内容加载中请稍等...

基于项头表节点的Fp-growth改进算法

参考文献7

二级参考文献23

共引文献64

相关作者

相关机构

相关主题

浏览历史