基于充分挖掘增量事务的关联规则更新算法被引量：3

Updating Algorithm for Association Rules Based on Fully Mining Incremental Transactions

下载PDF

导出

摘要目前已提出了许多快速的关联规则增量更新挖掘算法,但是它们在处理对新增事务敏感的问题时,往往会丢失一些重要规则。为此,文章提出了一种新的挖掘增量更新后的数据库中频繁项集的算法EUFIA(Entirety Update Frequent Itemsets Algorithm),该算法先对新增事务数据分区,然后快速扫描各分区,能全面有效地挖掘出其中的频繁项集,且不丢失重要规则。同时,最多只扫描1次原数据库也能获得更新后事务数据库的全局频繁项集。研究表明,该算法具有很好的可测量性。 Incremental Association rules Mining is an important content of data mining technology. This study proposes a new algorithm, called the Entirety Update Frequent Itemsets Algorithm （EUFIA） for efficiently incrementally mining association rules from large transaction database. Rather than rescanning the original database for some new generated frequent itemsets, EUFIA partitions the incremental database logically according to unit time interval, then accumulates the occurrence counts of new generated frequent itemsets and deletes infrequent itemsets obviously by backward method. Thus, EUFIA can discover newly generated frequent itemsets more efficiently and need rescan the original database only once to get overall frequent itemsets in the final database if necessary. EUFIA has good scalability in our simulation.

作者蔡进薛永生林丽张东站

机构地区厦门大学计算机科学系

出处《计算机科学》 CSCD 北大核心 2007年第2期220-222,233,共4页 Computer Science

基金国家自然科学基金项目(50474033) 福建省自然科学基金项目(A0310008) 福建省高新技术研究开放计划重点项目(2003H043)

关键词关联规则增量式更新强频繁项集次频繁项集弱频繁项集 Association rules, Incremental updating, Powerful frequent itemsets, Inferior frequent itemsets, Weak frequent itemsets

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献15

1Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In; Proceedings of ACM SIGMOD International Conference on Management of Date,Washington D C, 1993. 207-216
2Agrawal R, Srikant R. Fast algorithm for mining association rules, In: Proceedings of the 20th International Conference on VLDB,Santiago,Chile, 1994. 487-499
3Han J ,Kamber M. Data Mining: Concepts and Techniques. Beijing : Higher Education Press, 2001
4Han J,Jian P, et al. Mining frequent patterns without candidate generation. In: Proceedings of ACM SIGMOD International Conference on Management of Data,Dallas,TX,May 2000. 1-12
5Shoemaker C, Ruiz C. Association Rule Mining Algorithms for Set-valued Data. In:Proc. 4th Intl. Conf. on Intelligent Data Engineering and Automated Learning. LNCS Vol. 2690, Hong Kong, china, 2003. 669-676
6Wang Jianyong, Han J, Lu Y, Tzvetkov P. TFP:an efficient algorithm for mining top-k frequent closed itemsets. Knowledge and Data Engineering, IEEE Transactions, 2005,17(5) : 652-663
7颜跃进,李舟军,陈火旺.频繁项集挖掘算法[J].计算机科学,2004,31(3):112-114. 被引量：20
8Cheung D W, et al, Maintenance of discovered association rules in large databases: an incremental updating technique, In; Proceedings of the 12th International Conference on Data Engineering, New Orleans, Louisana, 1996. 106-114
9Cheung DW, LEE SD, Kao B. A general incremental technique for maintaining discovered association rules. In: Topor RW, Tanaka K,eds. Proc. of the 5th Int'l Conf. on Database Systems for Advanced Applications. World Scientific, 1997. 185-194
10Hong T P, Wang C Y, Tao Y H. A new incremental data mining algorithm using pre-large itemsets. Intelligent Data Analysis,2001,5(2): 111-129

二级参考文献36

1[1]Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD International Conference on Management of Date, Washington DC, 1993.207～216
2[2]Agrawal R, Srikant R. Fast algorithm for mining association rules. In: Proceedings of the 20th International Conference on VLDB, Santiago, Chile, 1994. 487～499
3[3]Han J, Kamber M. Data Mining: Concepts and Techniques. Beijing: Higher Education Press, 2001
4[5]Agrawal R, Shafer J C. Parallel mining of association rules:Design, implementation, and experience. IBM Research Report RJ 10004,1996
5[6]Savasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules. In: Proceedings of the 21th International Conference on VLDB, Zurich, Switzerland, 1995. 432～444
6[7]Hah J, Jian P et al. Mining frequent patterns without candidate generation. In: Proceedings of ACM SIGMOD International Conference on Management of Data, Dallas, TX, 2000.1～12
7[8]Cheung D W, Lee S D, Kao B. A general incremental technique for maintaining discovered association rules. In: Proceedings of databases systems for advanced applications, Melbourne, Australia, 1997. 185～194
8[10]Han J, Jian P. Mining access patterns efficiently from web logs. In: Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'00), Kyoto, Japan,2000. 396～407
9[11]Agrawal R, Srikant R. Mining sequential pattern. In: Proceedings of the 11th International Conference on Data Engineering, Taipei, 1995. 3～14
10Liu J Q,Pan Y H,Wang K,Han J W. Mining Frequent Item Sets by Opportunistic Projection, KDD'02, Edmonton, Canada, July 2002

共引文献97

1徐龙,杨君锐.基于数据库变化的关联规则增量式更新算法[J].重庆科技学院学报（自然科学版）,2007,9(4):67-70. 被引量：1
2易彤,徐宝文,吴方君.一种基于FP树的挖掘关联规则的增量更新算法[J].计算机学报,2004,27(5):703-710. 被引量：32
3邓小妮,罗雪山.一种基于事务时间分割的关联规则增量式更新方法[J].计算机工程与应用,2004,40(23):176-179. 被引量：1
4朱玉全,宋余庆,陈耿.约束最大频繁项目集的增量式更新算法[J].计算机工程,2004,30(18):31-32.
5杨君锐.频繁项目集二次挖掘方法研究[J].系统工程与电子技术,2004,26(11):1701-1704.
6李清峰,杨路明,张晓峰.关联规则中最大频繁项目集的研究[J].计算机应用研究,2005,22(1):93-95. 被引量：3
7缪红保,李卫.基于数据挖掘的用户安全行为分析[J].计算机应用研究,2005,22(2):105-107. 被引量：11
8郭伟,唐晓君,刘万军.一种基于划分的聚类算法分析与改进[J].辽宁工程技术大学学报（自然科学版）,2004,23(6):826-828. 被引量：4
9李华君,周海岩.基于项目集知识库的关联规则挖掘与更新的高效算法[J].计算机工程与设计,2004,25(12):2198-2201. 被引量：4
10颜跃进,李舟军,陈火旺.基于FP-Tree有效挖掘最大频繁项集[J].软件学报,2005,16(2):215-222. 被引量：68

同被引文献34

1李清峰,杨路明,张晓峰,龙艳军.数据挖掘中关联规则的一种高效Apriori算法[J].计算机应用与软件,2004,21(12):84-86. 被引量：29
2董祥军,王淑静,宋瀚涛,陆玉昌.负关联规则的研究[J].北京理工大学学报,2004,24(11):978-981. 被引量：33
3高俊,施伯乐.快速关联规则挖掘算法研究[J].计算机科学,2005,32(3):200-201. 被引量：10
4张师超,张继连,陈峰,倪艾玲.负增量式关联规则更新算法[J].计算机科学,2005,32(9):153-155. 被引量：7
5梁志瑞,陈鹏,苏海锋.关联规则挖掘在电厂设备故障监测中应用[J].电力自动化设备,2006,26(6):17-19. 被引量：20
6颜宏文.市场环境下发电机组竞争能力的关联规则挖掘方法[J].电网技术,2006,30(13):61-65. 被引量：3
7杨秀金,孟军.基于频繁模式表的增量更新算法[J].计算机应用,2006,26(B06):110-112. 被引量：2
8刘德喜,何炎祥,邢显黎.一种新的频繁项集挖掘算法[J].计算机应用研究,2007,24(2):17-19. 被引量：8
9邹力鹍,张其善.基于多最小支持度的加权关联规则挖掘算法[J].北京航空航天大学学报,2007,33(5):590-593. 被引量：17
10马占欣,陆玉昌.负关联规则挖掘中的频繁项集爆炸问题[J].清华大学学报（自然科学版）,2007,47(7):1212-1215. 被引量：10

引证文献3

1戴小廷.Apriori算法的改进及其在电力数据挖掘中的应用[J].沈阳理工大学学报,2010,29(1):18-22. 被引量：5
2陈智,梁娟.基于GM(1,1)模型的元规则挖掘研究[J].微计算机信息,2012,28(4):175-176. 被引量：1
3王斌,马俊杰,房新秀,魏天佑.基于时间戳和垂直格式的关联规则挖掘算法[J].计算机科学,2019,46(10):71-76. 被引量：7

二级引证文献13

1郎振红.网络化物业管理系统中数据挖掘的应用[J].沈阳教育学院学报,2011,13(4):88-91.
2陈立军,张亚红,海冉冉.一种新型融合离群点的稳态检测方法[J].化工自动化及仪表,2013,40(5):582-586. 被引量：3
3李秋硕,王岩,孙宇军,肖勇,卢耿城.Apriori算法在用电客户交互痕迹分析中的应用[J].自动化与仪器仪表,2017(3):181-184. 被引量：1
4刘志先,赵荣阳.数据挖掘中改进的Apriori算法的应用[J].信息记录材料,2017,18(1):39-40.
5房千淏.电子点单方式下关联分析在菜品建议方面的应用[J].电脑编程技巧与维护,2020,0(4):84-85.
6李珺,刘鹤,朱良宽.基于Apriori关联规则算法的草莓叶片含水状况研究[J].北方园艺,2020(19):146-151. 被引量：1
7纪文璐,王海龙,苏贵斌,柳林.基于关联规则算法的推荐方法研究综述[J].计算机工程与应用,2020,56(22):33-41. 被引量：48
8覃健荣,梁盛盛,韦宗慧,陈勇成.基于区块链的电力交易数据安全存储研究[J].电子技术与软件工程,2021(24):154-157.
9叶伟,邓刚,叶攀,宋海波.基于关联规则算法的电池剩余电量数据监测方法[J].电子设计工程,2022,30(13):105-108. 被引量：2
10沈毅波.RBF神经网络在关联数据一致性挖掘中的应用[J].福建电脑,2022,38(8):5-9.

1吴文妹.一种高效的关联规则更新算法[J].福建电脑,2006,22(6):117-118.
2邓甦,付长贺.一种最小支持度变小的关联规则更新算法[J].辽宁教育行政学院学报,2005,22(11):92-93.
3王晗,孔令富.一种新的增量式关联规则数据挖掘方法研究[J].仪器仪表学报,2009,30(2):438-443. 被引量：10
4张秀玉.基于现有数据挖掘结果的关联规则更新算法[J].闽江学院学报,2005,26(5):58-61. 被引量：1
5张秀玉.基于现有数据挖掘结果的关联规则更新算法[J].福建信息技术教育,2005,0(3):2-5.
6邹长忠,傅清祥.分布式数据库的关联规则更新算法[J].福州大学学报（自然科学版）,2008,36(5):655-659. 被引量：1
7刘学敏,王枫.一种改进的指纹图像增强算法研究[J].信息通信,2014,27(6):37-38. 被引量：1
8张海芹,须文波.基于移动Agent的新型分布式入侵检测系统[J].微计算机信息,2006,22(08X):76-77. 被引量：8
9王霞,陈云志.一种基于MA的分布式入侵检测系统研究[J].微计算机信息,2006(05X):61-63. 被引量：3
10陆颖.指纹自动识别原理与方法综述[J].工程数学学报,2004,21(6):1003-1010. 被引量：14

计算机科学

2007年第2期

浏览历史

内容加载中请稍等...

基于充分挖掘增量事务的关联规则更新算法被引量：3

参考文献15

二级参考文献36

共引文献97

同被引文献34

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

基于充分挖掘增量事务的关联规则更新算法 被引量：3

参考文献15

二级参考文献36

共引文献97

同被引文献34

引证文献3

二级引证文献13

相关作者

相关机构

相关主题

浏览历史

基于充分挖掘增量事务的关联规则更新算法被引量：3