分布式数据库关联规则更新算法

Updating Mining Algorithm for Distributed Association Rules

下载PDF

导出

摘要提出了一种分布式关联规则增量更新算法(IUAAR),它可对数据库发生变化的情况进行归类.该算法主要采用改进了的FP树结构,通过传送被约束子树来挖掘全局频繁项目集,并充分利用快速分布式挖掘算法建立的各局部FP树,只对新增加了的全局频繁项目修改相应的改进FP树,挖掘其对应的被约束子树,同时利用已挖掘的全局频繁项目集对原全局频繁项目对应的被约束子树进行有效修剪.实验结果表明,该算法的运算速度比快速分布式挖掘算法提高了1倍,在最坏的情况下,对各局部数据库也仅需要扫描一遍,从而可提高数据库的维护效率. A new algorithm IUAAR （incremental updating algorithm for association rules） is introduced, by which the change of database records can be classified. The improved FP-tree structure is adopted and the global frequent itemsets are mined by transmitting constrained sub-tree. Utilizing the local FP-tree created by FDMA （fast distributed mining algorithm） only the FP-tree of the added global frequent items is modified. Moreover, using the mined results, the constrained sub-trees of the incremental global frequent itemsets that are transmitted in network are mined. The constrained sub-trees of the original global frequent itemsets can be pruned without transmitting therm Experiments show that in the worst case, IUAAR only scans every local transaction database once, thus the communication cost is dramatically decreased and the maintenance efficiency of global frequent itemsets is improved, and the mining speed of IUAAR algorithm is increased by at least two times in comoarison with FDMA.

作者宋宝莉覃征

机构地区西安交通大学计算机科学与技术系

出处《西安交通大学学报》 EI CAS CSCD 北大核心 2007年第4期416-420,共5页 Journal of Xi'an Jiaotong University

基金国家自然科学基金资助项目(60542004)

关键词分布式数据库全局频繁项目集约束子树增量更新 distributed database global frequent itemset constrained sub-tree incremental updating

分类号 TP301 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献8

1Agrawal R,Srikant R.Fast algorithms for mining association rules[EB/OL].[2006-05-11].http://www.rsrikant.com/papers/vldb94.pdf.
2Cheung D W,Han J W,Ng V T,et al.Maintenance of discovered association rules in large databases:an incremental updating technique[C] // Proceedings of the 12th International Conference on Data Engineering.Los Alamitos:IEEE Computer Society,1996:106-114.
3易彤,徐宝文,吴方君.一种基于FP树的挖掘关联规则的增量更新算法[J].计算机学报,2004,27(5):703-710. 被引量：32
4Agrawal R,Shafer J.Parallel mining of association rules[J].IEEE Trans on Knowledge and Data Engineering,1996,8(6):962-969.
5Schuster A,Wolff R.Communication efficient distributed mining of association rules[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM,2001:473-484.
6宋宝莉,覃征.分布式全局频繁项目集的快速挖掘方法[J].西安交通大学学报,2006,40(8):923-927. 被引量：11
7冯玉才,冯剑琳.关联规则的增量式更新算法[J].软件学报,1998,9(4):301-306. 被引量：227
8杨明,孙志挥,宋余庆.快速更新全局频繁项目集[J].软件学报,2004,15(8):1189-1197. 被引量：18

二级参考文献33

1Ramaswamy S. et al.. On the discovery of interesting patterns in association rules. In: Proceedings of the 24th International Conference on Very Large Data Bases (VLDB), New York, 1998, 368～379
2Srikant R. et al.. Mining quantitative association rules in large relational tables. In: Proceedings of the 1996 ACM SIGMOD Conference on Management of Data, Montreal, 1996, 1～12
3Srikant R. et al.. Mining generalized association rules. In: Proceedings of the 21st International Conference on Very Large Data Bases (VLDB), Zurich, Switzerland, 1995, 407～419
4Pen J. et al.. CLOSET: An efficient algorithm mining frequent closed itemsets. In: Proceedings of the 2000 ACM SIGMOD International Workshop on Data Mining and Knowledge Discovery, Dallas, TX, 2000, 11～20
5Zaki M.J. et al.. CHARM: An efficient algorithm for closed association rule mining. Computer Science, Rensselaer Polytechnic Institute, Troy, New York: Technical Report 99-10, 1999, 1～24
6Han J. et al.. Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD Conference On Management of Data, Dallas, TX, 2000, 1～12
7Bing Liu et al.. Analyzing the subjective interestingness of association rules. Intelligent Systems, 2000, 15(5): 47～55
8Cheung D.W. et al.. Maintenance of discovered association rules in large databases: an incremental updating technique. In: Proceedings of the 1996 International Conference on Data Engineering, New Orleans, Louisiana, 1996, 106～114
9Feldman R. et al.. Efficient algorithms for discovering frequent sets in incremental databases. In: Proceedings of the 1997 ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, Tucson, Arizona, 1997, 59～66
10Agrawal R. et al.. Mining associations between sets of items in massive databases. In: Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington D C, 1993, 207～216

共引文献270

1徐龙,杨君锐.基于数据库变化的关联规则增量式更新算法[J].重庆科技学院学报（自然科学版）,2007,9(4):67-70. 被引量：1
2简友光,简曙光.空间数据关联规则挖掘研究综述[J].计算机与数字工程,2007,35(7):52-55.
3敬会.关联规则增量式更新算法[J].科技资讯,2007,5(26).
4廖启明.基于数据新增关联规则的更新算法研究[J].光盘技术,2007(6):19-21.
5钱进,孟祥萍,徐冬寅.一种有效的关联规则增量式更新算法[J].长春工程学院学报（自然科学版）,2003,4(3):11-14. 被引量：4
6杨明,孙志挥,宋余庆.快速更新全局频繁项目集[J].软件学报,2004,15(8):1189-1197. 被引量：18
7邓小妮,罗雪山.一种基于事务时间分割的关联规则增量式更新方法[J].计算机工程与应用,2004,40(23):176-179. 被引量：1
8苏占东,游福成,杨炳儒.关联规则的综合评价方法研究与实例验证[J].计算机应用,2004,24(10):17-20. 被引量：27
9朱玉全,宋余庆,陈耿.约束最大频繁项目集的增量式更新算法[J].计算机工程,2004,30(18):31-32.
10朱红蕾,李明.一种高效维护关联规则的增量算法[J].计算机应用研究,2004,21(9):107-109. 被引量：9

1宋宝莉,覃征.分布式数据库的全局频繁项目集高效更新算法[J].计算机工程与应用,2006,42(31):157-160. 被引量：1
2宋宝莉,覃征.分布式全局频繁项目集的快速挖掘方法[J].西安交通大学学报,2006,40(8):923-927. 被引量：11
3杨明,孙志挥,吉根林.一种基于分布式数据库的全局频繁项目集更新算法[J].东南大学学报（自然科学版）,2002,32(6):879-883. 被引量：4
4吴文妹.一种高效的关联规则更新算法[J].福建电脑,2006,22(6):117-118.
5邓甦,付长贺.一种最小支持度变小的关联规则更新算法[J].辽宁教育行政学院学报,2005,22(11):92-93.
6彭国星.基于分布式数据入侵检测模型研究[J].计算机仿真,2010,27(6):175-178. 被引量：1
7杨明,孙志挥,宋余庆.快速更新全局频繁项目集[J].软件学报,2004,15(8):1189-1197. 被引量：18
8张秀玉.基于现有数据挖掘结果的关联规则更新算法[J].闽江学院学报,2005,26(5):58-61. 被引量：1
9张秀玉.基于现有数据挖掘结果的关联规则更新算法[J].福建信息技术教育,2005,0(3):2-5.
10宋宝莉,覃征.分布式环境下关联规则的安全挖掘算法[J].计算机工程,2006,32(21):35-37. 被引量：6

西安交通大学学报

2007年第4期

浏览历史

内容加载中请稍等...

分布式数据库关联规则更新算法

参考文献8

二级参考文献33

共引文献270

相关作者

相关机构

相关主题

浏览历史