基于集合枚举树的关联规则生成算法被引量：4

Association Rules Generating Algorithm Based on Set-Enumeration Tree

下载PDF

导出

摘要在经典算法中由频繁项集生成关联规则需要生成频繁项集的所有非空子集作为候选后件集。李雄飞对此做出改进,提出逐层搜索后件的宽度优先算法。求下集极大元的Boundary算法也可用于求所有关联规则后件。论文提出一个深度优先算法GRSET(GenerateRulesbyusingSet-EnumerationTree),该算法利用集合枚举树,按照深度优先的方法逐一找出所有关联规则后件并得到相应的关联规则。通过实验对这三种算法进行比较,结果显示GRSET算法效率较高。 The classical algorithm of mining association rules gnerated by a frequent itemset has to generate all nonempty subsets of the frequent itemset as candidate set of consequences,Li Xiongfei aimed at this and proposed an improved algorithm.The algorithm finds all consequences layer by layer,so it is breadth-first.We also can use Boundary algorithm of finding all maximal elements of a lower segment to get all consequences of the association rules,ln this paper,we propose a new algorithm GRSET（Generate Rules by using Set-Enumeration Tree） which uses the structure of Set-Enumeration Tree and depth-first method to find all consequences of the association rules one by one and get all association rules corresponding to the consequences.Experiments show that GRSET algorithm is more efficient than the other two algorithms.

作者武坤李乃雄魏庆姜保庆

机构地区河南大学数据与知识工程研究所广西工学院信息与计算科学系

出处《计算机工程与应用》 CSCD 北大核心 2006年第26期152-155,共4页 Computer Engineering and Applications

基金国家自然科学基金资助项目(编号:60474022) 河南省骨干教师资助项目(编号:G2002026) 河南省自然科学计划资助项目(编号:200510475028)

关键词数据挖掘频繁项集关联规则深度优先算法 data mining,frequent itemset,association rules,depth-first algorithm

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献10

1Jiawei Han,Micheline Kamber.Data Mining Concepts and Techniques[M].First edition,Beijing:Higher Education Press,2001:152～157
2R Agrawal,R Srikant.Fast algorithms for mining association rules in large databases[C].In:Proc of the 20th Int Conf on Very Large Data Bases (VLDB'94),Santiago,Chile,1994:487～499
3颜跃进,李舟军,陈火旺.频繁项集挖掘算法[J].计算机科学,2004,31(3):112-114. 被引量：20
4Burdick D,Calimlim M,Gehrke J.MAFIA:A maximal frequent itemset algorithm for transactional databases[C].In:Proc Of the 17th Int'l Conf On Data Engineering,2001:443～452
5Zhou QH,Wesley C,Lu BJ.SmartMiner-A depth 1st algorithm guided by tail information for mining maximal frequent itemsets[C].In:Proc of the IEEE Int'l Conf On Data Mining(ICDM2002),2002:570～577
6Dao-I Lin,Zvi M Kedem.Pincer-Search:An Efficient Algorithm for Discovering the Maximum Frequent Set[J].IEEE transactions on knowledge and data engineering,2002; 14 (3):553～566
7Agarwal RC,Aggarwal CC,Prasad VVV.Depth First generation of long patterns[C].In:Proc Of the 6th ACM SIGKDD Int'l Conf On Knowledge Discovery and Data Mining,2000:108～118
8高俊,施伯乐.快速关联规则挖掘算法研究[J].计算机科学,2005,32(3):200-201. 被引量：10
9刘大有,刘亚波,尹治东.关联规则最大频繁项目集的快速发现算法[J].吉林大学学报（理学版）,2004,42(2):212-215. 被引量：10
10Rymon R.Search Through Systematic Set Enumeration[C].In:Proc Of Third International Conference on Principles of Knowledge Representation and Reasoning,1992:539～550

二级参考文献38

1Imielinski T, Virmani A. MSQL: Aquery languang for database mining. Data Mining and Knowledge Discovery, 1999,3: 373-408
2Groth R. Data Mining: Building Competitive Advantage. Prentice Hall,1999
3Goebel M,Gruenwald L. A survey of data mining and knowledge discovery software tools. SIGKDD Explorations, 1999,1:20-33
4Grahne G. Efficient mining of constrained correlated sets. In:Proc. 2000 Intl. Conf. Data Engineering (ICDE'00), San Diego:2000. 512-521
5Han J. Mining frequent patterns without candidate generation. In:Proc. ACM-SIGMOD Int. Conf. Dallas. 2000
6Han J,Pei J. Freespan: Frequent pattern-projected sequential pattern mining: [Technical Report CMPT2000-06]. Simon Fraser University, 2000. 6-12
7Han J. Data Mining: Concepts and Techniques. Burnaby: Simon Fraser University, 2000. 155-163
8Liu J Q,Pan Y H,Wang K,Han J W. Mining Frequent Item Sets by Opportunistic Projection, KDD'02, Edmonton, Canada, July 2002
9Han J, Pei J,Yin Y. Mining Frequent Patterns without Candidate Generation. In: Proc. 2000ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD'00),Dallas, TX, May 2000
10Pei J,Hah J,Lu H, et al. H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases,In:Proc. 2001 Int. Conf. on Data Mining(ICDM'01) ,San Jose,CA,Nov. 2001

共引文献37

1颜跃进,李舟军,陈火旺.基于FP-Tree有效挖掘最大频繁项集[J].软件学报,2005,16(2):215-222. 被引量：68
2颜跃进,李舟军,陈火旺.一种挖掘最大频繁项集的深度优先算法[J].计算机研究与发展,2005,42(3):462-467. 被引量：20
3胡陈勇,刘大有,刘亚波.一种扩展的关联规则挖掘算法[J].吉林大学学报（理学版）,2005,43(2):153-156. 被引量：1
4皇甫罡,王宁博.数据挖掘在食管癌和贲门癌研究中的应用[J].郑州轻工业学院学报（自然科学版）,2005,20(2):90-93.
5李红,胡学钢.基于CIE-树的关联规则最大频繁项集的求解[J].计算机工程与应用,2006,42(3):180-182. 被引量：3
6唐德权,王绪峰,朱林立,谢文君.一种快速挖掘频繁项集算法的研究[J].湖南科技学院学报,2006,27(5):117-120. 被引量：3
7赵连朋.基于关联规则的医疗处方智能监督方法的研究[J].计算机工程与应用,2006,42(32):223-225. 被引量：3
8蔡进,薛永生,林丽,张东站.基于充分挖掘增量事务的关联规则更新算法[J].计算机科学,2007,34(2):220-222. 被引量：3
9李芸,李青山.基于约束的最大频繁项集挖掘算法[J].计算机工程与应用,2007,43(17):160-163. 被引量：12
10武坤,姜保庆,魏庆.A Depth-first Algorithm of Finding All Association Rules Generated by a Frequent Itemset[J].Journal of Donghua University(English Edition),2006,23(6):1-4.

同被引文献38

1宫文浩,兰天莹,莫清莲,杨燕,戴启刚,陈莎莎,唐子西,刘悠江,艾军.基于决策树和人工神经网络的小儿肺炎痰热闭肺证诊断模型研究[J].世界科学技术-中医药现代化,2020,22(7):2548-2555. 被引量：14
2丁纪元,孟昭琳.黄芪四君子汤在晚期非小细胞肺癌化疗中的应用[J].浙江中西医结合杂志,2006,16(1):28-29. 被引量：19
3陆剑江.通用模式的移动办公平台设计方案研究[J].计算机工程与设计,2006,27(4):695-697. 被引量：34
4姜保庆,李建,徐扬.布尔关联规则集的结构[J].河南大学学报（自然科学版）,2006,36(1):88-90. 被引量：2
5孙纳新,周婉婷.北京市财政局综合办公平台[J].办公自动化,2006(4):10-11. 被引量：1
6冯平,黄名选.由频繁项集生成关联规则的算法设计和实现[J].广西工学院学报,2007,18(1):56-59. 被引量：4
7Agrawal R, Mannila H , Srikant R , et al. Fast discovery rules[ C]//Fayyad U Advances in Knowledge Discovery and Data Mining. Menlo Park: AAAI Press, 1996:307 - 328.
8Cercone V, Tsuchiya M. Luesy editor' s introduction [ J ]. IEEE Transaction on Knowledge and Data Engineering, 1993,5(6) :901.
9Brin S, Motwani R, Ullman J, et al. Dynamic itemset counting and implication rules for market basket data [ EB/OL] . ( 1999 - 05 - 23 ) [ 2007 - 05 - 09 ]. http :// citeseer. njnec.com/brin97dynamic html.
10Pasquier N, Bastide Y, Taouil R , et al. Discovering frequent closed itmesets for association rides [ EB/OL]. ( 1999 - 11 - 23 ) [ 2007 - 04 - 18 ]. http: // citeseer. njnec. com / pasquier99 discovering html.

引证文献4

1魏庆,任剑峰.特种设备检验机构智能网络办公平台的设计与实现[J].光盘技术,2006(4):12-14. 被引量：1
2马莉,任学军,赵纪涛.一种挖掘关联规则的改进算法[J].郑州轻工业学院学报（自然科学版）,2008,23(3):117-120.
3武坤.一种快速挖掘关联规则的改进算法[J].河南财政税务高等专科学校学报,2016,30(1):91-95.
4章新友,徐华康,唐琍萍,张亚明,刘梦玲,周小玲.基于策略模式的中医药数据智能挖掘平台设计与应用[J].科学技术与工程,2023,23(14):5946-5954. 被引量：2

二级引证文献3

1刘三江,陈祖志,薄柯,李光海.智能网联移动式承压设备检验模式分析——以长管拖车为例[J].科技和产业,2020,20(3):178-182. 被引量：5
2江巍,刘帆.基于电子病历的中医诊疗数据中心决策分析系统设计[J].消费电子,2024(3):45-47.
3段雪莹.混合属性网络多维多层关联数据智能挖掘算法[J].智能计算机与应用,2024,14(3):207-211.

1武坤.一种快速挖掘关联规则的改进算法[J].河南财政税务高等专科学校学报,2016,30(1):91-95.
2马莉,任学军,赵纪涛.一种挖掘关联规则的改进算法[J].郑州轻工业学院学报（自然科学版）,2008,23(3):117-120.
3徐凤生,赵永华.一种新的关联规则挖掘算法[J].德州学院学报,2002,18(4):45-47.
4黄波.基于完全信息树的关联规则生成算法在入侵检测中的应用[J].中国商界,2009,0(11X):205-206.
5谢霖铨,章恩.基于FP-Tree的概念格量化约简及其在GIS的应用[J].南昌大学学报（理科版）,2014,38(3):289-294.
6徐凤生,陆玉昌.模糊关联规则的挖掘算法[J].德州学院学报,2002,18(2):65-68. 被引量：3
7蒋瑜.基于集合枚举树的最小属性约简算法[J].计算机工程与应用,2013,49(11):101-104. 被引量：2
8武坤,魏涛.由频繁项集生成关联规则的深度优先算法[J].科学咨询,2009(11):36-37. 被引量：1
9刘芳,路松峰,卢正鼎,胡和平.一种基于限制的关联规则数据开采的算法[J].华中科技大学学报（自然科学版）,2001,29(3):27-29. 被引量：1
10何云峰.Apriori改进算法综述[J].微型机与应用,2013,32(6):1-3. 被引量：7

计算机工程与应用

2006年第26期

浏览历史

内容加载中请稍等...

基于集合枚举树的关联规则生成算法被引量：4

参考文献10

二级参考文献38

共引文献37

同被引文献38

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于集合枚举树的关联规则生成算法 被引量：4

参考文献10

二级参考文献38

共引文献37

同被引文献38

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于集合枚举树的关联规则生成算法被引量：4