期刊文献+

基于领域知识的冗余关联规则消除算法 被引量:3

Elimination algorithm of redundant rules in association rules mining based on domain knowledge
下载PDF
导出
摘要 关联规则挖掘算法用于从大型数据库中提取感兴趣的规则,然而,在领域知识中已经能清晰表示的知识并没有被充分考虑,关联规则挖掘算法提取的规则中包含了大量已知的关联性,从而产生了很多冗余规则。文章提出一种算法DKARM,同时考虑了数据本身以及相关的领域知识,以消除在领域知识中清晰表示的已知关联性。实验表明,该算法合理消除了冗余规则,有效降低了规则数目。 Many association rule mining algorithms have been developed to extract interesting patterns from large databases.However,a large amount of knowledge explicitly represented in domain knowledge(DK) has not been used to reduce the number of association rules.A significant number of well known dependences are unnecessarily extracted by association rule mining algorithms,which results in the generation of hundreds or thousands of non-interesting association rules.This paper presents a DKARM algorithm,which takes both database and relative DK into account,to eliminate all associations explicitly represented in DK.Experiments on the proposed algorithm show the significant reduction of the number of rules and the elimination of non-interesting rules.
出处 《合肥工业大学学报(自然科学版)》 CAS CSCD 北大核心 2011年第2期246-250,共5页 Journal of Hefei University of Technology:Natural Science
基金 安徽省教学研究课题资助项目(2008jyxm240) 合肥工业大学科学研究发展基金资助项目(2009HGXJ0035)
关键词 数据挖掘 关联规则 领域知识 冗余规则 data mining association rule domain knowledge(DK) redundant rule
  • 相关文献

参考文献9

  • 1Han Jiawei,Kamber M.Data mining-concepts and techniques[M].Morgan Kaufman Publishers,2001:1-30.
  • 2Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large databases[C]//Proc of the ACM SIGMOD Conference on Management of Data,1993:207-216.
  • 3Han Jiawei,Pei Jian,Yin Y.Mining frequent patterns without candidate generation[C]//Proc of the ACM SIGMOD Conference on Management of Data,2000:1-12.
  • 4Fayyad U,Piatetsky-Shapiro G,Smyth P.From data mining to knowledge discovery:an overview[C]//Advances in Knowledge Discovery and Data Mining.Menlo Park,California:AAAI Press,1996:1-35.
  • 5Zaki M.Generating non-redundant association rules[C]//Proc of the 6th ACM-SIGKDD International Conference on Knowledge Discovery and Data Mining,Boston,Massachusetts,USA,2000:34-43.
  • 6Liu B,Hsu W,Ma Y.Pruning and summarizing the discovered associations[C]//Proc of the 5th ACM-SIGKDD International Conference on Knowledge Discovery and Data Mining,San Diego,California,1999:125-134.
  • 7邢平平,施鹏飞,赵奕.基于本体论的数据挖掘方法[J].计算机工程,2001,27(5):15-16. 被引量:32
  • 8钟佩思,徐文胜,曾庆良,熊光楞.领域知识获取与有效性检查过程集成研究[J].机械科学与技术,2001,20(5):798-800. 被引量:5
  • 9谢亮,张晶,胡学钢.主从关系数据库中关联规则挖掘算法研究[J].合肥工业大学学报(自然科学版),2009,32(5):663-666. 被引量:5

二级参考文献20

  • 1张伟,杨炳儒,宋威.多关系数据挖掘研究综述[J].计算机工程与应用,2006,42(2):1-6. 被引量:4
  • 2徐光美,杨炳儒,张伟,宁淑荣.多关系数据挖掘方法研究[J].计算机应用研究,2006,23(9):8-12. 被引量:5
  • 3Dehaspe L,Raedt L D.Mining association rules in multiple relations[C]//Dzeroski S,Lavrac N.Proc of the 7th Int'l Workshop on Inductive Logic Programming,LNAI 1297.Berlin:Springer-Verlag,1997:125-132.
  • 4Dehaspe L,Toivonen H.Discovery of frequent datalog patterns[J].Data Mining and Knowledge Discovery,1999,3(1):7-36.
  • 5Nijssen S,Kok J.Faster association rules for multiple relations[C]//Nebel B.Proc of the 17th Int'l Joint Conf on Artificial Intelligence (IJCAI 2001),2001:891-896.
  • 6Lavrac N,Dzeroski S.Inductive logic programming:techniques and and applications[M].Prentice Hall,1994:1.
  • 7Yin Xiaoxin,Han Jiawei,Yang Jiong,et al.Efficient classification across multiple database relations:a CrossMine approach[J].IEEE Transactions on Knowledge and Data Engineering,2006,18(6):770-783.
  • 8Zhang Shichao,Wu Xindong,Zhang Chengqi.Multi-database ming[J].IEEE Computational Intelligence Bulletin,2003,2(1):5-13.
  • 9Yin Xiaoxin,Han Jiawei,Yang Jiong,et al.CrossMine:efficient classification across multiple database relations[C]//ICDE,2004:399-410.
  • 10Liu Huan,Lu Hongjun,Yao Jun.Toward multidatabase mining:identifying relevant databases[J].IEEE Transactions on Knowledge and Data Engineering,2001,13(4):541-553.

共引文献39

同被引文献50

  • 1潘海为,韩启龙,印桂生,张炜,李建中.基于领域知识指导的医学图像关联规则挖掘[J].计算机研究与发展,2007,44(z3):424-428. 被引量:3
  • 2李蕾,高铁曼.产生式规则专家系统的原理与实现[J].微计算机应用,2006,27(5):631-634. 被引量:21
  • 3赵秀梅.规则库的冗余测试[J].电脑开发与应用,2006,19(12):6-8. 被引量:1
  • 4CHEN Guo-qing, LIU Hong-yan, YU Lan, et al. A new approach to classification based on association rule mining [ J ]. Decision Sup- port Systems,2006,42 (2) :674- 689.
  • 5Aronis J M,Kolluri V,Buchanan F J. The WoRLD:Knowledge discovery from multiple distributed databases[A].1997.337-341.
  • 6Liu H,Lu H J,Yao J. Identifying relevant databases for multi database mining[A].1998.210-221.
  • 7Yan J,Liu N,Yang Q. Mining adaptive ratio rules from distributed data sources[J].Data Mining and Knowledge Discovery,2006,(2/3):249-273.
  • 8Peng W C,Liao Z X. Mining sequential patterns across multiple sequence databases[J].Data and Knowledge Engineering,2009,(10):1014-1033.
  • 9Zhang S C,Zhang C Q,Wu XD. Knowledge discovery in multiple databases[M].New York:springer-verlag,2004.110-120.
  • 10Adhikari A,Rao R R,Pedrycz W. Developing multi-database mining applications[M].New York:springer-verlag,2010.78-92.

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部