期刊文献+

基于小生境遗传算法的关联规则挖掘方法 被引量:5

Association Rule Mining Method Based on Niching Genetic Algorithm
下载PDF
导出
摘要 通过采用惩罚函数设置支持度阈值,有效地解决了规则冗余问题。在小生境遗传算法方面,采用了一种新颖的素因子染色体编码方法并引入了最大频繁项分布表。该编码方法把原来用字符表示的事务转化为用一个整数进行表示,把字符串运算转化为数值运算,将事务数据库属性项压缩为一个数值型的项。通过引入最大频繁项分布表,使算法总是能在最大频繁项密集区进行挖掘,对组合搜索空间进行了有效修剪。实验结果表明,该方法对事务数据库压缩比超过25%,效率至少能提高47%。 The paper effectively solves the problem of regulation redundancy through adopting penalty function to set up the threshold of supporting degree. In respect of niching genetic algorithm. it adopts a new prime factor chromosome encoding method and introduces the maximal frequency item distribution list. This encoding method changes the transaction originally expressed in character into the one expressed in integer and changes the character string operation into the numerical value operation and compresses the property item of the transaction database to a numerical value item. Through introducing the maximal frequency item distribution list, it can always mine in the maximal frequency item compact district and thus, it effectively prunes the assembling searchspace. Experimental results show that the adopting method makes the compressing ratio of the transaction database in excess of 25%, and the effect can increase by at least 47%.
出处 《计算机工程》 CAS CSCD 北大核心 2008年第10期163-165,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60573067)
关键词 关联规则 小生境遗传算法 染色体 杂交操作 association rule niching genetic algorithm chromosome hybrid operation
  • 相关文献

参考文献6

  • 1Agrawal R,Imielinski T,Swami A.Mining Association Rules Between Sets of Items in Large Databases[C]//Proceedings of the ACM SIGMOD.Washington D.C.,USA:[s.n.],1993.
  • 2贺志,黄厚宽,田盛丰.一种优化相关规则的发现方法[J].计算机学报,2006,29(6):906-913. 被引量:12
  • 3Omiecinski E R.Alternative Interest Measures for Mining Association in Databases[J].IEEE Transactions on Knowledge and Data Engineering,2003,15(1):57-69.
  • 4Srikant R,Agrawal R.Mining Generalized Association Rules[C]//Proceedings of the 21th International Conference on Very Large Data Bases.Tokyo,Japan:[s.n.],1995.
  • 5Cohen E.Finding Interesting Associations Without Support Pruning[J].IEEE Transactions on Knowledge and Data Engineering,2001,13(1):64-78.
  • 6Xiong Hui,Tan Pangning,Kumar V.Mining Strong Affinity Association Patterns in Data Sets with Skewed Support Distribution[C]//Proceedings of the 3rd IEEE International Conference on Data Mining.Melbourne,Florida,USA:[s.n.],2003

二级参考文献17

  • 1Carlin B.P.,Louis T.A..Bayes and Empirical Bayes Methods for Data Analysis.2nd Edition.London U.K.:Chapman &Hall,2000
  • 2Light R.J.,Margolin B.H..An analysis of variance for categorical data.Journal of the American Statistical Association,1971,66:534~544
  • 3Steven R.,Matthew S.etal.Integrated Public Use Microdata Series:Version 2.0.Historical Census Projects,University of Minnesota,Minneapolis,1997
  • 4Au W.-H.,Chan K.C.C.,Yao X..A novel evolutionary data mining algorithm with applications to churn prediction.IEEE Transactions on Evolutionary Computation,2003,7 (6):532~545
  • 5http://fuzzy.cs.uni-magdeburg.de/~borgelt,2004年5月9日.
  • 6Agrawal R.,Imielinski T.,Swami A..Mining association rules between sets of items in large databases.In:Proceedings of the ACM SIGMOD,Washington,DC,1993,207~216
  • 7Omiecinski E.R..Alternative interest measures for mining associations in databases.IEEE Transactions on Knowledge and Data Engineering,2003,15(1):57~69
  • 8Srikant R.,Agrawal R..Mining generalized association rules.In:Proceedings of the 21th International Conference on Very Large Data Bases,Tokyo,Japan,1995,409~419
  • 9Srikant R.,Agrawal R..Mining quantitative association rules in large relational tables.In:Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data,Montreal,Quebec,Canada,1996,1~12
  • 10Tan Pang-Ning,Kumar V..Interestingness measures for association patterns:A perspective.Department of Computer Science and Engineering,University of Minnesota,Minneapolis,USA:Technical Report,2000

共引文献11

同被引文献24

  • 1汪小霞.基于MapReduce的大数据时代数据处理技术探究[J].计算机产品与流通,2020,0(5):156-156. 被引量:1
  • 2郏宣耀,王芳.一种改进的小生境遗传算法[J].重庆邮电学院学报(自然科学版),2005,17(6):721-723. 被引量:23
  • 3宁伟,赵明清.关于决策表离散化贪心算法的进一步改进[J].计算机工程与应用,2007,43(3):173-174. 被引量:10
  • 4李锦泽,叶晓俊.关联规则挖掘算法研究现状[C]//第18届计算机技术与应用学术会议(CACIS).2007:216-220.
  • 5韩家炜.数据挖掘:概念与技术[M].3版北京:机械工业出版社,2012.
  • 6(美)Pang-NingTan,(美)MichaelSteinbach,(美)VipinKumar著,范明,范宏建等.数据挖掘导论[M]人民邮电出版社,2006.
  • 7周明;孙树栋.遗传算法原理及应用[M]北京:国防工业出版社,1999.
  • 8Agrawal R,Imielinski S A. Mining association rulesBetween sets of items in large databases[A].Washington,DC:USA,1993.
  • 9Wu Kaixing, Hao Juan, Wang Chunhua. Application of fuzzy association rules in intrusion detection [C] //International Conference on Internet Computing and Information Services, 2011: 269-272.
  • 10Ahadeh MS, Harold M, Jafar H. Design and analysis of ge- netic fuzzy systems for intrusion detection in computer networks [J]. Expert System with Applications, 2011, 38: 7067-7075.

引证文献5

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部