期刊文献+

关于Top-N最频繁项集挖掘的研究

Research on Top-N Most Frequent Itemsets Mining
下载PDF
导出
摘要 最频繁项集挖掘决定了文本关联规则挖掘算法的性能,是文本关联规则挖掘中研究的重点和难点。该文分析了当前最频繁项集挖掘方面的不足,改进了传统的倒排表,结合最小支持度阈值动态调整策略,提出了一个新的基于改进的倒排表和集合理论的Top-N最频繁项集挖掘算法。同样,给出了几个命题和推论,并把它们用于该文算法以提高性能,实验结果表明,所提算法的规则有效率和时间性能优于NApriori算法和IntvMatrix算法。 Most frequent item sets mining is the focus and the difficulty of text association rules mining,and it directly determines the performance of text association rules mining algorithms.Firstly,several most frequent item sets mining algorithms are analyzd and summarized.And then,traditional inverted list is improved.Based on the improved list and set theory,a new TOP-N most frequent itemset mining algorithm combined minimum support threshold dynamic adjustment strategy is presented.In addition,several propositions and deductions for improving the performance of the performance of the provided algorithm are offered.Experimental results show that the provided algorithm is better than Napriori and IntvMatrix.
出处 《电子科技大学学报》 EI CAS CSCD 北大核心 2010年第5期757-761,773,共6页 Journal of University of Electronic Science and Technology of China
基金 四川省科技计划项目(2008GZ0003)
关键词 关联规则 倒排表 频繁项集 集合理论 支持度 association rules inverted list requent itemsets set theory supports
  • 相关文献

参考文献14

  • 1HAN Jia-wei,PEI Jian,YIN Yi-wen.Mining frequent patterns without candidate generation:a frequent pattern tree approach[J].Data Mining and Knowledge Discovery,2004,8(1):53-87.
  • 2FU A W C,KWONG R W W,TANG J.Mining N-most interesting itemsets[C] //Proceedings of 2000 ISMIS.Berlin:Springer,2000:59-67.
  • 3BODON F.A survey on frequent itemset mining[C] //Proceedings of the ACM SIGKDD Workshop on OSDM'04.Chicago,USA:[s.n.] ,2004:523-531.
  • 4陈晓云,胡运发.N个最频繁项集挖掘算法[J].模式识别与人工智能,2007,20(4):512-518. 被引量:6
  • 5HAJJ M E,ZAIANE O R.Inverted matrix:Efficient discovery of frequent items in large datasets in the context of interactive mining[C] //2003 Int'l Conf on Data Mining and Knowledge Discovery(ACM SIGKDD).Califomia,USA:[s.n.] ,2003:109-118.
  • 6HAJJ M E,ZAIANE O R.Non recursive generation of frequent k-itemsets from frequent pattern tree representations[C] //Proceedings of 5th International Conference on Data Warehousing and Knowledge Discovery.Melbourne:Australia,2003:371-380.
  • 7RACZ B.NonordFP:an FP-growth variation without rebuilding the FP-tree[C] //Proceedings of the IEEE ICDM Workshop on FIMI'04.Brighton,UK:[s.n.] ,2004:1089-1097.
  • 8LIU Gui-mei,LU Hong-jun.AFOPT:an efficient implemefitation of pattern growth approach[C] //Proceedings of the IEEE ICDM Workshop on FIMI'04.Brighton,UK:[s.n.] ,2004:2056-2067.
  • 9陈耿,朱玉全,杨鹤标,陆介平,宋余庆,孙志挥.关联规则挖掘中若干关键技术的研究[J].计算机研究与发展,2005,42(10):1785-1789. 被引量:62
  • 10陈晓云,陈袆,王雷,李荣陆,胡运发.基于分类规则树的频繁模式文本分类[J].软件学报,2006,17(5):1017-1025. 被引量:19

二级参考文献42

共引文献109

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部