期刊文献+

FP-growth算法的优化 被引量:1

Optimization of FP-growth Algorithm
下载PDF
导出
摘要 FP-growth算法是关联规则挖掘中效率较高的算法,以自底向上方式探索树,由FP树产生频繁项集。本文针对FP树构造过程中需多次遍历频繁项列表L的缺点,提出了一种基于散列表的改进算法,实现了项名称关键字到存储地址的映射,进而实现了项名称关键字到其支持度计数的映射。在查找某项的支持度计数时,只需给出其名称关键字,无需从头遍历频繁项列表L,时间复杂度由O(n)提高到O(1)。实验结果表明,改进算法的性能优于原算法,节省了遍历时间,提高了挖掘效率。 FP-growth is one of the most efifcient algorithm among all the association rule algorithms.It is a kind of algorithms that explores the FP-tree by a bottom-up way,then it generates frequent items by mining the FP-tree. This article puts forward an optimizing algorithm which is based on hash table against the defects during the process of FP-tree construction,because it usually traverses the frequent item table L time and time again.The new algorithm has achieved a mapping of a key name to the storage address,thus it also achieved a mapping of a key name to its supporting number.As a result, just give an item-key or item-name when you want to search the supporting number of an item. The hash function will help you to calculate the logical address according to the item-key you provided,you will obtain the supporting number directlly according to the logical address.There is no point at all in traversing the frequent item table L.Obviously,time complexity of searching one supporting number of an item improves from O(n) to O(1).At last,experimental results show that optimizing algorithm is indeed better than the original algorithm in terms of the running time.It spends less time than the original one.It saves traversal time and improvs mining efifciency.
作者 闫越 姜昌金
出处 《信息技术与信息化》 2013年第6期125-128,共4页 Information Technology and Informatization
关键词 数据挖掘 散列表 频繁项列表L FP-growth Data mining Frequent item table L FP-growth Hash table
  • 相关文献

参考文献5

  • 1闪四清;陈茵;程雁;Mehmed Kantardzic.数据挖掘-概念、模型、方法和算法[M]{H}北京:清华大学出版社,2003149-152.
  • 2Pang-Ning Tan;Michael Steinbach;Vipin Kumar;范明;范宏建.数据挖掘导论[M]{H}北京:人民邮电出版社,2011223-228.
  • 3郑岩.数据仓库与数据挖掘原理及应用[M]{H}北京:清华大学出版社,2011167-175.
  • 4徐孝凯.数据结构实用教程(c/c++描述)[M]{H}北京:清华大学出版社,1999264-270.
  • 5Jiawei Han;Micheline Kamber;范明;孟小峰.数据挖掘概念与技术[M]{H}北京:机械工业出版社,2007157-159.

同被引文献13

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部