期刊文献+

基于哈希表与线性表建立FP-Tree的改进算法 被引量:1

A New Algorithm of FP-tree Establishment on the Basis of Hash Table and Linear Ojbect Table
下载PDF
导出
摘要 对于超市销售记录进行关联挖掘,项目集庞大,每次事务中涉及到项目数非常少。针对这类稀疏数据,提出了基于事务哈希表和线性对象表的FP-Tree改进算法,其只需扫描数据库一次,把相关信息压入事务哈希表和线性对象表中。当支持度和事务记录变化时,可不用重新扫描数据库或扫描数据库更新部分。试验结果验证了该改进算法相对于原算法在建树中的优势,特别在大数据集下,降低了建立FP-Tree的时间。 For the record of supermarket sales of association data mining,the project set waslarge,each transaction involved in the number of items was very small,sparse data for this and similar proposed transaction-based hash table and the linear object table of the FP-TREE improved algorithm,It scans the database only once to press into the affairs related to information and linear hash table and object table.When the change of support and transaction records,it can not re-scan the database or scan the database update part.The results are used to verify the improved algorithm related to the original FP-TREE algorithm in the achievements of the advantages,especially in large data sets,the association rule mining greatly reduces the time of construction of FP-TREE.
出处 《长江大学学报(自科版)(上旬)》 CAS 2010年第1期76-79,共4页 JOURNAL OF YANGTZE UNIVERSITY (NATURAL SCIENCE EDITION) SCI & ENG
基金 福建省自然科学基金资助项目(2009J01294) 宁德师范学院科研资助项目(2008Y007)
关键词 事务哈希表 线性对象表 FP-GROWTH 关联规则 transaction hash table linear object table FP-growth Association rules
  • 相关文献

参考文献7

  • 1Agrawal R, Srikant R. Fast algorithm for mining association rules [J] . Proceedings of the 20th International Conference on VLDB. Santiago, 1994, 12 (2): 487-499.
  • 2Houtsma M, Swami A. Set-Oriented mining for association rules in relational databases [J] . Proceedings of the International Conference on Data Engineering. Los Alamitos: IEEE Computer Society Press, 1995, 38 (4) : 25-33.
  • 3Han J K.数据挖掘:概念与技术[M].范明,孟小峰译.北京:机械工业出版社,2001.
  • 4邓丰义,刘震宇.基于模式矩阵的FP-growth改进算法[J].厦门大学学报(自然科学版),2005,44(5):629-633. 被引量:17
  • 5熊忠阳 耿晓斐 张玉芳.一种新频繁项集挖掘算法.计算机学报,2009,36(4):42-44.
  • 6Zaki M J. Scanlable Algorithms for Association Mining [J]. Knowledge and Data Engineering, 2000, 19 (3): 52-55.
  • 7路松峰,卢正鼎.快速开采最大频繁项目集[J].软件学报,2001,12(2):293-297. 被引量:113

二级参考文献8

  • 1Lin Dao I,Proc the 6th European Conference on Extending Database Technology,1998年,105页
  • 2Agrawal R,Proc the 11th Inter Conference on Data Engineering,1995年,3页
  • 3Park J S,Chen M S,Yu P S.An effective hash-based algorithm for mining association rules[A].In Proc.1995 ACM-SIGMOD Int.Conf.Management of Data[C].San Jose,CA:1995.175-186.
  • 4Toivonen H.Sampling large databases for association rules[A].In Proc.1996 Int.Conf.Very Large Data Bases(VLDB'98)[C].Bombay,India:1996.134-145.
  • 5Han J Kamber.Data Mining:Concepts and Techniques[M].Beijing:Higher Education Press,2001.
  • 6HanJKamber 范明 孟小峰 译.数据挖掘:概念与技术[M].北京:机械工业出版社,2001..
  • 7Han J,Jian P,Yiwen Y.Mining frequent patterns without candidate generation[A].In:Proceedings of the 2000 ACM SIGMOD International Conference Management of Data[C].Dallas.Texas:2000.1-12.
  • 8Agrawal R,Srikant R.Fast algorithm for mining association rules[A].In:Proceedings of the 20th International Conference on VLDB[C].Santiago,Chile:1994.487-499.

共引文献128

同被引文献4

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部