A New Hybrid Algorithm for Association Rule Mining (cited by: 1)

Abstract: HA (hashing array), a new algorithm for mining frequent itemsets in large databases, is proposed. It employs a hash-array structure, ItemArray(), to store the information of the database and then uses that structure in place of the database in later iterations. With this improvement, only two scans of the whole database are necessary, so the computational cost can be reduced significantly. To overcome the performance bottleneck of mining frequent 2-itemsets, a modified version of HA, DHA (direct-addressing hashing and array), is proposed, which combines HA with the direct-addressing hashing technique. The new hybrid algorithm, DHA, not only overcomes the performance bottleneck but also inherits the advantages of HA. Extensive simulations are conducted to evaluate the performance of the proposed algorithm, and the results show that it is more efficient and reasonable.
Source: Journal of Donghua University (English Edition) [东华大学学报(英文版)], indexed in EI and CAS, 2007, No. 5, pp. 598-603 (6 pages)
Keywords: association rule mining; data mining; hashing; database analysis; hybrid algorithm
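
The abstract gives only a high-level description, so the following Python sketch is an illustration of the general idea rather than the authors' HA or DHA implementation. It assumes that the stored structure maps each frequent item to the set of transaction IDs containing it (named `item_array` here, a hypothetical stand-in for ItemArray()), and that "direct-addressing hashing" for 2-itemsets can be modelled as a flat counter table indexed by a collision-free function of the two item ranks. The function names `mine_frequent_itemsets` and `pair_index` are also assumptions. The database is read exactly twice; itemsets of size 3 and above are counted from `item_array` alone.

```python
from collections import defaultdict
from itertools import combinations


def mine_frequent_itemsets(transactions, min_support):
    """Return {sorted item tuple: support count}. Illustrative sketch only."""
    # Scan 1: count single items.
    item_counts = defaultdict(int)
    for t in transactions:
        for item in set(t):
            item_counts[item] += 1
    frequent_items = sorted(i for i, c in item_counts.items() if c >= min_support)
    rank = {item: r for r, item in enumerate(frequent_items)}
    n = len(frequent_items)

    # Assumed stand-in for ItemArray(): item -> set of transaction IDs.
    item_array = {item: set() for item in frequent_items}
    # Directly addressed table: one slot per unordered pair of frequent items.
    pair_counts = [0] * (n * (n - 1) // 2)

    def pair_index(a, b):
        # Collision-free slot for ranks a < b (row-major upper triangle).
        return a * (2 * n - a - 1) // 2 + (b - a - 1)

    # Scan 2: fill item_array and count all 2-itemsets in one pass.
    for tid, t in enumerate(transactions):
        ranks = sorted(rank[i] for i in set(t) if i in rank)
        for r in ranks:
            item_array[frequent_items[r]].add(tid)
        for a, b in combinations(ranks, 2):
            pair_counts[pair_index(a, b)] += 1

    result = {(i,): item_counts[i] for i in frequent_items}
    level = {}
    for a, b in combinations(range(n), 2):
        count = pair_counts[pair_index(a, b)]
        if count >= min_support:
            level[(frequent_items[a], frequent_items[b])] = count

    # Later iterations use item_array instead of rescanning the database.
    while level:
        result.update(level)
        next_level = {}
        for x, y in combinations(sorted(level), 2):
            if x[:-1] == y[:-1]:  # join itemsets sharing a (k-2)-prefix
                candidate = x + (y[-1],)
                tids = set.intersection(*(item_array[i] for i in candidate))
                if len(tids) >= min_support:
                    next_level[candidate] = len(tids)
        level = next_level
    return result
```

A small usage example, with made-up transactions:

```python
baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
print(mine_frequent_itemsets(baskets, min_support=3))
```

The flat table trades memory (one counter per pair of frequent items) for constant-time, collision-free counting of 2-itemsets, which is the step the abstract identifies as the bottleneck.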