A Parallel Algorithm for Mining On-Shelf Utility Itemsets with Negative Item Values
Abstract: To improve the efficiency of mining on-shelf utility itemsets with negative item values, this paper proposes DTP-Houn (distributed TPHoun algorithm), a parallel algorithm for mining on-shelf utility itemsets with negative item values. Built on the MapReduce framework, the algorithm exploits the on-shelf time-period factor by partitioning the original transaction database according to time period. The mining process is cast as a MapReduce job: the Map phase mines candidate itemsets within each database partition, and the Reduce phase computes the on-shelf utility values of the candidates in parallel. Experimental results show that DTP-Houn achieves high mining efficiency.
Source: Computer and Modernization (《计算机与现代化》), 2018, No. 4, pp. 13-16, 21 (5 pages)
Funding: Natural Science Foundation of Fujian Province (2014J01229)
Keywords: utility itemset mining; on-shelf time periods; MapReduce; negative item values
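
The pipeline outlined in the abstract (partition by time period, mine candidates per partition, aggregate utilities) can be illustrated with a small in-process sketch. This is not the DTP-Houn algorithm itself: the abstract gives no pseudocode, so the function names, the brute-force candidate enumeration, the `max_size` cap, and the simplification that a candidate's on-shelf utility is the sum of its per-period utilities are all assumptions made for illustration. A plain Python loop stands in for a real MapReduce job.

```python
from collections import defaultdict
from itertools import combinations

def partition_by_period(transactions):
    """Split (period, {item: quantity}) records into one partition per time period."""
    parts = defaultdict(list)
    for period, items in transactions:
        parts[period].append(items)
    return dict(parts)

def map_phase(partition, profits, max_size=3):
    """Hypothetical Map step: enumerate candidate itemsets in one partition
    and accumulate their utility (quantity * unit profit; profits may be negative)."""
    local = defaultdict(int)
    for items in partition:
        names = sorted(items)
        for k in range(1, min(max_size, len(names)) + 1):
            for itemset in combinations(names, k):
                local[itemset] += sum(items[i] * profits[i] for i in itemset)
    return dict(local)

def reduce_phase(mapped, min_util):
    """Hypothetical Reduce step: sum each candidate's utility over all
    partitions and keep the candidates that reach the threshold."""
    totals = defaultdict(int)
    for local in mapped:
        for itemset, utility in local.items():
            totals[itemset] += utility
    return {s: u for s, u in totals.items() if u >= min_util}

def mine(transactions, profits, min_util, max_size=3):
    parts = partition_by_period(transactions)
    # Sequential stand-in for running one mapper per time-period partition.
    mapped = [map_phase(p, profits, max_size) for p in parts.values()]
    return reduce_phase(mapped, min_util)

# Toy data (invented for this example): item "b" has a negative unit profit.
profits = {"a": 5, "b": -2, "c": 3}
transactions = [
    ("p1", {"a": 1, "b": 2}),
    ("p1", {"a": 2, "c": 1}),
    ("p2", {"b": 1, "c": 2}),
    ("p2", {"a": 1, "c": 1}),
]
result = mine(transactions, profits, min_util=10)
# result == {("a",): 20, ("c",): 12, ("a", "c"): 21}
```

Because each partition is processed independently in `map_phase`, the per-period work could be distributed across workers; only the final summation in `reduce_phase` needs the results from every period. A real implementation would also need candidate pruning, which this sketch omits.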