期刊文献+

基于PFP-Growth算法的海量频繁项集挖掘 被引量:2

Mining Massive Frequent Items Based on PFP-Growth Algorithm
下载PDF
导出
摘要 随着互联网技术的发展,网络数据变得越来越巨大,如何从中挖掘有效信息成为人们研究的重点。近年来频繁项集挖掘由于其在关联规则挖掘、相关挖掘等任务中的相关重要作用,越来越受到人们的重视。文中针对分布式计算环境下频繁项集挖掘算法的研究,对PFP-Growth算法进行了改进,通过MapReduce编程模型对改进的PFP-Growth算法进行了实现和应用,使用户可以从海量数据中高效地获得所有需要的频繁项集。实验结果表明算法在针对海量数据时具有较高的效率和伸缩性。 As the development of Intemet,the data on it becomes more massive. How to mine useful information from the Interact is the key of study. In recent years, frequent item mining which plays an important role in associations rule mining and correlations mining be- comes popular among researchers. By the study of mining frequent itemsets based on cloud computing, the PFP-Growth algorithm is im- proved. Run the algorithm under the MapReduce model which allows users to obtain all required frequent itemsets efficiently from mas- sive data, the results of experiment shows the algorithm has good efficiency and flexibility.
作者 江雨燕 李平
出处 《计算机技术与发展》 2013年第9期63-65,198,共4页 Computer Technology and Development
基金 安徽高校省级自然科学研究项目(kj2011z039) 安徽工业大学硕士研究生导师创新基金项目(D2011024)
关键词 频繁项集 海量数据 PFP-Growth frequent itcmset massive data PFP-Growth
  • 相关文献

参考文献12

  • 1Agrawal R,Imielinski T,Swami A.Mining association rulesbetween sets of items in large database [ C]//Proc.of 1993ACM SIGMOD Conf.on Management of Data.WashingtonDC:ACM Press,1993:207-216.
  • 2Agrawal R,Srikant R.Fast Algorithms for Mining AssociationRules[ C]//Proc.of the 20th Inti Conference on Very LargeDatabases.Santiago,Chile : [ s.n.] ,1994.
  • 3Han J,Pei J,Yin Y.Mining frequent patterns without candi-date generation [ C]//Proc.of 2000 ACM SIGMOD Int' 1Conf.on Management of Data.Dallas,TX,New York: ACMPress,2000:1-12.
  • 4颜跃进,李舟军,陈火旺.基于FP-Tree有效挖掘最大频繁项集[J].软件学报,2005,16(2):215-222. 被引量:68
  • 5杨云,罗艳霞.FP-Growth算法的改进[J].计算机工程与设计,2010,31(7):1506-1509. 被引量:25
  • 6张玉芳,熊忠阳,彭燕,赵鹰.基于FP-Tree含正负项目的频繁项集挖掘算法[J].模式识别与人工智能,2008,21(2):246-253. 被引量:4
  • 7Agrawal R,Shafer J.Parallel mining of association rules[ J].IEEE Trans,on Knowledge and Data Engineering,1996,8(6):962-969.
  • 8Li Haoyuan,Wang Yi,Zhang Dong,et al.PFP: Parallel FP-Growth for Query Recommendation [ C] //Proceedings of the2008 ACM Conference on Recommender Systems.Lousanne,Switzerland: [s.n.] ,2008: 125-137.
  • 9Zhou L,Zhong Z,Chang J,et al.Balanced Parallel FP-Growthwith Map-Reduce [ C]//Proceedings of 2010 IEEE YouthComference on Information Computing and Telecommunica-tions(YC-ICT).Beijing: [ s.n.] ,2010:243-246.
  • 10宋威,刘文博,李晋宏.基于动态裁剪频繁模式树的频繁项集并发挖掘算法[J].山东大学学报(工学版),2011,41(4):49-55. 被引量:3

二级参考文献62

  • 1秦亮曦,苏永秀,刘永彬,梁碧珍.基于压缩FP-树和数组技术的频繁模式挖掘算法[J].计算机研究与发展,2008,45(z1):244-249. 被引量:16
  • 2卢炎生,饶丹.一种挖掘带否定关联规则的算法[J].计算机工程与科学,2004,26(10):63-65. 被引量:6
  • 3徐前方,阚建杰,李永春,李荣盛,郭军.一种具有时序特征的告警关联规则挖掘算法[J].微电子学与计算机,2007,24(3):23-26. 被引量:6
  • 4李志云,周国祥.一种基于MFP树的快速关联规则挖掘算法[J].计算机技术与发展,2007,17(6):94-96. 被引量:6
  • 5Agrawal R.and Ramakrishnan S.,Fast Algorithms for Mining Association Rules in Large Databases.In Proceedings of the Twentieth International Conference on Very Large Databases,pp.487-499,Santiago,Chile,September 1994.
  • 6Mannila H.,Toivonen H.,Levelwise Search and Borders of Theories in Know Ledge Discovery.Data Mining and Knowledge Discovery,Volume 1,Issue 3,1997,pp.241-258.
  • 7Agrawal R.,Srikant R.,Fast Algorithms for Mining Association Rules.In Proceedings of the 20th International Conference on Very Large Data Bases,September 12-15,1994,Santiago de Chile,Chile,1994,pp.487-499.
  • 8Groth D,Robertson E.,Discovering Frequent Item Sets in the Presence of Highly Frequent Items.In Proceedings of Rule Based Data Mining,2001,pp.84-91.
  • 9N.Pasquier,Y.Bastide,R.Taouil,Discovering Frequent Closed Iternsets for Association Rules.In Proceedings of the 7th International Conference on Database Theory,1999,pp398-416.
  • 10HERNANDEZ-LEON R, PALANCAR J H, CARRASCO-OCHOA J A, et al. Algorithms for mining frequent itemsets in static and dynamic datasets [ J ]. Intelligent Data Analysis, 2010, 14(3) :419-435.

共引文献97

同被引文献5

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部