期刊文献+

云计算下的SPRINT并行算法研究 被引量:5

Research on SPRINT Algorithm in Cloud Computing
下载PDF
导出
摘要 目前,由于云计算的出现,越来越多的中小企业在分析海量数据时能便利地找到廉价的解决方案。本文,鉴于MapReduce作为Hadoop中的重要编程模型,在介绍基于云计算的Hadoop平台和数据挖掘技术中的SPRINT分类算法的基础上,详细描述SPRINT的并行算法在MapReduce编程模型上的执行流程,并利用研究出的决策树模型对输入数据进行分类。 At present, because of the presence of cloud computing, more and more small and medium sized enterprises can find low-cost solution easily when analyzing mass data. In this paper, whereas MapReduce being the important programming model of Hadoop, in the base of introducing the Hadoop platform and SPRINT algorithm of data mining, proposes the detailed procedure of SPRINT algorithm on MapReduce,and classifies the input data by the model of decision tree.
作者 张春艳
出处 《软件》 2010年第11期57-61,共5页 Software
关键词 云计算 HADOOP MAPREDUCE 数据挖掘 SPRINT Cloud computing Hadoop MapReduce data mining SPRINT
  • 相关文献

参考文献6

  • 1Michael Miller.云计算[M].北京:机械工业出版社,2009.
  • 2Jeffrey Dean,Sanjay Ghemawat.MapReduce:Symplified Date Processing on Large Clusters[J].New York:ACM,2008,51(1):107-113.
  • 3韩家炜,坎伯.数据挖掘概念与技术[M].北京:机械工业出版社.2008.
  • 4John Sharer,Rakesh Agrawal,Manish Mehta.SPRINT:A Scalable Parallel Classifier for Data Mining[C].U.S:IBM Almaden Research Center,1996:544-555.
  • 5魏红宁.基于SPRINT方法的并行决策树分类研究[J].计算机应用,2005,25(1):39-41. 被引量:18
  • 6于蕾,刘大有,高滢,田野.改进SPRINT算法及其在分布式环境下的研究[J].吉林大学学报(理学版),2008,46(6):1119-1124. 被引量:5

二级参考文献19

  • 1栾丽华,吉根林.决策树分类技术研究[J].计算机工程,2004,30(9):94-96. 被引量:110
  • 2Frawley W J, Piatetsky-Shapiro G, Matheus C J. Knowledge Discovery in Databases: an Overview [ C]//Knowledge Discovery in Databases. California: AAAI Press, 1992 : 57-70.
  • 3Cheeseman P, Stutz J. Bayesian Classification (Auto Class) : Theory and Results [ C ]//Advances in Knowledge Discovery and Data Mining. California: AAAI Press, 1996: 153-180.
  • 4Quinlan J R. Induction of Decision Trees [J]. Machine Learning, 1986, 1 (1) : 81-106.
  • 5Krose B, Van Der Smagt P. An Introduction to Neural Networks [ M]. 8th ed. Amsterdam: Faculty of Mathematics and Computer Science, 1996: 11-31.
  • 6Swiniarski R W. Rough Sets Methods in Feature Reduction and Classification [ J ]. Appl Math Comput Sci, 2001, 11(3) : 565-582.
  • 7PEI Min, Goodman E D, YING Ding, et al. Genetic Algorithms for Classification and Feature Extraction [ C ]// Proceeding of Classification Society of North America Annual Meeting. Denver: [ s. n. ], 1995: 1-28.
  • 8Rastogi R, Shim K. PUBLIC: a Decision Tree Classifier That Integrates Building and Pruning [ J ]. Data Mining and Knowlegde Discovery, 2000, 4(4): 315-344.
  • 9Shafer J, Agrawal R, Mehta M. SPRINT: a Scalable Parallel Classifier for Data Mining [ C ]//Proceedings of the 22nd VLDB Conference Mumbai(Bombay). Mumbai : Morgan Kaufmann, 1996 : 544-555.
  • 10HAN EH, SRIVASTAVA A, KUMAR V. Parallel formulation of inductive classification learning algorithm[ R]. Minneapolis, USA: University of Minnesota, 1996.

共引文献28

同被引文献36

引证文献5

二级引证文献52

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部