期刊文献+

云计算下的海量数据挖掘研究 被引量:26

Research on Mass Data Mining under Cloud Computing
下载PDF
导出
摘要 云计算的出现为愈来愈多的中小企业分析海量数据提供廉价的解决方案。在介绍基于云计算的Hadoop集群框架和数据挖掘技术中的SPRINT分类算法的基础上,详细描述SPRINT并行算法在Hadoop中的MapReduce编程模型上的执行流程,并利用分析出的决策树模型对输入数据进行分类。 Cloud Computing provides a low-priced way for small and medium sized enterprises to analyze mass data. Based on Hadoop of Cloud Computing and SPRINT algorithm of data mining, proposes the detailed procedure of SPRINT algorithm on MapR.eduee, and classifies the input data is by the model of decision tree.
作者 王鄂 李铭
出处 《现代计算机》 2009年第11期22-25,50,共5页 Modern Computer
关键词 云计算 数据挖掘 HADOOP SPRINT MAPREDUCE Cloud Computing Data Mining Hadoop SPRINT MapReduce
  • 相关文献

参考文献6

  • 1Michael Miller姜进磊,孙瑞志,向勇等译.云计算[M].北京:机械出版社.2009.
  • 2Jeffrey Dean, Sanjay Ghemawat. MapReduce: Symplified Date Processing on Large Clusters[J]. New York:ACM,2008, 51(1):107-113.
  • 3韩家炜,坎伯.数据挖掘概念与技术[M].北京:机械工业出版社.2008.
  • 4John Shafer, Rakesh Agrawal,Manish Mehta. SPRINT:A Scalable Parallel Classifier for Data Mining [C].U.S:IBM Almaden Research Center,1996:544-555.
  • 5魏红宁.基于SPRINT方法的并行决策树分类研究[J].计算机应用,2005,25(1):39-41. 被引量:18
  • 6于蕾,刘大有,高滢,田野.改进SPRINT算法及其在分布式环境下的研究[J].吉林大学学报(理学版),2008,46(6):1119-1124. 被引量:5

二级参考文献19

  • 1栾丽华,吉根林.决策树分类技术研究[J].计算机工程,2004,30(9):94-96. 被引量:110
  • 2Frawley W J, Piatetsky-Shapiro G, Matheus C J. Knowledge Discovery in Databases: an Overview [ C]//Knowledge Discovery in Databases. California: AAAI Press, 1992 : 57-70.
  • 3Cheeseman P, Stutz J. Bayesian Classification (Auto Class) : Theory and Results [ C ]//Advances in Knowledge Discovery and Data Mining. California: AAAI Press, 1996: 153-180.
  • 4Quinlan J R. Induction of Decision Trees [J]. Machine Learning, 1986, 1 (1) : 81-106.
  • 5Krose B, Van Der Smagt P. An Introduction to Neural Networks [ M]. 8th ed. Amsterdam: Faculty of Mathematics and Computer Science, 1996: 11-31.
  • 6Swiniarski R W. Rough Sets Methods in Feature Reduction and Classification [ J ]. Appl Math Comput Sci, 2001, 11(3) : 565-582.
  • 7PEI Min, Goodman E D, YING Ding, et al. Genetic Algorithms for Classification and Feature Extraction [ C ]// Proceeding of Classification Society of North America Annual Meeting. Denver: [ s. n. ], 1995: 1-28.
  • 8Rastogi R, Shim K. PUBLIC: a Decision Tree Classifier That Integrates Building and Pruning [ J ]. Data Mining and Knowlegde Discovery, 2000, 4(4): 315-344.
  • 9Shafer J, Agrawal R, Mehta M. SPRINT: a Scalable Parallel Classifier for Data Mining [ C ]//Proceedings of the 22nd VLDB Conference Mumbai(Bombay). Mumbai : Morgan Kaufmann, 1996 : 544-555.
  • 10HAN EH, SRIVASTAVA A, KUMAR V. Parallel formulation of inductive classification learning algorithm[ R]. Minneapolis, USA: University of Minnesota, 1996.

共引文献56

同被引文献204

引证文献26

二级引证文献212

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部