期刊文献+

基于云计算的海量数据挖掘研究 被引量:97

Study of Massive Data Mining Based on Cloud Computing
下载PDF
导出
摘要 为了实现高效率低成本的海量数据挖掘,为企业决策提供参考,提出了基于云计算的海量数据挖掘模型。该模型中海量数据的处理和存储都是在云计算环境中进行的,首先对海量的数据进行一定的预处理,形成结构一致的数据后,应用云计算平台上的MapReduce模型进行高效的并行数据处理,最后得到所需的数据挖掘结果。基于云计算的海量数据挖掘的效率明显高于传统的数据挖掘,并且数据挖掘结果的准确性有了一定的提高,而且随着数据量的增多,该模型的优势会愈发明显。 In order to achieve high efficiency and low cost of massive data mining, and provide decision references for enterprise, the mod- el of massive data mining based on cloud computing has been proposed. The massive data:s processing and storage of the model were car- ried on the cloud computing environment. Firstly, take some certain preprocessing for the massive data to form data with the same struc- ture. Then, use the MapReduce model on the cloud computing platform to parallelly process the data efficiently. Finally, get the needed re- sult of data mining. The efficiency of massive data mining based on cloudcomputing is clearly higher than traditional data mining. Mean- while, the accuracy of data mining will be improved. Along with the increase of data, the advantage of the model will increasingly obvi- ous.
出处 《计算机技术与发展》 2013年第2期69-72,共4页 Computer Technology and Development
基金 国家自然科学基金资助项目(61100165/F020508) 陕西省自然科学基金(2007F18)
关键词 云计算 数据挖掘 海量数据 MAPREDUCE 数据预处理 cloud computing data mining massive data MapReduce data preprocessing
  • 相关文献

参考文献11

二级参考文献143

  • 1周锋,李旭伟.一种改进的MapReduce并行编程模型[J].科协论坛(下半月),2009(2):65-66. 被引量:14
  • 2刘华元,袁琴琴,王保保.并行数据挖掘算法综述[J].电子科技,2006,19(1):65-68. 被引量:15
  • 3王轶,达新宇.分布式并行数据挖掘计算框架及其算法研究[J].微电子学与计算机,2006,23(9):223-225. 被引量:10
  • 4Michael Miller姜进磊,孙瑞志,向勇等译.云计算[M].北京:机械出版社.2009.
  • 5STEELC,NAGAPPANR,LAIR.安全模式[M].陈秋萍,罗邓,袁国忠,译.北京:机械工业出版社,2006.
  • 6Wikipedia. Cloud computing [ EB/OL ]. (2007-03-03) [ 2008-12- 20]. http ://en. wikipedia, org/wiki/Cloud computing.
  • 7Wikipedia. John McCarthy ( computer scientist) [ EB/OL]. (2008- 10-07) [2008-12-10]. http://en. wikipcdia, org/wiki/John_McCarthy_(computer_scientist).
  • 8IBM, C, oogle and IBM announced university initiative to address intemetscale computing challenges [EB/OL]. (2007-10-08) [2008-10-15]. http ://www-03. ibm. com/press/us/en/pressrelease/22414. wss.
  • 9HEWITT C. ORGs for scalable, robust privacy-friendly client cloud computing [ J]. IEEE Intemet Computing, 2008,12 (5) :96- 99.
  • 10WANG Li-zhe, TAO Jie, KUNZE M. Scientific cloud computing: early definition and experience[ C ]//Proc of the 10th IEEE International Conference on High Performance Computing and Communications. 2008:825- 830.

共引文献1626

同被引文献768

引证文献97

二级引证文献342

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部