期刊文献+

Hadoop下基于粗糙集与贝叶斯的气象数据挖掘研究 被引量:6

RESEARCH ON ROUGH SET AND BAYES-BASED METEOROLOGICAL DATA MINING ON HADOOP PLATFORM
下载PDF
导出
摘要 随着气象信息化程度不断提高,气象部门积累了海量的气象数据,如何从海量的数据中获取有用的知识,成为人们关注的重点。气象数据具有维度高、依赖性强等特点,这就对气象数据挖掘提出了更高的要求。经典数据挖掘算法在处理海量气象数据时在性能与准确率方面无法获得较好的结果。在分析了MapReduce计算模型与粗糙集、贝叶斯分类的基础上,给出了基于MapReduce的计算等价类的数据约简算法与朴素贝叶斯分类算法。最后在Hadoop平台上进行了相关实验。实验结果表明,该并行数据挖掘方案可以有效处理海量气象数据,并具有良好的扩展性。 With the continuous development of meteorological informatisation level,massive meteorological data has been piled up in meteorological departments,how to extract useful knowledge from massive data becomes the focus of attention.Meteorological data has the features of high dimensions and strong dependence,which puts forward higher requirements to meteorological data mining.Classic data mining algorithms cannot achieve better results in performance and accuracy when processing massive meteorological data.On the basis of analysing MapReduce calculation model,rough set theory and Bayesian classification,we propose a MapReduce-based data reduction algorithm and native Bayesian classification algorithm for computing equivalence class.Finally,on Hadoop platform we carry out the correlated experiment. It is demonstrated by the experimental results that this paralleled data mining scheme can efficiently process massive meteorological data and has good scalability.
出处 《计算机应用与软件》 CSCD 2015年第4期72-76,90,共6页 Computer Applications and Software
基金 国家自然科学基金项目(61363052) 内蒙古研究生科研创新项目(S20131012810) 内蒙古教育厅自然科学基金项目(NJZY12052) 内蒙古工业大学重点项目(ZD201118)
关键词 粗糙集 朴素贝叶斯 MAPREDUCE 气象数据 Rough set Native Bayes MapReduce Meteorological data
  • 相关文献

参考文献14

  • 1Juraj Bartoka, Ondrej Habalab, Peter Bednarc, et al. Data Mining and Integration for Predicting Significant Meteorological Phenomena [ J ]. Procedia Computer Science,2012,17 (1) : 37 -46.
  • 2胡邦辉,袁野,王学忠,丛爱丽.基于贝叶斯分类方法的雷暴预报[J].解放军理工大学学报(自然科学版),2010,11(5):578-584. 被引量:14
  • 3路志英,赵智超,郝为,林孔元,刘还珠.基于人工神经网络的多模型综合预报方法[J].计算机应用,2004,24(4):50-51. 被引量:10
  • 4Dean J,Ghemawat S. Mapreduce: Simplified data processing on large- clusters[ J ]. Communication of the ACM, 2008,51 ( 1 ) :107 - 113.
  • 5Deng Dayong, Yan Dianxun, Wang Jiyi. Parallel reducts based on at- tribute significance[ C] //Proceedings of the 5th International Confer- ence on Rough Set and Knowledge Technology ( RSKT' l0 ) , Bei- jing, 2010. Berlin, Heidelberg: Springer-Verlag, 2010: 336-343.
  • 6Liang Jiye, Wang Feng, Dang Chuangyin, et al. An efficient rough feature selection algorithm with a multi-granulation view [ J ]. Interna- tional Journal of Approximate Reasoning, 2012, 53(6) : 912 -926.
  • 7Chu C T, Kin S, Lin Y A, et al. MapReduce for machine leaming on multicore [ C ]//Schlkopf Bernhard, Platt John, Hofmann Thomas. Advances in Neural Information Processing System 19. MIT Press, 2006:281 - 288.
  • 8王元卓,靳小龙,程学旗.网络大数据:现状与展望[J].计算机学报,2013,36(6):1125-1138. 被引量:711
  • 9王双成,杜瑞杰,刘颖.连续属性完全贝叶斯分类器的学习与优化[J].计算机学报,2012,35(10):2129-2138. 被引量:37
  • 10幸莉仙,黄慧连.MapReduce框架下的朴素贝叶斯算法并行化研究[J].计算机系统应用,2013,22(2):108-111. 被引量:9

二级参考文献147

共引文献1180

同被引文献44

引证文献6

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部