期刊文献+

基于决策树挖掘算法的气象大数据云平台设计 被引量:5

Design of Meteorological Big Data Cloud Platform Based on Classification and Regression Trees Mining Algorithm
下载PDF
导出
摘要 大数据、云计算技术的迅猛发展为挖掘气象数据丰富的科研和经济价值提供了技术支撑,促进了Hadoop及其包含的文件存储系统(HDFS,hadoop distributed file system)和分布式计算模型在气象数据处理领域广泛应用;由于气象数据具有大数据的4 V特征,还需要引入新的数据处理算法来提高气象数据处理效率;通过对决策树算法原理的研究,基于Hadoop云平台,创建随机森林模型,为数据挖掘算法在云平台上的应用提供一种新的可能性;基于决策树(CART,classification and regression trees)挖掘算法的气象大数据云平台设计,采用Hadoop系统架构和MapReduce工作流程,对气象大数据云平台采用集群部署;平台总体架构分为基础设施层、数据管理与处理层、应用层,减少了决策树建立的时间,实现了气象数据高效加工和挖掘分析等平台功能。 The rapid development of big data and cloud computing technology provides the technical support for mining the rich scientific research and economic value of meteorological data.It promotes the wide application of Hadoop and Hadoop distributed file system(HDFS)and distributed computing model in the field of meteorological data processing.Due to the 4 V characteristics of big data,the new data processing algorithms need to be introduced to improve the efficiency of meteorological data processing.Through the research on the principle of classification and regression trees(CART)algorithm,based on Hadoop cloud platform,a random forest model is constructed,which provides a new possibility for the application of data mining algorithm on the cloud platform.The design of meteorological big data cloud platform based on CART mining algorithm adopts Hadoop system architecture and MapReduce workflow to deploy the meteorological big data cloud platform in clusters.The overall architecture of the platform is divided into the infrastructure layer,data management and processing layer,application layer,which reduces the time to establish the decision tree and realizes the functions of the big data cloud platform such as efficient processing and mining analysis of the meteorological data.
作者 王立俊 杜建华 刘骥超 王双双 谢寒生 赵冰 WANG Lijun;DU Jianhua;LIU Jichao;WANG Shuangshuang;XIE Hansheng;ZHAO Bing(Meteorological Information Center of Hainan Province,Haikou 570203,China;Key Laboratory of South China Sea Meteorological Disaster Prevention and Mitigation of Hainan Province,Haikou 570203)
出处 《计算机测量与控制》 2022年第11期140-146,共7页 Computer Measurement &Control
基金 国家自然科学基金(41775011) 海南省气象局科技创新项目(HNQXSJ202114)。
关键词 气象数据 气象大数据云平台 决策树算法 HADOOP MAPREDUCE meteorological data meteorological big data cloud platform classification and regression trees Hadoop MapReduce
  • 相关文献

参考文献16

二级参考文献228

共引文献116

同被引文献58

引证文献5

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部