期刊文献+

基于LightGBM算法的能见度预测模型 被引量:13

Visibility forecast model based on LightGBM algorithm
下载PDF
导出
摘要 为了提高能见度预报的准确率,尤其是低能见度预报的准确率,提出一种基于集成学习随机森林和LightGBM的能见度预测模型。首先,以数值模式系统的气象预报数据为基础,结合地面气象观测数据、PM2.5浓度观测数据,利用随机森林算法构建特征向量;其次,针对不同时间跨度的缺失数据,设计了3种缺失值处理方法对缺失值进行替代,生成用于训练和测试的连续性较好的数据样本集;最后,建立基于LightGBM的能见度预测模型,并用网络搜索法对其进行参数优化。把所提模型与支持向量机(SVM)、多元线性回归(MLR)、人工神经网络(ANN)在性能上进行对比。实验结果表明,对于不同的等级的能见度,应用LightGBM的能见度预测模型获得预兆得分(TS)均较高,而对于<2 km的低能见度,该模型对各观测站点的能见度预测值与各观测站点的能见度实况值的平均相关系数为0.75,平均均方误差为6.49。可见基于LightGBM的预测模型能有效提高能见度预测精度。 In order to improve the accuracy of visibility forecast,especially the accuracy of low-visibility forecast,an ensemble learning model based on random forest and LightGBM for visibility forecast was proposed.Firstly,based on the meteorological forecast data of the numerical modeling system,combined with meteorological observation data and PM2.5concentration observation data,the random forest method was used to construct the feature vectors.Secondly,for the missing data with different time spans,three missing value processing methods were designed to replace the missing values,and then the data sample set with good continuity for training and testing was created.Finally,a visibility forecast model based on LightGBM was established,and its parameters were optimized by using the network search method.The proposed model was compared to Support Vector Machine(SVM),Multiple Linear Regression(MLR)and Artificial Neural Network(ANN)on performance.Experimental results show that for different levels of visibility,the proposed visibility forecast model based on LightGBM algorithm obtains the highest Threat Score(TS);when the visibility is less than 2 km,the average correlation coefficient between the visibility values of observation stations predicted by the model and the observation values of visibility of observation stations is 0.75,the average mean square error between them is 6.49.It can be seen that the forecast model based on LightGBM can effectively improve the accuracy of visibility forecast.
作者 余东昌 赵文芳 聂凯 张舸 YU Dongchang;ZHAO Wenfang;NIE Kai;ZHANG Ge(Beijing Institute of Urban Meteorology,Beijing 100089,China;Beijing Meteorological Information Center,Beijing 100089,China;Beijing Meteorological Observation Center,Beijing 100176,China;XinTuZhiXing(Beijing)Technology Corporation Limited,Beijing 100022,China)
出处 《计算机应用》 CSCD 北大核心 2021年第4期1035-1041,共7页 journal of Computer Applications
关键词 能见度预测 集成学习 随机森林算法 LightGBM算法 visibility forecast ensemble learning random forest algorithm LightGBM algorithm
  • 相关文献

参考文献20

二级参考文献280

共引文献246

同被引文献169

引证文献13

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部