期刊文献+

基于spark平台的供电煤耗并行回归预测 被引量:1

Parallel regression prediction of coal consumption based on spark platform
下载PDF
导出
摘要 针对火电厂数据量大且复杂的特点,通过采用基于spark的并行回归算法,解决了传统供电煤耗回归预测模型所需的运行时间较长且预测精度较低的问题。本文采用了大数据平台中采集到的某电厂周期为一年的运行数据,对数据进行异常值筛选、空值填补等清洗及预处理过程,并对工况进行判稳,选取稳定工况下的健康数据进行数据分析,最后利用灰色关联度分析方法选择关联度最大的12个特征,对火电厂供电煤耗进行预测。通过参数调优建立基于spark的火电厂供电煤耗的随机森林和梯度提升决策树的并行回归模型,最后对实验结果进行比较分析和总结。结果表明,随机森林回归模型和梯度提升决策树回归模型对火电厂的供电煤耗都有较好的预测效果,但随机森林回归模型预测的准确度相对更高。 Aiming at the characteristics of large and complex data of thermal power plant,the problem of long running time and low prediction accuracy of traditional regression prediction model of power supply coal consumption is solved by using parallel regression algorithm based on spark.In this paper,the operation data of a power plant with a cycle of one year collected in the big data platform is used to carry out the cleaning and preprocessing such as abnormal value screening and null value filling,judge the stability of the working condition,select the health data under the stable working condition for data analysis,then use the grey relational analysis to select the 12 features with the largest correlation degree,the coal consumption for power supply of thermal power plant is predicted.Through parameter optimization,a parallel regression model of stochastic forest and gradient lifting decision tree for power supply coal consumption of thermal power plant based on spark is established.Finally,the experimental results are compared,analyzed and summarized.The results show that both the stochastic forest regression model and the gradient lifting decision tree regression model have good prediction effects on the power supply coal consumption of thermal power plants,but the prediction accuracy of the stochastic forest regression model is relatively higher.
作者 李偲希 白全生 舒畅 肖祥武 LI Caixi;Bai Quansheng;Shu Chang;Xiao Xiangwu(Big Data Business Unit,Hunan Datang Xianyi Technology Co.,Ltd.,Changsha 410000 Hunan,China)
出处 《电力大数据》 2021年第11期85-92,共8页 Power Systems and Big Data
关键词 决策树 集成学习 并行回归 随机森林算法 梯度提升决策树算法 decision tree ensemble learning parallel regression random forest regression gradient boosting decision tree regression
  • 相关文献

参考文献20

二级参考文献134

共引文献371

同被引文献8

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部