
Development of Agricultural Water-Consumption Prediction Model Based on Machine Learning and SHAP
Abstract: Predicting agricultural water usage is a key step in regional water resource planning and provides important guidance for the rational development of water resources and for food security. However, existing agricultural water usage prediction models commonly suffer from redundant input parameters and insufficient accuracy, which hinders effective water resource management and decision optimization. This study therefore takes the Hetao irrigation district in Inner Mongolia as the study area. First, principal component analysis (PCA) is applied to the driving factors of agricultural water usage to identify the key factors influencing water usage in the district. Second, several machine learning models for predicting agricultural water usage are constructed. Finally, the SHapley Additive exPlanations (SHAP) method is used to verify that applying the best-performing model is reasonable and to examine the contribution of each feature to agricultural water usage. The results show that the multilayer perceptron (MLP) neural network model predicts agricultural water usage effectively, with an R² of 0.84, and performs better than the five other machine learning models considered: least absolute shrinkage and selection operator regression (Lasso), ridge regression (Ridge), decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost). A SHAP-based quantitative analysis of the input parameters of the MLP model shows that the total output value of the primary industry and grain yield have relatively high mean absolute SHAP values, while the size of the SHAP contributions differs slightly among irrigation sub-districts. A prediction model built with factor screening can predict agricultural water usage accurately, which supports precision irrigation in the district and improves water use efficiency. This is of practical significance for easing future conflicts between water supply and demand in the Hetao irrigation district.
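To make the two quantities named in the abstract concrete, the following is a minimal sketch of how an R² score and a mean absolute SHAP ranking can be computed. It is not the authors' code: `toy_model` is a hand-written stand-in for a fitted regressor (not the paper's MLP), the feature labels and sample values are hypothetical, and Shapley values are computed exactly by enumerating feature coalitions against a mean baseline, which is only feasible for a handful of features. In practice a library approximation would normally be used instead.

```python
from itertools import combinations
from math import factorial

# Hypothetical feature labels, used only for illustration.
FEATURES = ["primary_industry_output", "grain_yield", "sown_area"]


def toy_model(x):
    """Stand-in for a fitted regressor (NOT the paper's MLP)."""
    return 3.0 * x[0] + 2.0 * x[1] + 0.5 * x[0] * x[2]


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one sample.

    Features outside a coalition are replaced by their baseline value.
    The cost grows as 2^n, so this only suits very small n.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Weight of a coalition of this size in the Shapley average.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                with_i = list(without_i)
                with_i[i] = x[i]
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi


def mean_abs_shap(predict, samples, baseline):
    """Average |Shapley value| per feature over all samples."""
    totals = [0.0] * len(baseline)
    for x in samples:
        for i, value in enumerate(shapley_values(predict, x, baseline)):
            totals[i] += abs(value)
    return [t / len(samples) for t in totals]


# Made-up samples; the baseline is the per-feature mean.
samples = [[1.0, 2.0, 1.0], [2.0, 1.0, 3.0], [0.5, 0.5, 2.0]]
baseline = [sum(col) / len(col) for col in zip(*samples)]

# Rank features by mean absolute Shapley value, largest first.
ranking = sorted(
    zip(FEATURES, mean_abs_shap(toy_model, samples, baseline)),
    key=lambda pair: pair[1],
    reverse=True,
)
```

For each sample, the Shapley values sum to the difference between the model's prediction for that sample and its prediction at the baseline. This additivity is what allows a per-feature mean absolute value to be read as a contribution to the predicted water usage.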
Authors: ZAN Zi-yi (昝子懿); YUE Wei-feng (岳卫峰); ZHAO Hang-zheng (赵航正); CAO Chang-ming (曹倡铭); HU Jing-dan (胡竞丹); HU Li-tang (胡立堂); XU Yang (徐洋); CHEN Ai-ping (陈爱萍) (College of Water Sciences, Beijing Normal University, Beijing 100875, China; Experimental Station of Yichang Subcenter of Water Development Center of Hetao Irrigation District in Inner Mongolia, Wuyuan 015100, Inner Mongolia, China)
Source: Water Saving Irrigation (《节水灌溉》), Peking University Core Journal, 2024, No. 12, pp. 102-110 (9 pages)
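Funding: National Key Research and Development Program of China (2021YFC3201204); Special Coordinated Key Project of the Tsinghua University-Ningxia Yinchuan Joint Institute of Internet of Waters on Digital Water Governance (SKL-IOW-2023TC2307).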
Keywords: agricultural water usage; machine learning; water usage prediction; SHAP; Hetao irrigation district
