A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector.Machine learning approaches allow for building such predictive models,but the quality of ...A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector.Machine learning approaches allow for building such predictive models,but the quality of predictions decreases if data is scarce.In this work,we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria.We first increased the dimension of each data set by adding more features,and then we augmented the size of the data by merging the two data sets.To assess the effectiveness of data-augmentation approaches,we conducted three sets of experiments based on three data sets:the primary data sets,data sets with additional features and the augmented data sets obtained by merging,using five regression models(Support Vector Regression,Random Forest,Extreme Learning Machine,Artificial Neural Network,Deep Neural Network).To evaluate the models,we used cross-validation;the results showed an overall increase in performance with the augmented data.DNN outperformed the other models for the first Province with a Root Mean Square Error(RMSE)of 0.04 q/ha and R_Squared(R^(2))of 0.96,whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha.展开更多
文摘A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector.Machine learning approaches allow for building such predictive models,but the quality of predictions decreases if data is scarce.In this work,we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria.We first increased the dimension of each data set by adding more features,and then we augmented the size of the data by merging the two data sets.To assess the effectiveness of data-augmentation approaches,we conducted three sets of experiments based on three data sets:the primary data sets,data sets with additional features and the augmented data sets obtained by merging,using five regression models(Support Vector Regression,Random Forest,Extreme Learning Machine,Artificial Neural Network,Deep Neural Network).To evaluate the models,we used cross-validation;the results showed an overall increase in performance with the augmented data.DNN outperformed the other models for the first Province with a Root Mean Square Error(RMSE)of 0.04 q/ha and R_Squared(R^(2))of 0.96,whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha.