Improving the accuracy of flood prediction and mapping is crucial for reducing damage resulting from flood events.In this study,we proposed and validated three ensemble models based on the Best First Decision Tree(BFT...Improving the accuracy of flood prediction and mapping is crucial for reducing damage resulting from flood events.In this study,we proposed and validated three ensemble models based on the Best First Decision Tree(BFT)and the Bagging(Bagging-BFT),Decorate(Bagging-BFT),and Random Subspace(RSS-BFT)ensemble learning techniques for an improved prediction of flood susceptibility in a spatially-explicit manner.A total number of 126 historical flood events from the Nghe An Province(Vietnam)were connected to a set of 10 flood influencing factors(slope,elevation,aspect,curvature,river density,distance from rivers,flow direction,geology,soil,and land use)for generating the training and validation datasets.The models were validated via several performance metrics that demonstrated the capability of all three ensemble models in elucidating the underlying pattern of flood occurrences within the research area and predicting the probability of future flood events.Based on the Area Under the receiver operating characteristic Curve(AUC),the ensemble Decorate-BFT model that achieved an AUC value of 0.989 was identified as the superior model over the RSS-BFT(AUC=0.982)and Bagging-BFT(AUC=0.967)models.A comparison between the performance of the models and the models previously reported in the literature confirmed that our ensemble models provided a reliable estimate of flood susceptibilities and their resulting susceptibility maps are trustful for flood early warning systems as well as development of mitigation plans.展开更多
The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensembl...The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensemble techniques namely AdaBoost ensemble(ABLWL),Bagging ensemble(BLWL),Multi Boost ensemble(MBLWL),Rotation Forest ensemble(RFLWL)with Locally Weighted Learning(LWL)algorithm as a base classifier to build the groundwater potential map of Gia Lai province in Vietnam.For this study,eleven conditioning factors(aspect,altitude,curvature,slope,Stream Transport Index(STI),Topographic Wetness Index(TWI),soil,geology,river density,rainfall,land-use)and 134 wells yield data was used to create training(70%)and testing(30%)datasets for the development and validation of the models.Several statistical indices were used namely Positive Predictive Value(PPV),Negative Predictive Value(NPV),Sensitivity(SST),Specificity(SPF),Accuracy(ACC),Kappa,and Receiver Operating Characteristics(ROC)curve to validate and compare performance of models.Results show that performance of all the models is good to very good(AUC:0.75 to 0.829)but the ABLWL model with AUC=0.89 is the best.All the models applied in this study can support decision-makers to streamline the management of the groundwater and to develop economy not only of specific territories but also in other regions across the world with minor changes of the input parameters.展开更多
Water level predictions in the river,lake and delta play an important role in flood management.Every year Mekong River delta of Vietnam is experiencing flood due to heavy monsoon rains and high tides.Land subsidence m...Water level predictions in the river,lake and delta play an important role in flood management.Every year Mekong River delta of Vietnam is experiencing flood due to heavy monsoon rains and high tides.Land subsidence may also aggravate flooding problems in this area.Therefore,accurate predictions of water levels in this region are very important to forewarn the people and authorities for taking timely adequate remedial measures to prevent losses of life and property.There are so many methods available to predict the water levels based on historical data but nowadays Machine Learning(ML)methods are considered the best tool for accurate prediction.In this study,we have used surface water level data of 18 water level measurement stations of the Mekong River delta from 2000 to 2018 to build novel time-series Bagging based hybrid ML models namely:Bagging(RF),Bagging(SOM)and Bagging(M5P)to predict historical water levels in the study area.Performances of the Bagging-based hybrid models were compared with Reduced Error Pruning Trees(REPT),which is a benchmark ML model.The data of 19 years period was divided into 70:30 ratio for the modeling.The data of the period 1/2000 to 5/2013(which is about 70%of total data)was used for the training and for the period 5/2013 to 12/2018(which is about 30%of total data)was used for testing(validating)the models.Performance of the models was evaluated using standard statistical measures:Coefficient of Determination(R2),Root Mean Square Error(RMSE)and Mean Absolute Error(MAE).Results show that the performance of all the developed models is good(R2>0.9)for the prediction of water levels in the study area.However,the Bagging-based hybrid models are slightly better than another model such as REPT.Thus,these Bagging-based hybrid time series models can be used for predicting water levels at Mekong data.展开更多
In this paper,we developed highly accurate ensemble machine learning models integrating Reduced Error Pruning Tree(REPT)as a base classifier with the Bagging(B),Decorate(D),and Random Subspace(RSS)ensemble learning te...In this paper,we developed highly accurate ensemble machine learning models integrating Reduced Error Pruning Tree(REPT)as a base classifier with the Bagging(B),Decorate(D),and Random Subspace(RSS)ensemble learning techniques for spatial prediction of rainfallinduced landslides in the Uttarkashi district,located in the Himalayan range,India.To do so,a total of 103 historical landslide events were linked to twelve conditioning factors for generating training and validation datasets.Root Mean Square Error(RMSE)and Area Under the receiver operating characteristic Curve(AUC)were used to evaluate the training and validation performances of the models.The results showed that the single REPT model and its derived ensembles provided a satisfactory accuracy for the prediction of landslides.The D-REPT model with RMSE=0.351 and AUC=0.907 was identified as the most accurate model,followed by RSS-REPT(RMSE=0.353 and AUC=0.898),B-REPT(RMSE=0.396 and AUC=0.876),and the single REPT model(RMSE=0.398 and AUC=0.836),respectively.The prominent ensemble models proposed and verified in this study provide engineers and modelers with insights for development of more advanced predictive models for different landslide-susceptible areas around the world.展开更多
基金funding from the Vietnam National Foundation for Science and Technology Development(NAFOSTED)under Grant No.105.08-2019.03。
文摘Improving the accuracy of flood prediction and mapping is crucial for reducing damage resulting from flood events.In this study,we proposed and validated three ensemble models based on the Best First Decision Tree(BFT)and the Bagging(Bagging-BFT),Decorate(Bagging-BFT),and Random Subspace(RSS-BFT)ensemble learning techniques for an improved prediction of flood susceptibility in a spatially-explicit manner.A total number of 126 historical flood events from the Nghe An Province(Vietnam)were connected to a set of 10 flood influencing factors(slope,elevation,aspect,curvature,river density,distance from rivers,flow direction,geology,soil,and land use)for generating the training and validation datasets.The models were validated via several performance metrics that demonstrated the capability of all three ensemble models in elucidating the underlying pattern of flood occurrences within the research area and predicting the probability of future flood events.Based on the Area Under the receiver operating characteristic Curve(AUC),the ensemble Decorate-BFT model that achieved an AUC value of 0.989 was identified as the superior model over the RSS-BFT(AUC=0.982)and Bagging-BFT(AUC=0.967)models.A comparison between the performance of the models and the models previously reported in the literature confirmed that our ensemble models provided a reliable estimate of flood susceptibilities and their resulting susceptibility maps are trustful for flood early warning systems as well as development of mitigation plans.
基金funded by Vietnam National Foundation for Science and Technology Development(NAFOSTED)under grant number 105.08-2019.03.
文摘The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensemble techniques namely AdaBoost ensemble(ABLWL),Bagging ensemble(BLWL),Multi Boost ensemble(MBLWL),Rotation Forest ensemble(RFLWL)with Locally Weighted Learning(LWL)algorithm as a base classifier to build the groundwater potential map of Gia Lai province in Vietnam.For this study,eleven conditioning factors(aspect,altitude,curvature,slope,Stream Transport Index(STI),Topographic Wetness Index(TWI),soil,geology,river density,rainfall,land-use)and 134 wells yield data was used to create training(70%)and testing(30%)datasets for the development and validation of the models.Several statistical indices were used namely Positive Predictive Value(PPV),Negative Predictive Value(NPV),Sensitivity(SST),Specificity(SPF),Accuracy(ACC),Kappa,and Receiver Operating Characteristics(ROC)curve to validate and compare performance of models.Results show that performance of all the models is good to very good(AUC:0.75 to 0.829)but the ABLWL model with AUC=0.89 is the best.All the models applied in this study can support decision-makers to streamline the management of the groundwater and to develop economy not only of specific territories but also in other regions across the world with minor changes of the input parameters.
基金funded by Vietnam Academy of Science and Technology(VAST)under Project Codes KHCBTÐ.02/19-21 and UQÐTCB.02/19-20.
文摘Water level predictions in the river,lake and delta play an important role in flood management.Every year Mekong River delta of Vietnam is experiencing flood due to heavy monsoon rains and high tides.Land subsidence may also aggravate flooding problems in this area.Therefore,accurate predictions of water levels in this region are very important to forewarn the people and authorities for taking timely adequate remedial measures to prevent losses of life and property.There are so many methods available to predict the water levels based on historical data but nowadays Machine Learning(ML)methods are considered the best tool for accurate prediction.In this study,we have used surface water level data of 18 water level measurement stations of the Mekong River delta from 2000 to 2018 to build novel time-series Bagging based hybrid ML models namely:Bagging(RF),Bagging(SOM)and Bagging(M5P)to predict historical water levels in the study area.Performances of the Bagging-based hybrid models were compared with Reduced Error Pruning Trees(REPT),which is a benchmark ML model.The data of 19 years period was divided into 70:30 ratio for the modeling.The data of the period 1/2000 to 5/2013(which is about 70%of total data)was used for the training and for the period 5/2013 to 12/2018(which is about 30%of total data)was used for testing(validating)the models.Performance of the models was evaluated using standard statistical measures:Coefficient of Determination(R2),Root Mean Square Error(RMSE)and Mean Absolute Error(MAE).Results show that the performance of all the developed models is good(R2>0.9)for the prediction of water levels in the study area.However,the Bagging-based hybrid models are slightly better than another model such as REPT.Thus,these Bagging-based hybrid time series models can be used for predicting water levels at Mekong data.
文摘In this paper,we developed highly accurate ensemble machine learning models integrating Reduced Error Pruning Tree(REPT)as a base classifier with the Bagging(B),Decorate(D),and Random Subspace(RSS)ensemble learning techniques for spatial prediction of rainfallinduced landslides in the Uttarkashi district,located in the Himalayan range,India.To do so,a total of 103 historical landslide events were linked to twelve conditioning factors for generating training and validation datasets.Root Mean Square Error(RMSE)and Area Under the receiver operating characteristic Curve(AUC)were used to evaluate the training and validation performances of the models.The results showed that the single REPT model and its derived ensembles provided a satisfactory accuracy for the prediction of landslides.The D-REPT model with RMSE=0.351 and AUC=0.907 was identified as the most accurate model,followed by RSS-REPT(RMSE=0.353 and AUC=0.898),B-REPT(RMSE=0.396 and AUC=0.876),and the single REPT model(RMSE=0.398 and AUC=0.836),respectively.The prominent ensemble models proposed and verified in this study provide engineers and modelers with insights for development of more advanced predictive models for different landslide-susceptible areas around the world.