期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
A Stacked Ensemble Deep Learning Approach for Imbalanced Multi-Class Water Quality Index Prediction
1
作者 Wen Yee Wong Khairunnisa Hasikin +4 位作者 Anis Salwa Mohd Khairuddin Sarah Abdul Razak Hanee Farzana Hizaddin Mohd Istajib Mokhtar muhammad mokhzaini azizan 《Computers, Materials & Continua》 SCIE EI 2023年第8期1361-1384,共24页
A common difficulty in building prediction models with real-world environmental datasets is the skewed distribution of classes.There are significantly more samples for day-to-day classes,while rare events such as poll... A common difficulty in building prediction models with real-world environmental datasets is the skewed distribution of classes.There are significantly more samples for day-to-day classes,while rare events such as polluted classes are uncommon.Consequently,the limited availability of minority outcomes lowers the classifier’s overall reliability.This study assesses the capability of machine learning(ML)algorithms in tackling imbalanced water quality data based on the metrics of precision,recall,and F1 score.It intends to balance the misled accuracy towards the majority of data.Hence,10 ML algorithms of its performance are compared.The classifiers included are AdaBoost,SupportVector Machine,Linear Discriminant Analysis,k-Nearest Neighbors,Naive Bayes,Decision Trees,Random Forest,Extra Trees,Bagging,and the Multilayer Perceptron.This study also uses the Easy Ensemble Classifier,Balanced Bagging,andRUSBoost algorithm to evaluatemulti-class imbalanced learning methods.The comparison results revealed that a highaccuracy machine learning model is not always good in recall and sensitivity.This paper’s stacked ensemble deep learning(SE-DL)generalization model effectively classifies the water quality index(WQI)based on 23 input variables.The proposed algorithm achieved a remarkable average of 95.69%,94.96%,92.92%,and 93.88%for accuracy,precision,recall,and F1 score,respectively.In addition,the proposed model is compared against two state-of-the-art classifiers,the XGBoost(eXtreme Gradient Boosting)and Light Gradient Boosting Machine,where performance metrics of balanced accuracy and g-mean are included.The experimental setup concluded XGBoost with a higher balanced accuracy and G-mean.However,the SE-DL model has a better and more balanced performance in the F1 score.The SE-DL model aligns with the goal of this study to ensure the balance between accuracy and completeness for each water quality class.The proposed algorithm is also capable of higher efficiency at a lower computational time against using the standard SyntheticMinority Oversampling Technique(SMOTE)approach to imbalanced datasets. 展开更多
关键词 Water quality classification imbalanced data SMOTE stacked ensemble deep learning sensitivity analysis
下载PDF
Water Quality Index Using Modified Random Forest Technique: Assessing Novel Input Features 被引量:2
2
作者 Wen Yee Wong Ayman Khallel Ibrahim Al-Ani +5 位作者 Khairunnisa Hasikin Anis Salwa Mohd Khairuddin Sarah Abdul Razak Hanee Farzana Hizaddin Mohd Istajib Mokhtar muhammad mokhzaini azizan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2022年第9期1011-1038,共28页
Water quality analysis is essential to understand the ecological status of aquatic life.Conventional water quality index(WQI)assessment methods are limited to features such as water acidic or basicity(pH),dissolved ox... Water quality analysis is essential to understand the ecological status of aquatic life.Conventional water quality index(WQI)assessment methods are limited to features such as water acidic or basicity(pH),dissolved oxygen(DO),biological oxygen demand(BOD),chemical oxygen demand(COD),ammoniacal nitrogen(NH3-N),and suspended solids(SS).These features are often insufficient to represent the water quality of a heavy metal–polluted river.Therefore,this paper aims to explore and analyze novel input features in order to formulate an improved WQI.In this work,prospective insights on the feasibility of alternative water quality input variables as new discriminant features are discussed.The new discriminant features are a step toward formulating adaptive water quality parameters according to the land use activities surrounding the river.The results and analysis obtained from this study have proven the possibility of predicting WQI using new input features.This work analyzes 17 new input features,namely conductivity(COND),salinity(SAL),turbidity(TUR),dissolved solids(DS),nitrate(NO3),chloride(Cl),phosphate(PO4),arsenic(As),chromium(Cr),zinc(Zn),calcium(Ca),iron(Fe),potassium(K),magnesium(Mg),sodium(Na),E.coli,and total coliform,in predicting WQI using machine learning techniques.Five regression algorithms-random forest(RF),AdaBoost,support vector regression(SVR),decision tree regression(DTR),and multilayer perception(MLP)-are applied for preliminary model selection.The results show that the RF algorithm exhibits better prediction performance,with R2 of 0.974.Then,this work proposes a modified RF by incorporating the synthetic minority oversampling technique(SMOTE)into the conventional RF method.The proposed modified RF method is shown to achieve 77.68%,74%,69%,and 71%accuracy,precision,recall,and F1-score,respectively.In addition,the sensitivity analysis is included to highlight the importance of the turbidity variable in WQI prediction.The results of sensitivity analysis highlight the importance of certain water quality variables that are not present in the conventional WQI formulation. 展开更多
关键词 Artificial intelligence random forest environmental modeling alternative inputs SMOTE
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部