Soil extracellular electron transfer(EET)is a pivotal biological process within the realm of soil.Unfortunately,EET suffers from a lack of predictive models.Herein,an intricately crafted machine learning model has bee...Soil extracellular electron transfer(EET)is a pivotal biological process within the realm of soil.Unfortunately,EET suffers from a lack of predictive models.Herein,an intricately crafted machine learning model has been developed for the purpose of predicting soil EET by using the physicochemical properties of soil as independent input variables and the EET capabilities in terms of current density(j_(max))and Coulombic charge(C_(out))as dependent output variables.An autoencoder ensemble stacking(AES)model was developed to address the aforementioned issue by integrating support vector machine,multilayer perceptron,extreme gradient boosting,and light gradient boosting machine algorithms as the stacking algorithms.With 10-fold crossvalidation,the AES model exhibited notable improvements in predicting j_(max)and C_(out),with average test R^(2)values of 0.83 and 0.84,respectively,surpassing those of single machine learning(ML)models and the basic ensemble model.By utilizing partial correlation plots(PDPs),Shapley Additive explanations(SHAP)values,and SHAP decision plots,we quantitatively explained the impact and contribution of the input molecules on the AES model’s predictions of j_(max)and C_(out).In the context of the SHAP method for the AES model,total carbon(TC)was identified as the most correlated descriptor for j_(max),while total organic carbon(TOC)stood out as the most relevant descriptor for C_(out).In the prediction tasks of j_(max)and C_(out)within the AES model,employing a multitask ML approach allowed the model to benefit from the shared information of input variables,thereby enhancing its overall generalizability.This study provides a feasible tool for the prediction of soil EET from soil physiochemical properties and an advanced understanding of the relationship between soil physiochemical properties and EET capability.展开更多
基金supported by Guangdong Basic and Applied Basic Research Foundation(Grant No.2023B1515040022)the National Natural Science Foundation of China(Grant Nos.42177270 and 42207340).
文摘Soil extracellular electron transfer(EET)is a pivotal biological process within the realm of soil.Unfortunately,EET suffers from a lack of predictive models.Herein,an intricately crafted machine learning model has been developed for the purpose of predicting soil EET by using the physicochemical properties of soil as independent input variables and the EET capabilities in terms of current density(j_(max))and Coulombic charge(C_(out))as dependent output variables.An autoencoder ensemble stacking(AES)model was developed to address the aforementioned issue by integrating support vector machine,multilayer perceptron,extreme gradient boosting,and light gradient boosting machine algorithms as the stacking algorithms.With 10-fold crossvalidation,the AES model exhibited notable improvements in predicting j_(max)and C_(out),with average test R^(2)values of 0.83 and 0.84,respectively,surpassing those of single machine learning(ML)models and the basic ensemble model.By utilizing partial correlation plots(PDPs),Shapley Additive explanations(SHAP)values,and SHAP decision plots,we quantitatively explained the impact and contribution of the input molecules on the AES model’s predictions of j_(max)and C_(out).In the context of the SHAP method for the AES model,total carbon(TC)was identified as the most correlated descriptor for j_(max),while total organic carbon(TOC)stood out as the most relevant descriptor for C_(out).In the prediction tasks of j_(max)and C_(out)within the AES model,employing a multitask ML approach allowed the model to benefit from the shared information of input variables,thereby enhancing its overall generalizability.This study provides a feasible tool for the prediction of soil EET from soil physiochemical properties and an advanced understanding of the relationship between soil physiochemical properties and EET capability.