In this editorial,we comment on the article by Zhang et al entitled Development of a machine learning-based model for predicting the risk of early postoperative recurrence of hepatocellular carcinoma.Hepatocellular ca...In this editorial,we comment on the article by Zhang et al entitled Development of a machine learning-based model for predicting the risk of early postoperative recurrence of hepatocellular carcinoma.Hepatocellular carcinoma(HCC),which is characterized by high incidence and mortality rates,remains a major global health challenge primarily due to the critical issue of postoperative recurrence.Early recurrence,defined as recurrence that occurs within 2 years posttreatment,is linked to the hidden spread of the primary tumor and significantly impacts patient survival.Traditional predictive factors,including both patient-and treatment-related factors,have limited predictive ability with respect to HCC recurrence.The integration of machine learning algorithms is fueled by the exponential growth of computational power and has revolutionized HCC research.The study by Zhang et al demonstrated the use of a groundbreaking preoperative prediction model for early postoperative HCC recurrence.Challenges persist,including sample size constraints,issues with handling data,and the need for further validation and interpretability.This study emphasizes the need for collaborative efforts,multicenter studies and comparative analyses to validate and refine the model.Overcoming these challenges and exploring innovative approaches,such as multi-omics integration,will enhance personalized oncology care.This study marks a significant stride toward precise,efficient,and personalized oncology practices,thus offering hope for improved patient outcomes in the field of HCC treatment.展开更多
Climate change has an impact on forest fire patterns.In the context of global warming,it is important to study the possible effects of climate change on forest fires,carbon emission reductions,carbon sink effects,fore...Climate change has an impact on forest fire patterns.In the context of global warming,it is important to study the possible effects of climate change on forest fires,carbon emission reductions,carbon sink effects,forest fire management,and sustainable development of forest ecosystems.This study is based on MODIS active fire data from 2001-2020 and the influence of climate,topography,vegetation,and social factors were integrated.Temperature and precipitation information from different scenarios of the BCC-CSM2-MR climate model were used as future climate data.Under climate change scenarios of a sustainable low development path and a high conventional development path,the extreme gradient boosting model predicted the spatial distribution of forest fire occurrence in China in the 2030s(2021-2040),2050s(2041-2060),2070s(2061-2080),and2090s(2081-2100).Probability maps were generated and tested using ROC curves.The results show that:(1)the area under the ROC curve of training data(70%)and validation data(30%)were 0.8465 and 0.8171,respectively,indicating that the model can reasonably predict the occurrence of forest fire in the study area;(2)temperature,elevation,and precipitation were strongly correlated with fire occurrence,while land type,slope,distance from settlements and roads,and slope direction were less strongly correlated;and,(3)based on future climate change scenarios,the probability of forest fire occurrence will tend to shift from the south to the center of the country.Compared with the current climate(2001-2020),the occurrence of forest fires in 2021-2040,2041-2060,2061-2080,and 2081-2100 will increase significantly in Henan Province(Luoyang,Nanyang,S anmenxia),Shaanxi Province(Shangluo,Ankang),Sichuan Province(Mianyang,Guangyuan,Ganzi),Tibet Autonomous Region(Shannan,Linzhi,Changdu),Liaoning Province(Liaoyang,Fushun,Dandong).展开更多
The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train.A fault diagnostic model of on-board equipment is built utilizing the integrated learning X...The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train.A fault diagnostic model of on-board equipment is built utilizing the integrated learning XGBoost(eXtreme Gradient Boosting)algorithm to help technicians assess the malfunction category of high-speed train control on-board equipment accurately and rapidly.The XGBoost algorithm iterates multiple decision tree models to improve the accuracy of fault diagnosis by lifting the predicted residual and adding regular terms.To begin,the text features were extracted using the improved TF-IDF(Term Frequency-Inverse Document Frequency)approach,and 24 fault feature words were chosen and converted into weight word vectors.Secondly,considering the imbalanced fault categories in the data set,the ADASYN(Adaptive Synthetic sampling)adaptive synthetically oversampling technique was used to synthesize a few category fault samples.Finally,the data samples were split into training and test sets based on the fault text data of CTCS-3train control on-board equipment recorded by Guangzhou Railway Group maintenance personnel.The XGBoost model was utilized to realize the automatic fault location of the test set after optimized parameter tuning through grid search.Compared with other methods,the evaluation index of the XGBoost model was significantly improved.The diagnostic accuracy reached 95.43%,which verifies the effectiveness of the method in text fault diagnosis.展开更多
This study investigates the application of machine learning models to address after-sales service issues in cross-border e-commerce,focusing on predicting order returns to reduce return costs and optimize customer exp...This study investigates the application of machine learning models to address after-sales service issues in cross-border e-commerce,focusing on predicting order returns to reduce return costs and optimize customer experience.Using H cross-border e-commerce company as a case study,the research employs Random Forest and XGBoost models to identify high-risk return orders.By comparing the performance of these two models,the study highlights their respective strengths and weaknesses and proposes optimization strategies.The findings provide a valuable reference for e-commerce companies to refine their business models,reduce return rates,improve operational efficiency,and enhance customer satisfaction.展开更多
As an essential property of frozen soils,change of unfrozen water content(UWC)with temperature,namely soil-freezing characteristic curve(SFCC),plays significant roles in numerous physical,hydraulic and mechanical proc...As an essential property of frozen soils,change of unfrozen water content(UWC)with temperature,namely soil-freezing characteristic curve(SFCC),plays significant roles in numerous physical,hydraulic and mechanical processes in cold regions,including the heat and water transfer within soils and at the land–atmosphere interface,frost heave and thaw settlement,as well as the simulation of coupled thermo-hydro-mechanical interactions.Although various models have been proposed to estimate SFCC,their applicability remains limited due to their derivation from specific soil types,soil treatments,and test devices.Accordingly,this study proposes a novel data-driven model to predict the SFCC using an extreme Gradient Boosting(XGBoost)model.A systematic database for SFCC of frozen soils compiled from extensive experimental investigations via various testing methods was utilized to train the XGBoost model.The predicted soil freezing characteristic curves(SFCC,UWC as a function of temperature)from the well-trained XGBoost model were compared with original experimental data and three conventional models.The results demonstrate the superior performance of the proposed XGBoost model over the traditional models in predicting SFCC.This study provides valuable insights for future investigations regarding the SFCC of frozen soils.展开更多
文摘In this editorial,we comment on the article by Zhang et al entitled Development of a machine learning-based model for predicting the risk of early postoperative recurrence of hepatocellular carcinoma.Hepatocellular carcinoma(HCC),which is characterized by high incidence and mortality rates,remains a major global health challenge primarily due to the critical issue of postoperative recurrence.Early recurrence,defined as recurrence that occurs within 2 years posttreatment,is linked to the hidden spread of the primary tumor and significantly impacts patient survival.Traditional predictive factors,including both patient-and treatment-related factors,have limited predictive ability with respect to HCC recurrence.The integration of machine learning algorithms is fueled by the exponential growth of computational power and has revolutionized HCC research.The study by Zhang et al demonstrated the use of a groundbreaking preoperative prediction model for early postoperative HCC recurrence.Challenges persist,including sample size constraints,issues with handling data,and the need for further validation and interpretability.This study emphasizes the need for collaborative efforts,multicenter studies and comparative analyses to validate and refine the model.Overcoming these challenges and exploring innovative approaches,such as multi-omics integration,will enhance personalized oncology care.This study marks a significant stride toward precise,efficient,and personalized oncology practices,thus offering hope for improved patient outcomes in the field of HCC treatment.
基金funded by the National Postdoctoral Innovative Talents Support Plan China Postdoctoral Science Foundation (BX20220038)Key R&D Projects in Hainan Province (ZDYF2021SHFZ256)。
文摘Climate change has an impact on forest fire patterns.In the context of global warming,it is important to study the possible effects of climate change on forest fires,carbon emission reductions,carbon sink effects,forest fire management,and sustainable development of forest ecosystems.This study is based on MODIS active fire data from 2001-2020 and the influence of climate,topography,vegetation,and social factors were integrated.Temperature and precipitation information from different scenarios of the BCC-CSM2-MR climate model were used as future climate data.Under climate change scenarios of a sustainable low development path and a high conventional development path,the extreme gradient boosting model predicted the spatial distribution of forest fire occurrence in China in the 2030s(2021-2040),2050s(2041-2060),2070s(2061-2080),and2090s(2081-2100).Probability maps were generated and tested using ROC curves.The results show that:(1)the area under the ROC curve of training data(70%)and validation data(30%)were 0.8465 and 0.8171,respectively,indicating that the model can reasonably predict the occurrence of forest fire in the study area;(2)temperature,elevation,and precipitation were strongly correlated with fire occurrence,while land type,slope,distance from settlements and roads,and slope direction were less strongly correlated;and,(3)based on future climate change scenarios,the probability of forest fire occurrence will tend to shift from the south to the center of the country.Compared with the current climate(2001-2020),the occurrence of forest fires in 2021-2040,2041-2060,2061-2080,and 2081-2100 will increase significantly in Henan Province(Luoyang,Nanyang,S anmenxia),Shaanxi Province(Shangluo,Ankang),Sichuan Province(Mianyang,Guangyuan,Ganzi),Tibet Autonomous Region(Shannan,Linzhi,Changdu),Liaoning Province(Liaoyang,Fushun,Dandong).
基金supported by the Science and Tec hnology Research and Development Plan Contract of China National Railway Group Co.,Ltd(Grant No.N2022G012)the Railway Science and Technology Research and Development Center Project(Project No.SYF2022SJ004).
文摘The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train.A fault diagnostic model of on-board equipment is built utilizing the integrated learning XGBoost(eXtreme Gradient Boosting)algorithm to help technicians assess the malfunction category of high-speed train control on-board equipment accurately and rapidly.The XGBoost algorithm iterates multiple decision tree models to improve the accuracy of fault diagnosis by lifting the predicted residual and adding regular terms.To begin,the text features were extracted using the improved TF-IDF(Term Frequency-Inverse Document Frequency)approach,and 24 fault feature words were chosen and converted into weight word vectors.Secondly,considering the imbalanced fault categories in the data set,the ADASYN(Adaptive Synthetic sampling)adaptive synthetically oversampling technique was used to synthesize a few category fault samples.Finally,the data samples were split into training and test sets based on the fault text data of CTCS-3train control on-board equipment recorded by Guangzhou Railway Group maintenance personnel.The XGBoost model was utilized to realize the automatic fault location of the test set after optimized parameter tuning through grid search.Compared with other methods,the evaluation index of the XGBoost model was significantly improved.The diagnostic accuracy reached 95.43%,which verifies the effectiveness of the method in text fault diagnosis.
文摘This study investigates the application of machine learning models to address after-sales service issues in cross-border e-commerce,focusing on predicting order returns to reduce return costs and optimize customer experience.Using H cross-border e-commerce company as a case study,the research employs Random Forest and XGBoost models to identify high-risk return orders.By comparing the performance of these two models,the study highlights their respective strengths and weaknesses and proposes optimization strategies.The findings provide a valuable reference for e-commerce companies to refine their business models,reduce return rates,improve operational efficiency,and enhance customer satisfaction.
基金supported by the National Natural Science Foundation of China(Grant No.42177291)Innovation Capability Support Program of Shaanxi Province(2023-JC-JQ-25 and 2021KJXX-11).
文摘As an essential property of frozen soils,change of unfrozen water content(UWC)with temperature,namely soil-freezing characteristic curve(SFCC),plays significant roles in numerous physical,hydraulic and mechanical processes in cold regions,including the heat and water transfer within soils and at the land–atmosphere interface,frost heave and thaw settlement,as well as the simulation of coupled thermo-hydro-mechanical interactions.Although various models have been proposed to estimate SFCC,their applicability remains limited due to their derivation from specific soil types,soil treatments,and test devices.Accordingly,this study proposes a novel data-driven model to predict the SFCC using an extreme Gradient Boosting(XGBoost)model.A systematic database for SFCC of frozen soils compiled from extensive experimental investigations via various testing methods was utilized to train the XGBoost model.The predicted soil freezing characteristic curves(SFCC,UWC as a function of temperature)from the well-trained XGBoost model were compared with original experimental data and three conventional models.The results demonstrate the superior performance of the proposed XGBoost model over the traditional models in predicting SFCC.This study provides valuable insights for future investigations regarding the SFCC of frozen soils.