This work constructed a machine learning (ML) model to predict the atmospheric corrosion rate of low-alloy steels (LAS). The material properties of LAS, environmental factors, and exposure time were used as the inputs, and the corrosion rate as the output. Six different ML algorithms were used to construct the proposed model. Through optimization and filtering, the eXtreme gradient boosting (XGBoost) model exhibited good corrosion rate prediction accuracy. The material-property features were then transformed into atomic and physical features using the proposed property transformation approach, and the dominant descriptors affecting the corrosion rate were filtered using recursive feature elimination (RFE) together with XGBoost. The ML models built on the property transformation descriptors exhibited better prediction performance and generalization ability. In addition, the SHapley Additive exPlanations (SHAP) method was applied to analyze the relationship between the descriptors and the corrosion rate. The results showed that the property transformation model could effectively help with analyzing the corrosion behavior, thereby significantly improving the generalization ability of corrosion rate prediction models.
Accurate prediction of formation pore pressure is essential for predicting fluid flow and managing hydrocarbon production in petroleum engineering. Deep learning techniques have recently attracted growing interest because of their great potential for pore pressure prediction. However, most traditional deep learning models are less effective at addressing generalization problems. To fill this technical gap, we developed a new adaptive physics-informed deep learning model with high generalization capability to predict pore pressure values directly from seismic data. Specifically, the new model, named CGP-NN, consists of a novel parametric feature extraction approach (1DCPP), a stacked multilayer gated recurrent model (multilayer GRU), and an adaptive physics-informed loss function. Through training, the developed model can automatically select the optimal physical model to constrain the results for each pore pressure prediction. The CGP-NN model generalizes best when the physics-related metric λ = 0.5. A hybrid approach combining the Eaton and Bowers methods is also proposed to build machine-learnable labels, addressing the problem of scarce labels. To validate the developed model and methodology, a case study on a complex reservoir in the Tarim Basin was performed, demonstrating high accuracy in pore pressure prediction for new wells along with strong generalization ability. The adaptive physics-informed deep learning approach presented here has potential application in predicting pore pressures coupled with multiple genesis mechanisms using seismic data.
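As a rough illustration of the kind of physical relation that can supply labels here, the sketch below implements the sonic-log form of Eaton's relation, Pp = S - (S - Ph) * (dt_normal / dt_observed)^n. The abstract does not say how the Eaton and Bowers methods are combined, so only the Eaton half is shown; the function name, argument names, and the default exponent of 3.0 (a commonly quoted value for sonic data) are assumptions for illustration.

```python
def eaton_pore_pressure(overburden, hydrostatic, dt_normal, dt_observed, exponent=3.0):
    """Eaton's sonic relation: Pp = S - (S - Ph) * (dt_normal / dt_observed) ** n.

    overburden  -- overburden stress S (any consistent pressure unit)
    hydrostatic -- normal (hydrostatic) pore pressure Ph, same unit as S
    dt_normal   -- transit time on the normal compaction trend at this depth
    dt_observed -- measured transit time at this depth (same unit as dt_normal)
    exponent    -- Eaton exponent n (assumed default, usually calibrated per basin)
    """
    if dt_normal <= 0 or dt_observed <= 0:
        raise ValueError("transit times must be positive")
    ratio = dt_normal / dt_observed
    return overburden - (overburden - hydrostatic) * ratio ** exponent


# Hypothetical values: slower-than-normal transit time implies overpressure.
pp = eaton_pore_pressure(overburden=60.0, hydrostatic=30.0,
                         dt_normal=80.0, dt_observed=100.0)
```

When the observed transit time equals the normal trend, the relation returns the hydrostatic pressure, which is a quick sanity check on any calibration.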
Historically, landslides have been the primary type of geological disaster worldwide. Generally, the stability of reservoir banks is primarily affected by rainfall and reservoir water level fluctuations. Moreover, the stability of reservoir banks changes with the long-term dynamics of external disaster-causing factors. Thus, assessing the time-varying reliability of reservoir landslides remains a challenge. In this paper, a machine learning (ML) based approach is proposed to analyze the long-term reliability of reservoir bank landslides in spatially variable soils through time series prediction. This study systematically investigated the prediction performance of three ML algorithms, i.e., multilayer perceptron (MLP), convolutional neural network (CNN), and long short-term memory (LSTM). Additionally, the effects of the data quantity and data ratio on the predictive power of deep learning models are considered. The results show that all three ML models can accurately depict the changes in the time-varying failure probability of reservoir landslides. The CNN model outperforms both the MLP and LSTM models in predicting the failure probability. Furthermore, selecting the right data ratio can improve the accuracy of the failure probability predicted by ML models.
Accurate prediction of tropical cyclone (TC) intensity is challenging due to the complex physical processes involved. Here, we introduce a new TC intensity prediction scheme for the western North Pacific (WNP) based on a time-dependent theory of TC intensification, termed the energetically based dynamical system (EBDS) model, together with a long short-term memory (LSTM) neural network. In the time-dependent theory, TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors, expressed as the environmental dynamical efficiency. The LSTM neural network, trained using best-track TC data and global reanalysis data from 1982–2017, is used to predict the environmental dynamical efficiency in the EBDS model. Transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System (GFS) of the National Centers for Environmental Prediction during 2017–21. The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity. The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data. The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration (CMA) and those of other state-of-the-art statistical and dynamical forecast systems, except for the 72-h forecast. Particularly at the longer lead times of 96 h and 120 h, the new scheme has smaller forecast errors, with a more than 30% improvement over the official forecasts.
The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean. Among the crucial hydroacoustic environmental parameters, ocean sound velocity exhibits significant spatial and temporal variability and is highly relevant to oceanic research. In this study, we propose a new data-driven approach, leveraging deep learning techniques, for the prediction of sound velocity fields (SVFs). Our novel spatiotemporal prediction model, ST-LSTM-SA, combines Spatiotemporal Long Short-Term Memory (ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs. To circumvent the limited amount of observational data, we employ transfer learning by first training the model using reanalysis datasets, followed by fine-tuning it using in-situ analysis data to obtain the final prediction model. Using the historical 12-month SVFs as input, our model predicts the SVFs for the subsequent three months. We compare the performance of five models: Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Convolutional LSTM (ConvLSTM), ST-LSTM, and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022. Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions. The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field (SVF), but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.
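The 12-months-in, 3-months-out setup described above amounts to slicing a monthly sequence into overlapping input and target windows. The following is a minimal sketch of that slicing only, assuming a stride of one month; the function name and the use of plain lists in place of gridded SVF snapshots are illustrative assumptions, not details from the abstract.

```python
def make_windows(series, n_in=12, n_out=3):
    """Return (input, target) pairs from a monthly sequence.

    Each input holds n_in consecutive items and each target holds the
    n_out items that immediately follow it. Items may be scalars or
    whole fields (e.g. one gridded snapshot per month).
    """
    pairs = []
    last_start = len(series) - n_in - n_out
    for start in range(last_start + 1):
        x = series[start:start + n_in]
        y = series[start + n_in:start + n_in + n_out]
        pairs.append((x, y))
    return pairs


# Twenty hypothetical months, labelled 0..19 for readability.
samples = make_windows(list(range(20)))
```

With 20 months of data this yields six samples; a sequence shorter than 15 months yields none.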
The complexity of the sand-casting process, combined with the interactions between process parameters, makes it difficult to control casting quality, resulting in a high scrap rate. A strategy based on a data-driven model was proposed to reduce casting defects and improve production efficiency. It comprises a random forest (RF) classification model, a feature importance analysis, and process parameter optimization with Monte Carlo simulation. The collected data, which include four types of defects and the corresponding process parameters, were used to construct the RF model. Classification results show a recall rate above 90% for all categories. The Gini index was used to assess the importance of the process parameters in the formation of the various defects in the RF model. Finally, the classification model was applied to different production conditions for quality prediction. In the case of process parameter optimization for gas porosity defects, the model serves as the experimental process in the Monte Carlo method to estimate a better temperature distribution. The prediction model, when applied in the factory, greatly improved the efficiency of defect detection. Results show that the scrap rate decreased from 10.16% to 6.68%.
The accuracy of landslide susceptibility prediction (LSP) mainly depends on the precision of the landslide spatial position. However, spatial position errors in landslide surveys are inevitable, resulting in considerable uncertainties in LSP modeling. To overcome this drawback, this study explores the influence of landslide spatial position errors on LSP uncertainties, and then proposes a semi-supervised machine learning model to reduce the landslide spatial position error. Taking Shangyou County, China, as an example, 16 environmental factors and 337 landslides with accurate spatial positions were collected. The 30–110 m error-based multilayer perceptron (MLP) and random forest (RF) models for LSP are established by randomly offsetting the original landslides by 30, 50, 70, 90 and 110 m. The LSP uncertainties are analyzed through the LSP accuracy and distribution characteristics. Finally, a semi-supervised model is proposed to relieve the LSP uncertainties. Results show that: (1) the LSP accuracies of the error-based RF/MLP models decrease as landslide position errors increase, and are lower than those of the original data-based models; (2) the 70 m error-based models can still reflect the overall distribution characteristics of the landslide susceptibility indices, so original landslides with certain position errors are acceptable for LSP; (3) the semi-supervised machine learning model can efficiently reduce the landslide position errors and thus improve the LSP accuracies.
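The random-offset step above can be pictured with a short sketch. It assumes landslide positions are given in a projected coordinate system with units of metres and that each point is moved by a fixed distance in a uniformly random direction; the abstract says only that the landslides were "randomly offset", so the direction model and the function name are assumptions.

```python
import math
import random


def offset_point(x, y, distance_m, rng):
    """Shift a point by distance_m metres in a uniformly random direction.

    x, y are projected coordinates in metres (assumed); rng is a
    random.Random instance so that results are reproducible.
    """
    azimuth = rng.uniform(0.0, 2.0 * math.pi)
    return (x + distance_m * math.cos(azimuth),
            y + distance_m * math.sin(azimuth))


# Build one perturbed copy of a hypothetical inventory per error level.
rng = random.Random(42)
inventory = [(1000.0, 2000.0), (1500.0, 2600.0)]
perturbed = {d: [offset_point(x, y, d, rng) for x, y in inventory]
             for d in (30, 50, 70, 90, 110)}
```

Each perturbed inventory can then be fed through the same modeling pipeline as the original to compare accuracies.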
Accurately predicting fluid forces acting on the surface of a structure is crucial in engineering design. However, this task becomes particularly challenging in turbulent flow, due to the complex and irregular changes in the flow field. In this study, we propose a novel deep learning method, named mapping network-coordinated stacked gated recurrent units (MSU), for predicting pressure on a circular cylinder from velocity data. Specifically, our coordinated learning strategy is designed to extract the most critical velocity point for prediction, a process that has not been explored before. In our experiments, MSU extracts one point from a velocity field containing 121 points and uses this point to accurately predict 100 pressure points on the cylinder. This method significantly reduces the workload of data measurement in practical engineering applications. Our experimental results demonstrate that MSU predictions are highly similar to the real turbulent data in both spatio-temporal and individual aspects. Furthermore, the comparison results show that MSU predicts more precise results, even outperforming models that use all velocity field points. Compared with state-of-the-art methods, MSU achieves an average improvement of more than 45% in various indicators such as root mean square error (RMSE). Through comprehensive and authoritative physical verification, we established that MSU's prediction results closely align with pressure field data obtained in real turbulence fields. This confirmation underscores the considerable potential of MSU for practical applications in real engineering scenarios. The code is available at https://github.com/zhangzm0128/MSU.
In recent years, deep learning methods have gradually been applied to prediction tasks related to Arctic sea ice concentration, but relatively little research has been conducted at larger spatial and temporal scales, mainly due to the limited time coverage of observations and reanalysis data. Meanwhile, deep learning predictions of sea ice thickness (SIT) have yet to receive ample attention. In this study, two data-driven deep learning (DL) models are built based on the ConvLSTM and fully convolutional U-net (FC-Unet) algorithms, trained using CMIP6 historical simulations for transfer learning, and fine-tuned using reanalysis/observations. These models enable monthly predictions of Arctic SIT without considering the complex physical processes involved. Comprehensive assessments of prediction skill by season and region show that both DL models can effectively predict the spatiotemporal features of SIT anomalies, and suggest that using a broader set of CMIP6 data for transfer learning and incorporating multiple climate variables as predictors both contribute to better prediction results. For the SIT anomalies predicted by the FC-Unet model, the spatial correlations with reanalysis reach an average of 89% over all months, while the temporal anomaly correlation coefficients are close to unity in most cases. The models also demonstrate robust performance in predicting SIT and sea ice extent (SIE) during extreme events. The effectiveness and reliability of the proposed deep transfer learning models in predicting Arctic SIT can facilitate more accurate pan-Arctic predictions, aiding climate change research and real-time business applications.
In existing landslide susceptibility prediction (LSP) models, the influence of random errors in landslide conditioning factors on LSP is not considered; instead, the original conditioning factors are taken directly as the model inputs, which brings uncertainties to the LSP results. This study aims to reveal how different proportions of random errors in conditioning factors affect the LSP uncertainties, and further explores a method that can effectively reduce the random errors in conditioning factors. First, the original conditioning factors are used to construct original factors-based LSP models, and random errors of 5%, 10%, 15% and 20% are then added to these original factors to construct the corresponding errors-based LSP models. Second, low-pass filter-based LSP models are constructed by eliminating the random errors using a low-pass filter method. Third, Ruijin County, China, with 370 landslides and 16 conditioning factors, is used as the study case. Three typical machine learning models, i.e., multilayer perceptron (MLP), support vector machine (SVM) and random forest (RF), are selected as LSP models. Finally, the LSP uncertainties are discussed, and the results show that: (1) the low-pass filter can effectively reduce the random errors in conditioning factors and so decrease the LSP uncertainties; (2) as the proportion of random errors increases from 5% to 20%, the LSP uncertainty increases continuously; (3) the original factors-based models are feasible for LSP in the absence of more accurate conditioning factors; (4) the influence of the two uncertainty issues, the choice of machine learning model and the proportion of random errors, on LSP modeling is large and basically the same; (5) the Shapley values effectively explain the internal mechanism by which the machine learning models predict landslide susceptibility. In conclusion, a greater proportion of random errors in conditioning factors results in higher LSP uncertainty, and the low-pass filter can effectively reduce these random errors.
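The two data manipulations described above, adding proportional random errors and then smoothing them with a low-pass filter, can be sketched as follows. The abstract does not say how the errors are drawn or which low-pass filter is used, so the uniform multiplicative error and the centred moving average below are stand-ins chosen for simplicity; conditioning factors are also rasters in practice, whereas this sketch works on a one-dimensional profile.

```python
import random


def add_proportional_error(values, proportion, rng):
    """Perturb each value by a random fraction in [-proportion, +proportion].

    The uniform multiplicative error is an assumption for illustration.
    """
    return [v * (1.0 + rng.uniform(-proportion, proportion)) for v in values]


def moving_average(values, window=3):
    """Centred moving average, a simple low-pass filter (assumed stand-in).

    The window shrinks at both ends so the output has the same length.
    """
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed


# Hypothetical slope-angle profile with a 10% error, then filtered.
rng = random.Random(1)
slope = [12.0, 14.0, 15.0, 18.0, 22.0, 25.0, 24.0, 20.0]
noisy = add_proportional_error(slope, 0.10, rng)
filtered = moving_average(noisy, window=3)
```

Repeating this for proportions of 0.05, 0.10, 0.15 and 0.20 gives one errors-based and one filtered input set per error level.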
Short-term (up to 30 days) predictions of Earth Rotation Parameters (ERPs) such as Polar Motion (PM: PMX and PMY) play an essential role in real-time applications related to high-precision reference frame conversion. Currently, the least squares (LS) + auto-regressive (AR) hybrid method is one of the main techniques for PM prediction, and the weighted LS + AR hybrid method also performs well for short-term PM prediction. However, the covariance information of the LS fitting residuals deserves further exploration in the AR model. In this study, we derive a modified stochastic model for the LS + AR hybrid method, namely the weighted LS + weighted AR hybrid method. Numerical results based on the PM data products of IERS EOP 14 C04 indicate that, for short-term PM forecasting, the proposed weighted LS + weighted AR hybrid method has an advantage over both the LS + AR hybrid method and the weighted LS + AR hybrid method. Compared to the mean absolute errors (MAEs) of PMX/PMY short-term prediction of the LS + AR hybrid method and the weighted LS + AR hybrid method, the weighted LS + weighted AR hybrid method shows average improvements of 6.61%/12.08% and 0.24%/11.65%, respectively. In addition, the slopes of the linear regression lines fitted to the errors of each method show that the prediction error of the proposed method grows more slowly than that of the other two methods.
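The basic LS + AR idea can be sketched in a few lines: fit a deterministic model by least squares, fit an AR model to the residuals, and add the two extrapolations. The sketch below is deliberately reduced to a straight-line LS model and an AR(1) residual model with unit weights; operational PM models also include periodic terms and higher AR orders, and the weighted variants described in the abstract are not reproduced. All function names are illustrative.

```python
def fit_line(t, y):
    """Ordinary least-squares fit y = a + b * t; returns (a, b)."""
    n = len(t)
    mean_t = sum(t) / n
    mean_y = sum(y) / n
    sxx = sum((ti - mean_t) ** 2 for ti in t)
    sxy = sum((ti - mean_t) * (yi - mean_y) for ti, yi in zip(t, y))
    slope = sxy / sxx
    return mean_y - slope * mean_t, slope


def fit_ar1(residuals):
    """Least-squares AR(1) coefficient phi in r[k] = phi * r[k-1] + noise."""
    num = sum(a * b for a, b in zip(residuals[1:], residuals[:-1]))
    den = sum(b * b for b in residuals[:-1])
    return num / den if den else 0.0


def ls_ar_forecast(y, horizon):
    """Extrapolate the LS trend and add the AR(1) residual forecast."""
    t = list(range(len(y)))
    a, b = fit_line(t, y)
    residuals = [yi - (a + b * ti) for ti, yi in zip(t, y)]
    phi = fit_ar1(residuals)
    forecasts = []
    r = residuals[-1]
    for k in range(1, horizon + 1):
        r = phi * r  # AR(1) forecast of the residual k steps ahead
        forecasts.append(a + b * (len(y) - 1 + k) + r)
    return forecasts
```

A weighted version would replace the plain sums with weighted sums in both fits, which is where the covariance information of the LS residuals would enter.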
The Bozhong Sag is the largest petroliferous sag in the Bohai Bay Basin, and the source rocks of the Paleogene Dongying and Shahejie Formations are deeply buried. Most wells were drilled on structural highs and few encountered good-quality source rocks, so it is difficult to evaluate the source rocks in the study area precisely by geochemical analysis alone. Based on Rock-Eval pyrolysis and total organic carbon (TOC) testing, the organic matter (OM) abundance of the Paleogene source rocks in the southwestern Bozhong Sag was evaluated, including the lower part of the second member of the Dongying Formation (E_(3)d_(2)L), the third member of the Dongying Formation (E_(3)d_(3)), the first and second members of the Shahejie Formation (E_(2)s_(1+2)), and the third member of the Shahejie Formation (E_(2)s_(3)). The results indicate that E_(2)s_(1+2) and E_(2)s_(3) have the better hydrocarbon generative potential, with the highest OM abundance; E_(3)d_(3) is of the second-best quality; and E_(3)d_(2)L has poor to fair hydrocarbon generative potential. Furthermore, well logs were used to predict TOC and residual hydrocarbon generation potential (S_(2)) on the basis of the sedimentary facies classification, using the ΔlogR, generalized ΔlogR, logging multiple linear regression, and BP neural network methods. A comparison of the methods shows that the BP neural network method has relatively better prediction accuracy. Based on the pre-stack simultaneous inversion (P-wave impedance, P-wave velocity and density inversion results) and the post-stack seismic attributes, three-dimensional (3D) seismic prediction of TOC and S_(2) was carried out. The near-well seismic predictions of TOC and S_(2) based on seismic multi-attribute analysis correspond well with the results of the well logging methods, and the plane prediction results are consistent with the sedimentary facies map of the study area. The TOC and S_(2) values of E_(2)s_(1+2) and E_(2)s_(3) are higher than those of E_(3)d_(3) and E_(3)d_(2)L, basically consistent with the geochemical analysis results. This method makes up for the deficiencies of geochemical methods by establishing a connection between geophysical information and geochemical data, and it is helpful for the 3D quantitative prediction and evaluation of high-quality source rocks in areas where drilling is limited.
The purpose of software defect prediction is to identify defect-prone code modules to assist software quality assurance teams with the appropriate allocation of resources and labor. In previous software defect prediction studies, transfer learning was effective in solving the problem of inconsistent project data distribution. However, target projects often lack sufficient data, which affects the performance of the transfer learning model. In addition, the presence of uncorrelated features between projects can decrease the prediction accuracy of the transfer learning model. To address these problems, this article proposes a software defect prediction method based on stable learning (SDP-SL) that combines code visualization techniques and residual networks. The method first transforms code files into code images using code visualization techniques and then constructs a defect prediction model based on these code images. During model training, target project data are not required as prior knowledge. Following the principles of stable learning, the method dynamically adjusts the weights of source project samples to eliminate dependencies between features, thereby capturing the "invariance mechanism" within the data. This approach explores the genuine relationship between code defect features and labels, thereby enhancing defect prediction performance. To evaluate the performance of SDP-SL, comparative experiments were conducted on 10 open-source projects in the PROMISE dataset. The experimental results demonstrate that, in terms of the F-measure, the proposed SDP-SL method outperformed other within-project defect prediction methods by 2.11%-44.03%. In cross-project defect prediction, the SDP-SL method provided an improvement of 5.89%-25.46% in prediction performance compared to other cross-project defect prediction methods. Therefore, SDP-SL can effectively enhance both within-project and cross-project defect prediction.
With the development of information technology, a large amount of product quality data is accumulated over the entire manufacturing process, but it is not explored and used effectively. Traditional product quality prediction models have many disadvantages, such as high complexity and low accuracy. To overcome these problems, we propose an optimized data equalization method to pre-process the dataset and design a simple but effective product quality prediction model: a radial basis function model optimized by the firefly algorithm with Levy flight mechanism (RBFFALM). First, the new data equalization method is introduced to pre-process the dataset, which reduces the dimension of the data, removes redundant features, and improves the data distribution. Then the RBFFALFM is used to predict product quality. Comprehensive experiments conducted on real-world product quality datasets validate that the new model RBFFALFM, combined with the new data pre-processing method, outperforms previous methods in predicting product quality.
Landslides are destructive natural disasters that cause catastrophic damage and loss of life worldwide. Accurately predicting landslide displacement enables effective early warning and risk management. However, the limited availability of on-site measurement data has been a substantial obstacle to developing data-driven models, such as state-of-the-art machine learning (ML) models. To address this challenge, this study proposes a data augmentation framework that uses generative adversarial networks (GANs), a recent advance in generative artificial intelligence (AI), to improve the accuracy of landslide displacement prediction. The framework provides effective data augmentation to enhance limited datasets. A recurrent GAN model, RGAN-LS, is proposed, specifically designed to generate realistic synthetic multivariate time series that mimic the characteristics of real landslide on-site measurement data. A customized moment-matching loss is incorporated in addition to the adversarial loss during the training of RGAN-LS to capture the temporal dynamics and correlations in real time series data. The synthetic data generated by RGAN-LS are then used to enhance the training of long short-term memory (LSTM) networks and particle swarm optimization-support vector machine (PSO-SVM) models for landslide displacement prediction tasks. Results on two landslides in the Three Gorges Reservoir (TGR) region show a significant improvement in LSTM model prediction performance when trained on augmented data. For instance, in the case of the Baishuihe landslide, the average root mean square error (RMSE) improves by 16.11%, and the mean absolute error (MAE) by 17.59%. More importantly, the model's responsiveness during mutational stages is enhanced for early warning purposes. However, the results show that the static PSO-SVM model sees only marginal gains compared to recurrent models such as LSTM. Further analysis indicates that an optimal synthetic-to-real data ratio (50% in the illustrated cases) maximizes the improvements. This also demonstrates the robustness and effectiveness of supplementing training data for dynamic models to obtain better results. By using the powerful generative AI approach, RGAN-LS can generate high-fidelity synthetic landslide data. This is critical for improving the performance of advanced ML models in predicting landslide displacement, particularly when training data are limited. Additionally, this approach has the potential to expand the use of generative AI in geohazard risk management and other research areas.
Floods are one of the most serious natural disasters and can cause huge societal and economic losses. Extensive research has been conducted on topics such as flood monitoring, prediction, and loss estimation. In these research fields, flood velocity plays a crucial role and is an important factor influencing the reliability of the outcomes. Traditional methods rely on physical models for flood simulation and prediction; they can generate accurate results but often take a long time. Deep learning technology has recently shown significant potential in the same field, especially in terms of efficiency, helping to overcome the long computation times associated with traditional methods. This study explores the potential of deep learning models in predicting flood velocity. More specifically, we use a Multi-Layer Perceptron (MLP) model, a specific type of Artificial Neural Network (ANN), to predict the velocity in a test area of the Lundesokna River in Norway with diverse terrain conditions. Geographic data and flood velocity simulated with a physical hydraulic model are used for the pre-training, optimization, and testing of the MLP model. Our experiment indicates that the MLP model can predict flood velocity under the diverse terrain conditions of the river with acceptable accuracy against the simulated velocity results, but with a significant decrease in training and testing time. We also discuss the limitations of the approach and directions for improvement in future work.
BACKGROUND: Cancer patients often suffer from severe psychological stress reactions, such as anxiety and depression. Prostate cancer (PC) is one of the common cancer types, with most patients diagnosed at advanced stages that cannot be treated by radical surgery and that are accompanied by complications such as bodily pain and bone metastasis. Therefore, attention should be given to the mental health status of PC patients as well as to physical adverse events in the course of clinical treatment.
AIM: To analyze the risk factors leading to anxiety and depression in PC patients after castration and build a risk prediction model.
METHODS: A retrospective analysis was performed on the data of 120 PC cases treated in Xi'an People's Hospital between January 2019 and January 2022. The patient cohort was divided into a training group (n = 84) and a validation group (n = 36) at a ratio of 7:3. The patients' anxiety symptoms and depression levels were assessed 2 weeks after surgery with the Self-Rating Anxiety Scale (SAS) and the Self-Rating Depression Scale (SDS), respectively. Logistic regression was used to analyze the risk factors affecting negative mood, and a risk prediction model was constructed.
RESULTS: In the training group, 35 patients and 37 patients had an SAS score and an SDS score greater than or equal to 50, respectively. Based on the scores, we further subclassified patients into two groups: a bad mood group (n = 35) and an emotional stability group (n = 49). Multivariate logistic regression analysis showed that marital status, castration scheme, and postoperative Visual Analogue Scale (VAS) score were independent risk factors affecting a patient's bad mood (P < 0.05). In the training and validation groups, patients with adverse emotions exhibited significantly higher risk scores than emotionally stable patients (P < 0.0001). The area under the curve (AUC) of the risk prediction model for predicting bad mood in the training group was 0.743, the specificity was 70.96%, and the sensitivity was 66.03%, while in the validation group, the AUC, specificity, and sensitivity were 0.755, 66.67%, and 76.19%, respectively. The Hosmer-Lemeshow test showed a χ^(2) of 4.2856, a P value of 0.830, and a C-index of 0.773 (0.692-0.854). The calibration curve revealed that the predicted curve was basically consistent with the actual curve, indicating that the prediction model had good discrimination and accuracy. Decision curve analysis showed that the model had a high net benefit.
CONCLUSION: In PC patients, marital status, castration scheme, and postoperative pain (VAS) score are important factors affecting postoperative anxiety and depression. The logistic regression model can be used to successfully predict the risk of adverse psychological emotions.
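For readers less familiar with the reported metrics, the sketch below shows how sensitivity, specificity, and AUC are computed from true labels and model outputs. It uses the standard definitions (AUC as the fraction of positive-negative pairs ranked correctly, with ties counted as one half); the function names and the toy labels and scores are hypothetical and unrelated to the study data.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels coded 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def auc(y_true, scores):
    """AUC as the share of positive-negative pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a correct ranking
    return wins / (len(pos) * len(neg))


# Toy example: 1 = bad mood, 0 = emotionally stable (hypothetical data).
labels = [1, 1, 1, 0, 0, 0, 0]
risk = [0.9, 0.8, 0.3, 0.1, 0.4, 0.2, 0.85]
predicted = [1 if r >= 0.5 else 0 for r in risk]
sens, spec = sensitivity_specificity(labels, predicted)
area = auc(labels, risk)
```

Sensitivity and specificity depend on the chosen cut-off (0.5 here, an arbitrary choice), whereas AUC summarizes ranking quality across all cut-offs.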
Predicting landslide displacement is of utmost practical importance, as landslides can pose serious threats to both human life and property. However, traditional methods select the sliding window arbitrarily and seldom incorporate weather forecast data for displacement prediction, while a single structural model cannot handle input sequences of different lengths at the same time. To address these limitations, this study proposes a new approach that utilizes weather forecast data and incorporates the maximum information coefficient (MIC), a long short-term memory network (LSTM), and an attention mechanism to establish a teacher-student coupling model with a parallel structure for short-term landslide displacement prediction. Through MIC, a suitable input sequence length is selected for the LSTM model. To investigate the influence of rainfall on landslides during different seasons, a parallel teacher-student coupling model is developed that is able to learn sequential information from various time series of different lengths. The teacher model learns sequence information from rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as the China Meteorological Administration (CMA) and Reliable Prognosis (https://rp5.ru) to improve the model's expression capability, and the student model learns sequence information from other time series. An attention module is then designed to integrate the different sequence information and derive a context vector that represents the seasonal temporal attention mode. Finally, the predicted displacement is obtained through a linear layer. The proposed method achieves higher prediction accuracy than the support vector machine (SVM), LSTM, recurrent neural network (RNN), temporal convolutional network (TCN), and LSTM-Attention models. It achieves a mean absolute error (MAE) of 0.072 mm, a root mean square error (RMSE) of 0.096 mm, and a Pearson correlation coefficient (PCC) of 0.85. Additionally, it exhibits enhanced prediction stability and interpretability, making it a useful tool for landslide disaster prevention and mitigation.
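For reference, the three error measures quoted in this abstract (MAE, RMSE, and the Pearson correlation coefficient) follow their standard definitions. The sketch below is only illustrative; the function names are chosen here and do not come from the paper.

```python
import math


def mae(observed, predicted):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def rmse(observed, predicted):
    """Root mean square error."""
    return math.sqrt(
        sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    )


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)
```

MAE and RMSE carry the unit of the displacement (mm here), while the correlation coefficient is dimensionless.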
We read with interest the recent systematic review "Artificial intelligence and machine learning for hemorrhagic trauma care" by Peng et al. [1], which evaluated literature on machine learning (ML) in the management of traumatic haemorrhage. We thank the authors for their contribution to the role of ML in trauma.
Background: Choosing the appropriate antipsychotic drug (APD) treatment for patients with schizophrenia (SCZ) can be challenging, as the treatment response to APD is highly variable and difficult to predict due to the lack of effective biomarkers. Previous studies have indicated an association between treatment response and genetic and epigenetic factors, but no effective biomarkers have been identified. Hence, further research is imperative to enhance precision medicine in SCZ treatment. Methods: Participants with SCZ were recruited from two randomized trials. The discovery cohort was recruited from the CAPOC trial (n=2307), which involved 6 weeks of treatment and equally randomized the participants to the Olanzapine, Risperidone, Quetiapine, Aripiprazole, Ziprasidone, and Haloperidol/Perphenazine (subsequently equally assigned to one or the other) groups. The external validation cohort was recruited from the CAPEC trial (n=1379), which involved 8 weeks of treatment and equally randomized the participants to the Olanzapine, Risperidone, and Aripiprazole groups. Additionally, healthy controls (n=275) from the local community were utilized as a genetic/epigenetic reference. The genetic and epigenetic (DNA methylation) risks of SCZ were assessed using the polygenic risk score (PRS) and polymethylation score, respectively. The study also examined the genetic-epigenetic interactions with treatment response through differential methylation analysis, methylation quantitative trait loci, colocalization, and promoter-anchored chromatin interaction. Machine learning was used to develop a prediction model for treatment response, which was evaluated for accuracy and clinical benefit using the area under the curve (AUC) for classification, R² for regression, and decision curve analysis. Results: Six risk genes for SCZ (LINC01795, DDHD2, SBNO1, KCNG2, SEMA7A, and RUFY1) involved in cortical morphology were identified as having a genetic-epigenetic interaction associated with treatment response. The developed and externally validated prediction model, which incorporated clinical information, PRS, genetic risk score (GRS), and proxy methylation level (proxyDNAm), demonstrated positive benefits for a wide range of patients receiving different APDs, regardless of sex [discovery cohort: AUC=0.874 (95% CI 0.867-0.881), R²=0.478; external validation cohort: AUC=0.851 (95% CI 0.841-0.861), R²=0.507]. Conclusions: This study presents a promising precision medicine approach to evaluate treatment response, which has the potential to aid clinicians in making informed decisions about APD treatment for patients with SCZ. Trial registration: Chinese Clinical Trial Registry (https://www.chictr.org.cn/), 18 Aug 2009, retrospectively registered: CAPOC-ChiCTR-RNC-09000521 (https://www.chictr.org.cn/showproj.aspx?proj=9014), CAPEC-ChiCTR-RNC-09000522 (https://www.chictr.org.cn/showproj.aspx?proj=9013).
Funding: the National Key R&D Program of China (No. 2021YFB3701705).
Abstract: This work constructed a machine learning (ML) model to predict the atmospheric corrosion rate of low-alloy steels (LAS). The material properties of LAS, environmental factors, and exposure time were used as the input, and the corrosion rate as the output. Six different ML algorithms were used to construct the proposed model. Through optimization and filtering, the eXtreme gradient boosting (XGBoost) model exhibited good corrosion rate prediction accuracy. The features of material properties were then transformed into atomic and physical features using the proposed property transformation approach, and the dominant descriptors that affected the corrosion rate were filtered using the recursive feature elimination (RFE) and XGBoost methods. The established ML models exhibited better prediction performance and generalization ability with the property transformation descriptors. In addition, the SHapley Additive exPlanations (SHAP) method was applied to analyze the relationship between the descriptors and the corrosion rate. The results showed that the property transformation model could effectively help with analyzing the corrosion behavior, thereby significantly improving the generalization ability of corrosion rate prediction models.
Funding: funded by the National Natural Science Foundation of China (General Program: No. 52074314, No. U19B6003-05) and the National Key Research and Development Program of China (2019YFA0708303-05).
Abstract: Accurate prediction of formation pore pressure is essential for predicting fluid flow and managing hydrocarbon production in petroleum engineering. Deep learning techniques have recently received growing interest because of their potential for pore pressure prediction. However, most traditional deep learning models generalize poorly. To fill this technical gap, in this work we developed a new adaptive physics-informed deep learning model with high generalization capability to predict pore pressure values directly from seismic data. Specifically, the new model, named CGP-NN, consists of a novel parametric feature extraction approach (1DCPP), a stacked multilayer gated recurrent model (multilayer GRU), and an adaptive physics-informed loss function. Through training, the developed model can automatically select the optimal physical model to constrain the results for each pore pressure prediction. The CGP-NN model generalizes best when the physics-related metric λ = 0.5. A hybrid approach combining the Eaton and Bowers methods is also proposed to build machine-learnable labels and thus address the scarcity of labels. To validate the developed model and methodology, a case study on a complex reservoir in the Tarim Basin was performed, demonstrating high accuracy in predicting pore pressure for new wells along with strong generalization ability. The adaptive physics-informed deep learning approach presented here has potential application in the prediction of pore pressures coupled with multiple genesis mechanisms using seismic data.
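As background for the Eaton method named in this abstract, the sketch below evaluates the widely used sonic form of Eaton's relation, P_p = S - (S - P_hyd) * (Δt_normal / Δt_observed)^n, with the conventional exponent n = 3. It is an illustration only: the abstract does not state which form or exponent the authors used, the function name and argument names are hypothetical, and the Bowers method and the hybrid labelling scheme are not reproduced here.

```python
def eaton_pore_pressure(overburden, hydrostatic, dt_normal, dt_observed,
                        exponent=3.0):
    """Eaton's sonic relation for pore pressure.

    overburden, hydrostatic: stresses/pressures in the same unit (e.g. MPa).
    dt_normal: sonic transit time on the normal compaction trend.
    dt_observed: measured sonic transit time at the same depth.
    exponent: Eaton exponent; 3.0 is the conventional value for sonic data
              (an assumption here, usually calibrated per basin).
    """
    ratio = dt_normal / dt_observed
    return overburden - (overburden - hydrostatic) * ratio ** exponent
```

When the observed transit time equals the normal trend, the function returns the hydrostatic pressure; slower-than-normal transit times (under-compaction) give pore pressures above hydrostatic.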
Funding: supported by the National Natural Science Foundation of China (Grant No. 52308340), the Innovative Projects of Universities in Guangdong (Grant No. 2022KTSCX208), and the Sichuan Transportation Science and Technology Project (Grant No. 2018-ZL-01).
Abstract: Historically, landslides have been the primary type of geological disaster worldwide. Generally, the stability of reservoir banks is primarily affected by rainfall and reservoir water level fluctuations. Moreover, the stability of reservoir banks changes with the long-term dynamics of external disaster-causing factors. Thus, assessing the time-varying reliability of reservoir landslides remains a challenge. In this paper, a machine learning (ML) based approach is proposed to analyze the long-term reliability of reservoir bank landslides in spatially variable soils through time series prediction. This study systematically investigated the prediction performance of three ML algorithms, i.e. multilayer perceptron (MLP), convolutional neural network (CNN), and long short-term memory (LSTM). Additionally, the effects of the data quantity and data ratio on the predictive power of deep learning models are considered. The results show that all three ML models can accurately depict the changes in the time-varying failure probability of reservoir landslides. The CNN model outperforms both the MLP and LSTM models in predicting the failure probability. Furthermore, selecting the right data ratio can improve the prediction accuracy of the failure probability obtained by ML models.
Funding: supported by the National Key R&D Program of China (Grant No. 2017YFC1501604) and the National Natural Science Foundation of China (Grant Nos. 41875114 and 41875057).
Abstract: Accurate prediction of tropical cyclone (TC) intensity is challenging due to the complex physical processes involved. Here, we introduce a new TC intensity prediction scheme for the western North Pacific (WNP) based on a time-dependent theory of TC intensification, termed the energetically based dynamical system (EBDS) model, together with the use of a long short-term memory (LSTM) neural network. In the time-dependent theory, TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors, expressed as environmental dynamical efficiency. The LSTM neural network is used to predict the environmental dynamical efficiency in the EBDS model and is trained using best-track TC data and global reanalysis data during 1982–2017. Transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System (GFS) of the National Centers for Environmental Prediction during 2017–21. The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity. The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data. The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration (CMA) and those by other state-of-the-art statistical and dynamical forecast systems, except for the 72-h forecast. Particularly at the longer lead times of 96 h and 120 h, the new scheme has smaller forecast errors, with an improvement of more than 30% over the official forecasts.
Funding: supported by the National Natural Science Foundation of China (Grant No. 42004030), the Basic Scientific Fund for National Public Research Institutes of China (Grant No. 2022S03), the Science and Technology Innovation Project (LSKJ202205102) funded by Laoshan Laboratory, and the National Key Research and Development Program of China (2020YFB0505805).
Abstract: The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean. Among the crucial hydroacoustic environmental parameters, ocean sound velocity exhibits significant spatial and temporal variability and is highly relevant to oceanic research. In this study, we propose a new data-driven approach, leveraging deep learning techniques, for the prediction of sound velocity fields (SVFs). Our novel spatiotemporal prediction model, ST-LSTM-SA, combines Spatiotemporal Long Short-Term Memory (ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs. To circumvent the limited amount of observational data, we employ transfer learning by first training the model using reanalysis datasets and then fine-tuning it using in-situ analysis data to obtain the final prediction model. Using the SVFs of the previous 12 months as input, our model predicts the SVFs for the subsequent three months. We compare the performance of five models: Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Convolutional LSTM (ConvLSTM), ST-LSTM, and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022. Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions. The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field (SVF), but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.
Funding: financially supported by the National Key Research and Development Program of China (2022YFB3706800, 2020YFB1710100) and the National Natural Science Foundation of China (51821001, 52090042, 52074183).
Abstract: The complexity of the sand-casting process, combined with the interactions between process parameters, makes it difficult to control casting quality, resulting in a high scrap rate. A strategy based on a data-driven model was proposed to reduce casting defects and improve production efficiency; it includes a random forest (RF) classification model, feature importance analysis, and process parameter optimization with Monte Carlo simulation. The collected data, which include four types of defects and the corresponding process parameters, were used to construct the RF model. Classification results show a recall rate above 90% for all categories. The Gini index was used to assess the importance of the process parameters in the formation of various defects in the RF model. Finally, the classification model was applied to different production conditions for quality prediction. In the case of process parameter optimization for gas porosity defects, this model serves as the experimental process in the Monte Carlo method to estimate a better temperature distribution. The prediction model, when applied in the factory, greatly improved the efficiency of defect detection. Results show that the scrap rate decreased from 10.16% to 6.68%.
Funding: the National Natural Science Foundation of China (Grant Nos. 42377164 and 52079062) and the Interdisciplinary Innovation Fund of Natural Science, Nanchang University (Grant No. 9167-28220007-YB2107).
Abstract: The accuracy of landslide susceptibility prediction (LSP) mainly depends on the precision of the landslide spatial position. However, spatial position errors in landslide surveys are inevitable, resulting in considerable uncertainties in LSP modeling. To overcome this drawback, this study explores the influence of landslide spatial position errors on LSP uncertainties, and then proposes a semi-supervised machine learning model to reduce the landslide spatial position error. Taking Shangyou County, China, as an example, this paper collected 16 environmental factors and 337 landslides with accurate spatial positions. The 30–110 m error-based multilayer perceptron (MLP) and random forest (RF) models for LSP are established by randomly offsetting the original landslides by 30, 50, 70, 90, and 110 m. The LSP uncertainties are analyzed through the LSP accuracy and distribution characteristics. Finally, a semi-supervised model is proposed to reduce the LSP uncertainties. Results show that: (1) the LSP accuracies of the error-based RF/MLP models decrease as the landslide position errors increase, and are lower than those of the original data-based models; (2) the 70 m error-based models can still reflect the overall distribution characteristics of landslide susceptibility indices, so original landslides with certain position errors are acceptable for LSP; (3) the semi-supervised machine learning model can efficiently reduce the landslide position errors and thus improve the LSP accuracies.
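The random offsetting step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: it assumes projected coordinates in metres, a uniformly random bearing, and an exact offset distance; `offset_point` and the sample coordinates are hypothetical.

```python
import math
import random


def offset_point(x, y, distance_m, rng):
    """Move a point by exactly distance_m in a uniformly random direction.

    x, y: projected coordinates in metres (assumption).
    rng: a random.Random instance, passed in so results are reproducible.
    """
    bearing = rng.uniform(0.0, 2.0 * math.pi)
    return (x + distance_m * math.cos(bearing),
            y + distance_m * math.sin(bearing))


# Build one perturbed inventory per error level used in the study.
rng = random.Random(42)
landslides = [(500000.0, 2850000.0), (500350.0, 2850900.0)]  # hypothetical
perturbed = {
    d: [offset_point(x, y, d, rng) for x, y in landslides]
    for d in (30, 50, 70, 90, 110)
}
```

Each perturbed inventory would then be used to train a separate error-based model, so that accuracy can be compared against the model trained on the original positions.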
Funding: supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (JP22H03643), Japan Science and Technology Agency (JST) Support for Pioneering Research Initiated by the Next Generation (SPRING) (JPMJSP2145), JST Through the Establishment of University Fellowships Towards the Creation of Science Technology Innovation (JPMJFS2115), the National Natural Science Foundation of China (52078382), and the State Key Laboratory of Disaster Reduction in Civil Engineering (CE19-A-01).
Abstract: Accurately predicting fluid forces acting on the surface of a structure is crucial in engineering design. However, this task becomes particularly challenging in turbulent flow, due to the complex and irregular changes in the flow field. In this study, we propose a novel deep learning method, named mapping network-coordinated stacked gated recurrent units (MSU), for predicting pressure on a circular cylinder from velocity data. Specifically, our coordinated learning strategy is designed to extract the most critical velocity point for prediction, a process that has not been explored before. In our experiments, MSU extracts one point from a velocity field containing 121 points and utilizes this point to accurately predict 100 pressure points on the cylinder. This method significantly reduces the workload of data measurement in practical engineering applications. Our experimental results demonstrate that MSU predictions are highly similar to the real turbulent data in both spatio-temporal and individual aspects. Furthermore, the comparison results show that MSU predicts more precise results, even outperforming models that use all velocity field points. Compared with state-of-the-art methods, MSU has an average improvement of more than 45% in various indicators such as root mean square error (RMSE). Through physical verification, we established that MSU's prediction results closely align with pressure field data obtained in real turbulence fields. This confirmation underscores the considerable potential of MSU for practical applications in real engineering scenarios. The code is available at https://github.com/zhangzm0128/MSU.
Funding: supported by the National Natural Science Foundation of China (Grant Nos. 41976193 and 42176243).
Abstract: In recent years, deep learning methods have gradually been applied to prediction tasks related to Arctic sea ice concentration, but relatively little research has been conducted for larger spatial and temporal scales, mainly due to the limited time coverage of observations and reanalysis data. Meanwhile, deep learning predictions of sea ice thickness (SIT) have yet to receive ample attention. In this study, two data-driven deep learning (DL) models are built based on the ConvLSTM and fully convolutional U-net (FC-Unet) algorithms, trained using CMIP6 historical simulations for transfer learning, and fine-tuned using reanalysis/observations. These models enable monthly predictions of Arctic SIT without considering the complex physical processes involved. Through comprehensive assessments of prediction skill by season and region, the results suggest that using a broader set of CMIP6 data for transfer learning, as well as incorporating multiple climate variables as predictors, contributes to better prediction results, although both DL models can effectively predict the spatiotemporal features of SIT anomalies. Regarding the predicted SIT anomalies of the FC-Unet model, the spatial correlations with reanalysis reach an average level of 89% over all months, while the temporal anomaly correlation coefficients are close to unity in most cases. The models also demonstrate robust performance in predicting SIT and SIE during extreme events. The effectiveness and reliability of the proposed deep transfer learning models in predicting Arctic SIT can facilitate more accurate pan-Arctic predictions, aiding climate change research and real-time business applications.
Funding: This work is funded by the National Natural Science Foundation of China (Grant Nos. 42377164 and 52079062) and the National Science Fund for Distinguished Young Scholars of China (Grant No. 52222905).
Abstract: In existing landslide susceptibility prediction (LSP) models, the influence of random errors in landslide conditioning factors on LSP is not considered; instead, the original conditioning factors are taken directly as the model inputs, which brings uncertainties to the LSP results. This study aims to reveal how different proportions of random errors in conditioning factors influence LSP uncertainties, and to explore a method that can effectively reduce the random errors in conditioning factors. The original conditioning factors are first used to construct original factors-based LSP models, and then random errors of 5%, 10%, 15%, and 20% are added to these original factors to construct the corresponding errors-based LSP models. Secondly, low-pass filter-based LSP models are constructed by eliminating the random errors using a low-pass filter method. Thirdly, Ruijin County, China, with 370 landslides and 16 conditioning factors, is used as the study case. Three typical machine learning models, i.e. multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), are selected as LSP models. Finally, the LSP uncertainties are discussed, and the results show that: (1) The low-pass filter can effectively reduce the random errors in conditioning factors and thus decrease the LSP uncertainties. (2) As the proportion of random errors increases from 5% to 20%, the LSP uncertainty increases continuously. (3) The original factors-based models are feasible for LSP in the absence of more accurate conditioning factors. (4) The influence degrees of two uncertainty issues, machine learning models and different proportions of random errors, on the LSP modeling are large and basically the same. (5) The Shapley values effectively explain the internal mechanism by which machine learning models predict landslide susceptibility. In conclusion, a greater proportion of random errors in conditioning factors results in higher LSP uncertainty, and a low-pass filter can effectively reduce these random errors.
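The two operations this abstract relies on, adding proportional random errors and then low-pass filtering them, can be sketched in one dimension. This is a simplified illustration, not the authors' method: the abstract does not specify the error distribution or the filter, so a uniform error within ±p of each value and a centred moving average (a basic low-pass filter) are assumed, and real conditioning factors are 2-D rasters rather than 1-D lists.

```python
import random


def add_proportional_error(values, proportion, rng):
    """Perturb each value by a uniform random error within +/- proportion.

    proportion: e.g. 0.05 for a 5% error level (assumed uniform here).
    """
    return [v * (1.0 + rng.uniform(-proportion, proportion)) for v in values]


def moving_average(values, window=3):
    """Centred moving average; the window shrinks at the edges."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed
```

A typical use would be `moving_average(add_proportional_error(factor, 0.20, random.Random(1)))`, after which the filtered factor is compared with the original to see how much of the injected error was removed.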
Funding: supported by the National Natural Science Foundation of China, China (No. 42004016), the HuBei Natural Science Fund, China (No. 2020CFB329), the HuNan Natural Science Fund, China (No. 2023JJ60559, 2023JJ60560), and the State Key Laboratory of Geodesy and Earth's Dynamics self-deployment project, China (No. S21L6101).
Abstract: Short-term (up to 30 days) predictions of Earth Rotation Parameters (ERPs) such as Polar Motion (PM: PMX and PMY) play an essential role in real-time applications related to high-precision reference frame conversion. Currently, the least squares (LS) + auto-regressive (AR) hybrid method is one of the main techniques for PM prediction. In addition, the weighted LS+AR hybrid method performs well for short-term PM prediction. However, the corresponding covariance information of the LS fitting residuals deserves further exploration in the AR model. In this study, we have derived a modified stochastic model for the LS+AR hybrid method, namely the weighted LS + weighted AR hybrid method. Using the PM data products of IERS EOP 14 C04, the numerical results indicate that for short-term PM forecasting, the proposed weighted LS + weighted AR hybrid method shows an advantage over both the LS+AR hybrid method and the weighted LS+AR hybrid method. Compared to the mean absolute errors (MAEs) of PMX/PMY short-term prediction of the LS+AR hybrid method and the weighted LS+AR hybrid method, the weighted LS + weighted AR hybrid method shows average improvements of 6.61%/12.08% and 0.24%/11.65%, respectively. Moreover, judging by the slopes of the linear regression lines fitted to the errors of each method, the prediction error of the proposed method grows more slowly than that of the other two methods.
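The basic LS+AR idea, fitting a deterministic model by least squares and then modelling the residuals with an auto-regressive process, can be sketched in a heavily simplified form. The assumptions are substantial and are not the paper's: the deterministic part here is only a straight line (PM models normally also include periodic terms such as the annual and Chandler wobbles), the AR order is fixed at 1, and no weighting is applied. All function names are hypothetical.

```python
def fit_line(t, y):
    """Ordinary least-squares fit of y = slope * t + intercept."""
    n = len(t)
    mt = sum(t) / n
    my = sum(y) / n
    slope = (sum((a - mt) * (b - my) for a, b in zip(t, y))
             / sum((a - mt) ** 2 for a in t))
    return slope, my - slope * mt


def fit_ar1(residuals):
    """Least-squares estimate of phi in r[i] = phi * r[i-1] + noise."""
    num = sum(residuals[i] * residuals[i - 1]
              for i in range(1, len(residuals)))
    den = sum(r * r for r in residuals[:-1])
    return num / den if den else 0.0


def ls_ar_forecast(y, steps):
    """Extrapolate the LS line and add the AR(1)-propagated residual."""
    t = list(range(len(y)))
    slope, intercept = fit_line(t, y)
    residuals = [b - (slope * a + intercept) for a, b in zip(t, y)]
    phi = fit_ar1(residuals)
    forecasts = []
    last = residuals[-1]
    for k in range(1, steps + 1):
        last = phi * last  # AR(1) prediction of the next residual
        forecasts.append(slope * (len(y) - 1 + k) + intercept + last)
    return forecasts
```

The weighted variants discussed in the abstract would replace the two unweighted fits with weighted least-squares estimates; that step is not shown here.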
Abstract: The Bozhong Sag is the largest petroliferous sag in the Bohai Bay Basin, and the source rocks of the Paleogene Dongying and Shahejie Formations are deeply buried. Most wells were drilled on structural highs, and few of them encountered good-quality source rocks, so it is difficult to evaluate the source rocks in the study area precisely by geochemical analysis alone. Based on Rock-Eval pyrolysis and total organic carbon (TOC) testing, the organic matter (OM) abundance of the Paleogene source rocks in the southwestern Bozhong Sag was evaluated, including the lower part of the second member of the Dongying Formation (E₃d₂L), the third member of the Dongying Formation (E₃d₃), the first and second members of the Shahejie Formation (E₂s₁₊₂), and the third member of the Shahejie Formation (E₂s₃). The results indicate that E₂s₁₊₂ and E₂s₃ have the better hydrocarbon generative potential with the highest OM abundance, E₃d₃ is of the second-best quality, and E₃d₂L has poor to fair hydrocarbon generative potential. Furthermore, well logs were applied to predict TOC and residual hydrocarbon generation potential (S₂) based on the sedimentary facies classification, using the ΔlogR, generalized ΔlogR, logging multiple linear regression, and BP neural network methods. The various methods were compared, and the BP neural network method has relatively better prediction accuracy. Based on the pre-stack simultaneous inversion (P-wave impedance, P-wave velocity, and density inversion results) and the post-stack seismic attributes, three-dimensional (3D) seismic prediction of TOC and S₂ was carried out. The results show that the near-well seismic predictions of TOC and S₂ based on seismic multi-attribute analysis correspond well with the results of the well logging methods, and the plane prediction results are consistent with the sedimentary facies map of the study area. The TOC and S₂ values of E₂s₁₊₂ and E₂s₃ are higher than those of E₃d₃ and E₃d₂L, basically consistent with the geochemical analysis results. This method compensates for the limitations of geochemical methods by establishing the connection between geophysical information and geochemical data, and it is helpful for the 3D quantitative prediction and evaluation of high-quality source rocks in areas where drilling is limited.
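The ΔlogR method named in this abstract is commonly applied in the form published by Passey et al.: ΔlogR = log10(R / R_baseline) + 0.02 × (Δt − Δt_baseline), with sonic transit time in μs/ft, and TOC = ΔlogR × 10^(2.297 − 0.1688 × LOM), where LOM is the level of organic metamorphism. The sketch below evaluates that published form only; whether the authors used these exact constants, and how their "generalized ΔlogR" differs, is not stated in the abstract, and the function names are hypothetical.

```python
import math


def delta_log_r(resistivity, sonic, r_baseline, sonic_baseline):
    """Separation between the resistivity and sonic curves (Passey form).

    resistivity, r_baseline: ohm-m; sonic, sonic_baseline: us/ft.
    Baselines are read in a non-source, fine-grained interval.
    """
    return (math.log10(resistivity / r_baseline)
            + 0.02 * (sonic - sonic_baseline))


def toc_from_delta_log_r(dlr, lom):
    """TOC (wt%) from delta-log-R and level of organic metamorphism (LOM)."""
    return dlr * 10 ** (2.297 - 0.1688 * lom)
```

At the baseline the separation is zero and so is the predicted TOC; a tenfold resistivity increase at unchanged sonic gives ΔlogR = 1, and the TOC then depends only on the assumed maturity (LOM).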
Funding: supported by the National Natural Science Foundation of China (Grant No. 61867004) and the Youth Fund of the National Natural Science Foundation of China (Grant No. 41801288).
Abstract: The purpose of software defect prediction is to identify defect-prone code modules to assist software quality assurance teams with the appropriate allocation of resources and labor. In previous software defect prediction studies, transfer learning was effective in solving the problem of inconsistent project data distribution. However, target projects often lack sufficient data, which affects the performance of the transfer learning model. In addition, the presence of uncorrelated features between projects can decrease the prediction accuracy of the transfer learning model. To address these problems, this article proposes a software defect prediction method based on stable learning (SDP-SL) that combines code visualization techniques and residual networks. This method first transforms code files into code images using code visualization techniques and then constructs a defect prediction model based on these code images. During the model training process, target project data are not required as prior knowledge. Following the principles of stable learning, this article dynamically adjusts the weights of source project samples to eliminate dependencies between features, thereby capturing the "invariance mechanism" within the data. This approach explores the genuine relationship between code defect features and labels, thereby enhancing defect prediction performance. To evaluate the performance of SDP-SL, this article conducted comparative experiments on 10 open-source projects in the PROMISE dataset. The experimental results demonstrated that, in terms of the F-measure, the proposed SDP-SL method outperformed other within-project defect prediction methods by 2.11%-44.03%. In cross-project defect prediction, the SDP-SL method provided an improvement of 5.89%-25.46% in prediction performance compared to other cross-project defect prediction methods. Therefore, SDP-SL can effectively enhance within- and cross-project defect predictions.
Funding: supported by the National Science and Technology Innovation 2030 Next-Generation Artificial Intelligence Major Project (2018AAA0101801) and the National Natural Science Foundation of China (72271188).
Abstract: With the development of information technology, a large amount of product quality data is accumulated across the entire manufacturing process, but it is not explored and used effectively. Traditional product quality prediction models have many disadvantages, such as high complexity and low accuracy. To overcome these problems, we propose an optimized data equalization method to pre-process the dataset and design a simple but effective product quality prediction model: a radial basis function model optimized by the firefly algorithm with Levy flight mechanism (RBFFALFM). First, the new data equalization method is introduced to pre-process the dataset, which reduces the dimension of the data, removes redundant features, and improves the data distribution. Then RBFFALFM is used to predict product quality. Comprehensive experiments conducted on real-world product quality datasets show that the new RBFFALFM model, combined with the new data pre-processing method, outperforms previous methods in predicting product quality.
Funding: supported by the Natural Science Foundation of Jiangsu Province (Grant No. BK20220421), the State Key Program of the National Natural Science Foundation of China (Grant No. 42230702), and the National Natural Science Foundation of China (Grant No. 82302352).
Abstract: Landslides are destructive natural disasters that cause catastrophic damage and loss of life worldwide. Accurately predicting landslide displacement enables effective early warning and risk management. However, the limited availability of on-site measurement data has been a substantial obstacle in developing data-driven models, such as state-of-the-art machine learning (ML) models. To address this challenge, this study proposes a data augmentation framework that uses generative adversarial networks (GANs), a recent advance in generative artificial intelligence (AI), to improve the accuracy of landslide displacement prediction. The framework provides effective data augmentation to enhance limited datasets. A recurrent GAN model, RGAN-LS, is proposed, specifically designed to generate realistic synthetic multivariate time series that mimic the characteristics of real landslide on-site measurement data. A customized moment-matching loss is incorporated in addition to the adversarial loss during the training of RGAN-LS to capture the temporal dynamics and correlations in real time series data. The synthetic data generated by RGAN-LS are then used to enhance the training of long short-term memory (LSTM) networks and particle swarm optimization-support vector machine (PSO-SVM) models for landslide displacement prediction tasks. Results on two landslides in the Three Gorges Reservoir (TGR) region show a significant improvement in LSTM model prediction performance when trained on augmented data. For instance, in the case of the Baishuihe landslide, the average root mean square error (RMSE) improves by 16.11%, and the mean absolute error (MAE) by 17.59%. More importantly, the model's responsiveness during mutational stages is enhanced for early warning purposes. However, the static PSO-SVM model sees only marginal gains compared to recurrent models such as LSTM. Further analysis indicates that an optimal synthetic-to-real data ratio (50% in the illustrated cases) maximizes the improvements. This also demonstrates the robustness and effectiveness of supplementing training data for dynamic models to obtain better results. By using the powerful generative AI approach, RGAN-LS can generate high-fidelity synthetic landslide data. This is critical for improving the performance of advanced ML models in predicting landslide displacement, particularly when training data are limited. Additionally, this approach has the potential to expand the use of generative AI in geohazard risk management and other research areas.
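As an illustration of the moment-matching idea mentioned in the abstract above, the following minimal sketch compares the per-feature mean and standard deviation of a real and a synthetic batch of multivariate time series. The abstract does not give the exact form of the customized loss used in RGAN-LS, so the function names (`feature_moments`, `moment_matching_loss`), the choice of first and second moments, and the absolute-difference penalty are assumptions made for illustration only.

```python
from statistics import fmean, pstdev


def feature_moments(batch):
    """Return per-feature (means, stds) pooled over all series and time steps.

    `batch` is a list of series; each series is a list of time steps;
    each time step is a list of feature values.
    """
    n_features = len(batch[0][0])
    means, stds = [], []
    for k in range(n_features):
        values = [step[k] for series in batch for step in series]
        means.append(fmean(values))
        stds.append(pstdev(values))
    return means, stds


def moment_matching_loss(real, synthetic):
    """Hypothetical moment-matching penalty (an assumption, not the RGAN-LS loss)."""
    real_mean, real_std = feature_moments(real)
    syn_mean, syn_std = feature_moments(synthetic)
    n_features = len(real_mean)
    mean_gap = sum(abs(r - s) for r, s in zip(real_mean, syn_mean)) / n_features
    std_gap = sum(abs(r - s) for r, s in zip(real_std, syn_std)) / n_features
    return mean_gap + std_gap


# Toy data: one series, two time steps, one feature (e.g. displacement).
real_batch = [[[0.0], [2.0]]]
shifted_batch = [[[1.0], [3.0]]]
print(moment_matching_loss(real_batch, real_batch))
print(moment_matching_loss(real_batch, shifted_batch))
```

In a GAN training loop, a term of this kind would typically be added to the generator's adversarial loss with a weighting factor; that weighting is likewise not specified in the abstract.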
Abstract: Floods are one of the most serious natural disasters and can cause huge societal and economic losses. Extensive research has been conducted on topics such as flood monitoring, prediction, and loss estimation. In these research fields, flood velocity plays a crucial role and is an important factor that influences the reliability of the outcomes. Traditional methods rely on physical models for flood simulation and prediction; they can generate accurate results but often take a long time. Deep learning technology has recently shown significant potential in the same field, especially in terms of efficiency, helping to overcome the long computation times associated with traditional methods. This study explores the potential of deep learning models in predicting flood velocity. More specifically, we use a Multi-Layer Perceptron (MLP) model, a specific type of Artificial Neural Network (ANN), to predict the velocity in a test area of the Lundesokna River in Norway with diverse terrain conditions. Geographic data and flood velocity simulated with a physical hydraulic model are used for the pre-training, optimization, and testing of the MLP model. Our experiment indicates that the MLP model can predict flood velocity in the diverse terrain conditions of the river with acceptable accuracy against simulated velocity results, but with a significant decrease in training time and testing time. We also discuss the model's limitations, to be addressed in future work.
Abstract: BACKGROUND: Cancer patients often suffer from severe psychological stress reactions, such as anxiety and depression. Prostate cancer (PC) is one of the common cancer types, with most patients diagnosed at advanced stages that cannot be treated by radical surgery and that are accompanied by complications such as bodily pain and bone metastasis. Therefore, attention should be given to the mental health status of PC patients as well as to physical adverse events in the course of clinical treatment. AIM: To analyze the risk factors leading to anxiety and depression in PC patients after castration and build a risk prediction model. METHODS: A retrospective analysis was performed on the data of 120 PC cases treated in Xi'an People's Hospital between January 2019 and January 2022. The patient cohort was divided into a training group (n=84) and a validation group (n=36) at a ratio of 7:3. The patients' anxiety symptoms and depression levels were assessed 2 weeks after surgery with the Self-Rating Anxiety Scale (SAS) and the Self-Rating Depression Scale (SDS), respectively. Logistic regression was used to analyze the risk factors affecting negative mood, and a risk prediction model was constructed. RESULTS: In the training group, 35 patients and 37 patients had an SAS score and an SDS score greater than or equal to 50, respectively. Based on the scores, we further subclassified patients into two groups: a bad mood group (n=35) and an emotional stability group (n=49). Multivariate logistic regression analysis showed that marital status, castration scheme, and postoperative Visual Analogue Scale (VAS) score were independent risk factors affecting a patient's bad mood (P<0.05). In the training and validation groups, patients with adverse emotions exhibited significantly higher risk scores than emotionally stable patients (P<0.0001). The area under the curve (AUC) of the risk prediction model for predicting bad mood in the training group was 0.743, the specificity was 70.96%, and the sensitivity was 66.03%, while in the validation group, the AUC, specificity, and sensitivity were 0.755, 66.67%, and 76.19%, respectively. The Hosmer-Lemeshow test showed a χ^(2) of 4.2856, a P value of 0.830, and a C-index of 0.773 (0.692-0.854). The calibration curve revealed that the predicted curve was basically consistent with the actual curve, indicating that the prediction model had good discrimination and accuracy. Decision curve analysis showed that the model had a high net benefit. CONCLUSION: In PC patients, marital status, castration scheme, and postoperative pain (VAS) score are important factors affecting postoperative anxiety and depression. The logistic regression model can be used to successfully predict the risk of adverse psychological emotions.
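The evaluation quantities reported in the abstract above (a 7:3 split, sensitivity, specificity, and AUC) can be sketched as follows. This is a minimal example under stated assumptions: the function names, the 0.5 decision threshold, and the toy labels and risk scores are hypothetical and are not taken from the study.

```python
def split_sizes(n_total, train_fraction):
    """Sizes of the training and validation groups for a given split ratio."""
    n_train = round(n_total * train_fraction)
    return n_train, n_total - n_train


def sensitivity_specificity(labels, scores, threshold=0.5):
    """Sensitivity and specificity when score >= threshold counts as positive."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def auc(labels, scores):
    """AUC as the probability that a positive case outranks a negative one."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    # Ties between a positive and a negative score count as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# 120 cases at a 7:3 ratio, as in the abstract (n=84 and n=36).
print(split_sizes(120, 0.7))

# Hypothetical labels (1 = bad mood) and predicted risk scores.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.5, 0.1]
print(sensitivity_specificity(toy_labels, toy_scores))
print(auc(toy_labels, toy_scores))
```

The pairwise AUC shown here is quadratic in the number of cases, which is acceptable for a cohort of this size; a rank-based formulation would be preferable for large datasets.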
Funding: This research work is supported by the Sichuan Science and Technology Program (Grant No. 2022YFS0586), the National Key R&D Program of China (Grant No. 2019YFC1509301), and the National Natural Science Foundation of China (Grant No. 61976046).
Abstract: Predicting landslide displacement is of utmost practical importance, as landslides can pose serious threats to both human life and property. However, traditional methods rely on random selection of the sliding window and seldom incorporate weather forecast data for displacement prediction, while a single structural model cannot handle input sequences of different lengths at the same time. To address these limitations, this study proposes a new approach that utilizes weather forecast data and incorporates the maximum information coefficient (MIC), a long short-term memory (LSTM) network, and an attention mechanism to establish a teacher-student coupling model with a parallel structure for short-term landslide displacement prediction. Through MIC, a suitable input sequence length is selected for the LSTM model. To investigate the influence of rainfall on landslides during different seasons, a parallel teacher-student coupling model is developed that is able to learn sequential information from time series of different lengths. The teacher model learns sequence information from the rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as the China Meteorological Administration (CMA) and Reliable Prognosis (https://rp5.ru) to improve the model's expression capability, and the student model learns sequence information from the other time series. An attention module is then designed to integrate the different sequence information and derive a context vector representing the seasonal temporal attention mode. Finally, the predicted displacement is obtained through a linear layer. The proposed method demonstrates superior prediction accuracy, surpassing that of the support vector machine (SVM), LSTM, recurrent neural network (RNN), temporal convolutional network (TCN), and LSTM-Attention models. It achieves a mean absolute error (MAE) of 0.072 mm, a root mean square error (RMSE) of 0.096 mm, and a Pearson correlation coefficient (PCC) of 0.85. Additionally, it exhibits enhanced prediction stability and interpretability, rendering it an indispensable tool for landslide disaster prevention and mitigation.
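The three accuracy measures quoted in the abstract above (MAE, RMSE, and the Pearson correlation coefficient) follow their standard definitions, which the short sketch below spells out. The displacement values are invented toy numbers used only to exercise the functions; they are not data from the study.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def pearson(y_true, y_pred):
    """Pearson correlation coefficient between observations and predictions."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical observed and predicted displacements (mm).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]
print(mae(observed, predicted))
print(rmse(observed, predicted))
print(pearson(observed, predicted))
```

Because RMSE squares each residual before averaging, it is never smaller than MAE on the same data, which is consistent with the reported 0.096 mm versus 0.072 mm.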
Funding: JMW, RSS, EP, EK, WM, ZBP, and NRMT have received research funding from a precision trauma care research award from the Combat Casualty Care Research Program of the US Army Medical Research and Materiel Command (DM180044).
Abstract: We read with interest the recent systematic review "Artificial intelligence and machine learning for hemorrhagic trauma care" by Peng et al. [1], which evaluated literature on machine learning (ML) in the management of traumatic haemorrhage. We thank the authors for their contribution to the role of ML in trauma.
Funding: Supported by the National Natural Science Foundation of China (81825009, 82071505, 81901358), the Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences (2021-I2MC&T-B-099, 2019-I2M-5–006), the Program of Chinese Institute for Brain Research Beijing (2020-NKX-XM-12), the King's College London-Peking University Health Science Center Joint Institute for Medical Research (BMU2020KCL001, BMU2019LCKXJ012), and the National Key R&D Program of China (2021YFF1201103, 2016YFC1307000).
Abstract: Background: Choosing the appropriate antipsychotic drug (APD) treatment for patients with schizophrenia (SCZ) can be challenging, as the treatment response to APD is highly variable and difficult to predict due to the lack of effective biomarkers. Previous studies have indicated an association between treatment response and genetic and epigenetic factors, but no effective biomarkers have been identified. Hence, further research is imperative to enhance precision medicine in SCZ treatment. Methods: Participants with SCZ were recruited from two randomized trials. The discovery cohort was recruited from the CAPOC trial (n=2307), which involved 6 weeks of treatment and equally randomized the participants to the Olanzapine, Risperidone, Quetiapine, Aripiprazole, Ziprasidone, and Haloperidol/Perphenazine (subsequently equally assigned to one or the other) groups. The external validation cohort was recruited from the CAPEC trial (n=1379), which involved 8 weeks of treatment and equally randomized the participants to the Olanzapine, Risperidone, and Aripiprazole groups. Additionally, healthy controls (n=275) from the local community were utilized as a genetic/epigenetic reference. The genetic and epigenetic (DNA methylation) risks of SCZ were assessed using the polygenic risk score (PRS) and polymethylation score, respectively. The study also examined the genetic-epigenetic interactions with treatment response through differential methylation analysis, methylation quantitative trait loci, colocalization, and promoter-anchored chromatin interaction. Machine learning was used to develop a prediction model for treatment response, which was evaluated for accuracy and clinical benefit using the area under curve (AUC) for classification, R^(2) for regression, and decision curve analysis. Results: Six risk genes for SCZ (LINC01795, DDHD2, SBNO1, KCNG2, SEMA7A, and RUFY1) involved in cortical morphology were identified as having a genetic-epigenetic interaction associated with treatment response. The developed and externally validated prediction model, which incorporated clinical information, PRS, genetic risk score (GRS), and proxy methylation level (proxyDNAm), demonstrated positive benefits for a wide range of patients receiving different APDs, regardless of sex [discovery cohort: AUC=0.874 (95% CI 0.867-0.881), R^(2)=0.478; external validation cohort: AUC=0.851 (95% CI 0.841-0.861), R^(2)=0.507]. Conclusions: This study presents a promising precision medicine approach to evaluate treatment response, which has the potential to aid clinicians in making informed decisions about APD treatment for patients with SCZ. Trial registration: Chinese Clinical Trial Registry (https://www.chictr.org.cn/), 18 Aug 2009, retrospectively registered: CAPOC-ChiCTR-RNC-09000521 (https://www.chictr.org.cn/showproj.aspx?proj=9014), CAPEC-ChiCTR-RNC-09000522 (https://www.chictr.org.cn/showproj.aspx?proj=9013).
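A polygenic risk score of the kind referred to above is commonly computed as a weighted sum of risk-allele dosages. The sketch below assumes that standard additive form; the variant identifiers, effect sizes, and genotype dosages are invented placeholders, and the study's actual scoring pipeline is not described in the abstract.

```python
def polygenic_risk_score(effect_sizes, dosages):
    """Additive PRS: sum over variants of effect size times allele dosage (0, 1, or 2).

    Variants missing from `dosages` are skipped rather than imputed
    (an assumption made for this sketch).
    """
    return sum(
        beta * dosages[variant]
        for variant, beta in effect_sizes.items()
        if variant in dosages
    )


# Hypothetical per-variant effect sizes (e.g. log odds ratios from a GWAS).
toy_effects = {"variant_a": 0.10, "variant_b": -0.20, "variant_c": 0.30}
# Hypothetical genotype of one participant, as risk-allele counts.
toy_dosages = {"variant_a": 2, "variant_b": 1, "variant_c": 0}

print(polygenic_risk_score(toy_effects, toy_dosages))
```

In practice such scores are usually standardized against a reference population before being used as model inputs; whether and how that was done here is not stated in the abstract.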