With the advancement of artificial intelligence,traffic forecasting is gaining more and more interest in optimizing route planning and enhancing service quality.Traffic volume is an influential parameter for planning ...With the advancement of artificial intelligence,traffic forecasting is gaining more and more interest in optimizing route planning and enhancing service quality.Traffic volume is an influential parameter for planning and operating traffic structures.This study proposed an improved ensemble-based deep learning method to solve traffic volume prediction problems.A set of optimal hyperparameters is also applied for the suggested approach to improve the performance of the learning process.The fusion of these methodologies aims to harness ensemble empirical mode decomposition’s capacity to discern complex traffic patterns and long short-term memory’s proficiency in learning temporal relationships.Firstly,a dataset for automatic vehicle identification is obtained and utilized in the preprocessing stage of the ensemble empirical mode decomposition model.The second aspect involves predicting traffic volume using the long short-term memory algorithm.Next,the study employs a trial-and-error approach to select a set of optimal hyperparameters,including the lookback window,the number of neurons in the hidden layers,and the gradient descent optimization.Finally,the fusion of the obtained results leads to a final traffic volume prediction.The experimental results show that the proposed method outperforms other benchmarks regarding various evaluation measures,including mean absolute error,root mean squared error,mean absolute percentage error,and R-squared.The achieved R-squared value reaches an impressive 98%,while the other evaluation indices surpass the competing.These findings highlight the accuracy of traffic pattern prediction.Consequently,this offers promising prospects for enhancing transportation management systems and urban infrastructure planning.展开更多
Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection ...Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection in sliding window selection and seldom incorporate weather forecast data for displacement prediction,while a single structural model cannot handle input sequences of different lengths at the same time.In order to solve these limitations,in this study,a new approach is proposed that utilizes weather forecast data and incorporates the maximum information coefficient(MIC),long short-term memory network(LSTM),and attention mechanism to establish a teacher-student coupling model with parallel structure for short-term landslide displacement prediction.Through MIC,a suitable input sequence length is selected for the LSTM model.To investigate the influence of rainfall on landslides during different seasons,a parallel teacher-student coupling model is developed that is able to learn sequential information from various time series of different lengths.The teacher model learns sequence information from rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as China Meteorological Administration(CMA)and Reliable Prognosis(https://rp5.ru)to improve the model’s expression capability,and the student model learns sequence information from other time series.An attention module is then designed to integrate different sequence information to derive a context vector,representing seasonal temporal attention mode.Finally,the predicted displacement is obtained through a linear layer.The proposed method demonstrates superior prediction accuracies,surpassing those of the support vector machine(SVM),LSTM,recurrent neural network(RNN),temporal convolutional network(TCN),and LSTM-Attention models.It achieves a mean absolute error(MAE)of 0.072 mm,root mean square error(RMSE)of 0.096 mm,and pearson correlation coefficients(PCCS)of 0.85.Additionally,it exhibits enhanced prediction stability and interpretability,rendering it an indispensable tool for landslide disaster prevention and mitigation.展开更多
Natural events have had a significant impact on overall flight activity,and the aviation industry plays a vital role in helping society cope with the impact of these events.As one of the most impactful weather typhoon...Natural events have had a significant impact on overall flight activity,and the aviation industry plays a vital role in helping society cope with the impact of these events.As one of the most impactful weather typhoon seasons appears and continues,airlines operating in threatened areas and passengers having travel plans during this time period will pay close attention to the development of tropical storms.This paper proposes a deep multimodal fusion and multitasking trajectory prediction model that can improve the reliability of typhoon trajectory prediction and reduce the quantity of flight scheduling cancellation.The deep multimodal fusion module is formed by deep fusion of the feature output by multiple submodal fusion modules,and the multitask generation module uses longitude and latitude as two related tasks for simultaneous prediction.With more dependable data accuracy,problems can be analysed rapidly and more efficiently,enabling better decision-making with a proactive versus reactive posture.When multiple modalities coexist,features can be extracted from them simultaneously to supplement each other’s information.An actual case study,the typhoon Lichma that swept China in 2019,has demonstrated that the algorithm can effectively reduce the number of unnecessary flight cancellations compared to existing flight scheduling and assist the new generation of flight scheduling systems under extreme weather.展开更多
Among steganalysis techniques,detection against MV(motion vector)domain-based video steganography in the HEVC(High Efficiency Video Coding)standard remains a challenging issue.For the purpose of improving the detectio...Among steganalysis techniques,detection against MV(motion vector)domain-based video steganography in the HEVC(High Efficiency Video Coding)standard remains a challenging issue.For the purpose of improving the detection performance,this paper proposes a steganalysis method that can perfectly detectMV-based steganography in HEVC.Firstly,we define the local optimality of MVP(Motion Vector Prediction)based on the technology of AMVP(Advanced Motion Vector Prediction).Secondly,we analyze that in HEVC video,message embedding either usingMVP index orMVD(Motion Vector Difference)may destroy the above optimality of MVP.And then,we define the optimal rate of MVP as a steganalysis feature.Finally,we conduct steganalysis detection experiments on two general datasets for three popular steganographymethods and compare the performance with four state-ofthe-art steganalysis methods.The experimental results demonstrate the effectiveness of the proposed feature set.Furthermore,our method stands out for its practical applicability,requiring no model training and exhibiting low computational complexity,making it a viable solution for real-world scenarios.展开更多
The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine l...The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine learning(ML)models effectively deal with such challenges.This research paper comprehensively analyses recent advancements in crop yield prediction from January 2016 to March 2024.In addition,it analyses the effectiveness of various input parameters considered in crop yield prediction models.We conducted an in-depth search and gathered studies that employed crop modeling and AI-based methods to predict crop yield.The total number of articles reviewed for crop yield prediction using ML,meta-modeling(Crop models coupled with ML/DL),and DL-based prediction models and input parameter selection is 125.We conduct the research by setting up five objectives for this research and discussing them after analyzing the selected research papers.Each study is assessed based on the crop type,input parameters employed for prediction,the modeling techniques adopted,and the evaluation metrics used for estimatingmodel performance.We also discuss the ethical and social impacts of AI on agriculture.However,various approaches presented in the scientific literature have delivered impressive predictions,they are complicateddue to intricate,multifactorial influences oncropgrowthand theneed for accuratedata-driven models.Therefore,thorough research is required to deal with challenges in predicting agricultural output.展开更多
The output of photovoltaic power stations is significantly affected by environmental factors,leading to intermittent and fluctuating power generation.With the increasing frequency of extreme weather events due to glob...The output of photovoltaic power stations is significantly affected by environmental factors,leading to intermittent and fluctuating power generation.With the increasing frequency of extreme weather events due to global warming,photovoltaic power stations may experience drastic reductions in power generation or even complete shutdowns during such conditions.The integration of these stations on a large scale into the power grid could potentially pose challenges to systemstability.To address this issue,in this study,we propose a network architecture based on VMDKELMfor predicting the power output of photovoltaic power plants during severe weather events.Initially,a grey relational analysis is conducted to identify key environmental factors influencing photovoltaic power generation.Subsequently,GMM clustering is utilized to classify meteorological data points based on their probabilities within different Gaussian distributions,enabling comprehensive meteorological clustering and extraction of significant extreme weather data.The data are decomposed using VMD to Fourier transform,followed by smoothing processing and signal reconstruction using KELM to forecast photovoltaic power output under major extreme weather conditions.The proposed prediction scheme is validated by establishing three prediction models,and the predicted photovoltaic output under four major extreme weather conditions is analyzed to assess the impact of severe weather on photovoltaic power station output.The experimental results show that the photovoltaic power output under conditions of dust storms,thunderstorms,solid hail precipitation,and snowstorms is reduced by 68.84%,42.70%,61.86%,and 49.92%,respectively,compared to that under clear day conditions.The photovoltaic power prediction accuracies,in descending order,are dust storms,solid hail precipitation,thunderstorms,and snowstorms.展开更多
This paper uses Gaussian interval type-2 fuzzy se theory on historical traffic volume data processing to obtain a 24-hour prediction of traffic volume with high precision. A K-means clustering method is used in this p...This paper uses Gaussian interval type-2 fuzzy se theory on historical traffic volume data processing to obtain a 24-hour prediction of traffic volume with high precision. A K-means clustering method is used in this paper to get 5 minutes traffic volume variation as input data for the Gaussian interval type-2 fuzzy sets which can reflect the distribution of historical traffic volume in one statistical period. Moreover, the cluster with the largest collection of data obtained by K-means clustering method is calculated to get the key parameters of type-2 fuzzy sets, mean and standard deviation of the Gaussian membership function.Using the range of data as the input of Gaussian interval type-2 fuzzy sets leads to the range of traffic volume forecasting output with the ability of describing the possible range of the traffic volume as well as the traffic volume prediction data with high accuracy. The simulation results show that the average relative error is reduced to 8% based on the combined K-means Gaussian interval type-2 fuzzy sets forecasting method. The fluctuation range in terms of an upper and a lower forecasting traffic volume completely envelopes the actual traffic volume and reproduces the fluctuation range of traffic flow.展开更多
The mixing enthalpies of 23 binary liquid alloys are calculated by molecular interaction volume model (MIVM), which is a two-parameter model with the partial molar infinite dilute mixing enthalpies. The predicted va...The mixing enthalpies of 23 binary liquid alloys are calculated by molecular interaction volume model (MIVM), which is a two-parameter model with the partial molar infinite dilute mixing enthalpies. The predicted values are in agreement with the experimental data and then indicate that the model is reliable and convenient.展开更多
Twitter sentiment has been shown to be useful in predicting whether Bitcoin’s price will increase or decrease.Yet the state-of-the-art is limited to predicting the price direction and not the magnitude of increase/de...Twitter sentiment has been shown to be useful in predicting whether Bitcoin’s price will increase or decrease.Yet the state-of-the-art is limited to predicting the price direction and not the magnitude of increase/decrease.In this paper,we seek to build on the state-of-the-art to not only predict the direction yet to also predict the magnitude of increase/decrease.We utilise not only sentiment extracted from tweets,but also the volume of tweets.We present results from experiments exploring the relation between sentiment and future price at different temporal granularities,with the goal of discovering the optimal time interval at which the sentiment expressed becomes a reliable indicator of price change.Two different neural network models are explored and evaluated,one based on recurrent nets and one based on convolutional networks.An additional model is presented to predict the magnitude of change,which is framed as a multi-class classification problem.It is shown that this model yields more reliable predictions when used alongside a price trend prediction model.The main research contribution from this paper is that we demonstrate that not only can price direction prediction be made but the magnitude in price change can be predicted with relative accuracy(63%).展开更多
With the development of information and communication technologies,all public tertiary hospitals in China began to use online outpatient appointment systems.However,the phenomenon of patient no-shows in online outpati...With the development of information and communication technologies,all public tertiary hospitals in China began to use online outpatient appointment systems.However,the phenomenon of patient no-shows in online outpatient appointments is becoming more serious.The objective of this study is to design a prediction model for patient no-shows,thereby assisting hospitals in making relevant decisions,and reducing the probability of patient no-show behavior.We used 382,004 original online outpatient appointment records,and divided the data set into a training set(N_(1)=286,503),and a validation set(N_(2)=95,501).We used machine learning algorithms such as logistic regression,k-nearest neighbor(KNN),boosting,decision tree(DT),random forest(RF)and bagging to design prediction models for patient no-show in online outpatient appointments.The patient no-show rate of online outpatient appointment was 11.1%(N=42,224).From the validation set,bagging had the highest area under the ROC curve and AUC value,which was 0.990,followed by random forest and boosting models,which were 0.987 and 0.976,respectively.In contrast,compared with the previous prediction models,the area under ROC and AUC values of the logistic regression,decision tree,and k-nearest neighbors were lower at 0.597,0.499 and 0.843,respectively.This study demonstrates the possibility of using data from multiple sources to predict patient no-shows.The prediction model results can provide decision basis for hospitals to reduce medical resource waste,develop effective outpatient appointment policies,and optimize operations.展开更多
The current situation of the railway passenger traffic (RPT) and the traffic marketing is analyzed. The grey model theory is adopted to establish a prediction model for the railway passenger traffic volume (RPTV).T...The current situation of the railway passenger traffic (RPT) and the traffic marketing is analyzed. The grey model theory is adopted to establish a prediction model for the railway passenger traffic volume (RPTV).The RPTV from 2001 to 2005 is predicted with the proposed model, and a few suggestions are put forward.展开更多
The structure type for the crystal of 4,4'-bis-(2-hydroxy-ethoxyl)-biphenyl 1 has been predicted by using the previously developed interfacial model for small organic molecules. Based on the calculated hydrophobic...The structure type for the crystal of 4,4'-bis-(2-hydroxy-ethoxyl)-biphenyl 1 has been predicted by using the previously developed interfacial model for small organic molecules. Based on the calculated hydrophobic to hydrophilic volume of 1, this model predicts the crystal structure to be of lamellar or bicontinuous type, which has been confirmed by the X-ray single-crystal structure analysis (C20H26O6, monoclinic, P21/C, a = 16.084(1), b = 6.0103(4), c = 9.6410(7) A, β9 = 103.014(2)°, V= 908.1(1) A3, Z = 2, Dc= 1.325 g/cm3, F(000)=388,μ = 0.097 mm-1, MoKα radiation, λ = 0.71073 A, R = 0.0382 and wR = 0.0882 with I > 2σ(I) for 7121 reflections collected, 1852 unique reflections and 170 parameters). As predicted, the hydrophobic and hydrophilic portions of 1 form in the lamellae. The same interfacial model is applied to other amphilphilic small molecule organic systems for structural type prediction.展开更多
This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while ...This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while the corrosion rate as the output.6 dif-ferent ML algorithms were used to construct the proposed model.Through optimization and filtering,the eXtreme gradient boosting(XG-Boost)model exhibited good corrosion rate prediction accuracy.The features of material properties were then transformed into atomic and physical features using the proposed property transformation approach,and the dominant descriptors that affected the corrosion rate were filtered using the recursive feature elimination(RFE)as well as XGBoost methods.The established ML models exhibited better predic-tion performance and generalization ability via property transformation descriptors.In addition,the SHapley additive exPlanations(SHAP)method was applied to analyze the relationship between the descriptors and corrosion rate.The results showed that the property transformation model could effectively help with analyzing the corrosion behavior,thereby significantly improving the generalization ability of corrosion rate prediction models.展开更多
The scientific community recognizes the seriousness of rockbursts and the need for effective mitigation measures.The literature reports various successful applications of machine learning(ML)models for rockburst asses...The scientific community recognizes the seriousness of rockbursts and the need for effective mitigation measures.The literature reports various successful applications of machine learning(ML)models for rockburst assessment;however,a significant question remains unanswered:How reliable are these models,and at what confidence level are classifications made?Typically,ML models output single rockburst grade even in the face of intricate and out-of-distribution samples,without any associated confidence value.Given the susceptibility of ML models to errors,it becomes imperative to quantify their uncertainty to prevent consequential failures.To address this issue,we propose a conformal prediction(CP)framework built on traditional ML models(extreme gradient boosting and random forest)to generate valid classifications of rockburst while producing a measure of confidence for its output.The proposed framework guarantees marginal coverage and,in most cases,conditional coverage on the test dataset.The CP was evaluated on a rockburst case in the Sanshandao Gold Mine in China,where it achieved high coverage and efficiency at applicable confidence levels.Significantly,the CP identified several“confident”classifications from the traditional ML model as unreliable,necessitating expert verification for informed decision-making.The proposed framework improves the reliability and accuracy of rockburst assessments,with the potential to bolster user confidence.展开更多
Accurate prediction of tropical cyclone(TC)intensity is challenging due to the complex physical processes involved.Here,we introduce a new TC intensity prediction scheme for the western North Pacific(WNP)based on a ti...Accurate prediction of tropical cyclone(TC)intensity is challenging due to the complex physical processes involved.Here,we introduce a new TC intensity prediction scheme for the western North Pacific(WNP)based on a time-dependent theory of TC intensification,termed the energetically based dynamical system(EBDS)model,together with the use of a long short-term memory(LSTM)neural network.In time-dependent theory,TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors,expressed as environmental dynamical efficiency.The LSTM neural network is used to predict the environmental dynamical efficiency in the EBDS model trained using besttrack TC data and global reanalysis data during 1982–2017.The transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System(GFS)of the National Centers for Environmental Prediction during 2017–21.The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity.The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data.The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration(CMA)and those by other state-of-art statistical and dynamical forecast systems,except for the 72-h forecast.Particularly at the longer lead times of 96 h and 120 h,the new scheme has smaller forecast errors,with a more than 30%improvement over the official forecasts.展开更多
Background:Choosing the appropriate antipsychotic drug(APD)treatment for patients with schizophrenia(SCZ)can be challenging,as the treatment response to APD is highly variable and difficult to predict due to the lack ...Background:Choosing the appropriate antipsychotic drug(APD)treatment for patients with schizophrenia(SCZ)can be challenging,as the treatment response to APD is highly variable and difficult to predict due to the lack of effective biomarkers.Previous studies have indicated the association between treatment response and genetic and epigenetic factors,but no effective biomarkers have been identified.Hence,further research is imperative to enhance precision medicine in SCZ treatment.Methods:Participants with SCZ were recruited from two randomized trials.The discovery cohort was recruited from the CAPOC trial(n=2307)involved 6 weeks of treatment and equally randomized the participants to the Olanzapine,Risperidone,Quetiapine,Aripiprazole,Ziprasidone,and Haloperidol/Perphenazine(subsequently equally assigned to one or the other)groups.The external validation cohort was recruited from the CAPEC trial(n=1379),which involved 8 weeks of treatment and equally randomized the participants to the Olanzapine,Risperidone,and Aripiprazole groups.Additionally,healthy controls(n=275)from the local community were utilized as a genetic/epigenetic reference.The genetic and epigenetic(DNA methylation)risks of SCZ were assessed using the polygenic risk score(PRS)and polymethylation score,respectively.The study also examined the genetic-epigenetic interactions with treatment response through differential methylation analysis,methylation quantitative trait loci,colocalization,and promoteranchored chromatin interaction.Machine learning was used to develop a prediction model for treatment response,which was evaluated for accuracy and clinical benefit using the area under curve(AUC)for classification,R^(2) for regression,and decision curve analysis.Results:Six risk genes for SCZ(LINC01795,DDHD2,SBNO1,KCNG2,SEMA7A,and RUFY1)involved in cortical morphology were identified as having a genetic-epigenetic interaction associated with treatment response.The developed and externally validated prediction model,which incorporated clinical information,PRS,genetic risk score(GRS),and proxy methylation level(proxyDNAm),demonstrated positive benefits for a wide range of patients receiving different APDs,regardless of sex[discovery cohort:AUC=0.874(95%CI 0.867-0.881),R^(2)=0.478;external validation cohort:AUC=0.851(95%CI 0.841-0.861),R^(2)=0.507].Conclusions:This study presents a promising precision medicine approach to evaluate treatment response,which has the potential to aid clinicians in making informed decisions about APD treatment for patients with SCZ.Trial registration Chinese Clinical Trial Registry(https://www.chictr.org.cn/),18 Aug 2009 retrospectively registered:CAPOC-ChiCTR-RNC-09000521(https://www.chictr.org.cn/showproj.aspx?proj=9014),CAPEC-ChiCTRRNC-09000522(https://www.chictr.org.cn/showproj.aspx?proj=9013).展开更多
The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatia...The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatial and temporal variability and it is highly relevant to oceanic research.In this study,we propose a new data-driven approach,leveraging deep learning techniques,for the prediction of sound velocity fields(SVFs).Our novel spatiotemporal prediction model,STLSTM-SA,combines Spatiotemporal Long Short-Term Memory(ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs.To circumvent the limited amount of observational data,we employ transfer learning by first training the model using reanalysis datasets,followed by fine-tuning it using in-situ analysis data to obtain the final prediction model.By utilizing the historical 12-month SVFs as input,our model predicts the SVFs for the subsequent three months.We compare the performance of five models:Artificial Neural Networks(ANN),Long ShortTerm Memory(LSTM),Convolutional LSTM(ConvLSTM),ST-LSTM,and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022.Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions.The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field(SVF),but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.展开更多
In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken a...In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken as the model inputs,which brings uncertainties to LSP results.This study aims to reveal the influence rules of the different proportional random errors in conditioning factors on the LSP un-certainties,and further explore a method which can effectively reduce the random errors in conditioning factors.The original conditioning factors are firstly used to construct original factors-based LSP models,and then different random errors of 5%,10%,15% and 20%are added to these original factors for con-structing relevant errors-based LSP models.Secondly,low-pass filter-based LSP models are constructed by eliminating the random errors using low-pass filter method.Thirdly,the Ruijin County of China with 370 landslides and 16 conditioning factors are used as study case.Three typical machine learning models,i.e.multilayer perceptron(MLP),support vector machine(SVM)and random forest(RF),are selected as LSP models.Finally,the LSP uncertainties are discussed and results show that:(1)The low-pass filter can effectively reduce the random errors in conditioning factors to decrease the LSP uncertainties.(2)With the proportions of random errors increasing from 5%to 20%,the LSP uncertainty increases continuously.(3)The original factors-based models are feasible for LSP in the absence of more accurate conditioning factors.(4)The influence degrees of two uncertainty issues,machine learning models and different proportions of random errors,on the LSP modeling are large and basically the same.(5)The Shapley values effectively explain the internal mechanism of machine learning model predicting landslide sus-ceptibility.In conclusion,greater proportion of random errors in conditioning factors results in higher LSP uncertainty,and low-pass filter can effectively reduce these random errors.展开更多
In recent years,deep learning methods have gradually been applied to prediction tasks related to Arctic sea ice concentration,but relatively little research has been conducted for larger spatial and temporal scales,ma...In recent years,deep learning methods have gradually been applied to prediction tasks related to Arctic sea ice concentration,but relatively little research has been conducted for larger spatial and temporal scales,mainly due to the limited time coverage of observations and reanalysis data.Meanwhile,deep learning predictions of sea ice thickness(SIT)have yet to receive ample attention.In this study,two data-driven deep learning(DL)models are built based on the ConvLSTM and fully convolutional U-net(FC-Unet)algorithms and trained using CMIP6 historical simulations for transfer learning and fine-tuned using reanalysis/observations.These models enable monthly predictions of Arctic SIT without considering the complex physical processes involved.Through comprehensive assessments of prediction skills by season and region,the results suggest that using a broader set of CMIP6 data for transfer learning,as well as incorporating multiple climate variables as predictors,contribute to better prediction results,although both DL models can effectively predict the spatiotemporal features of SIT anomalies.Regarding the predicted SIT anomalies of the FC-Unet model,the spatial correlations with reanalysis reach an average level of 89%over all months,while the temporal anomaly correlation coefficients are close to unity in most cases.The models also demonstrate robust performances in predicting SIT and SIE during extreme events.The effectiveness and reliability of the proposed deep transfer learning models in predicting Arctic SIT can facilitate more accurate pan-Arctic predictions,aiding climate change research and real-time business applications.展开更多
The accuracy of landslide susceptibility prediction(LSP)mainly depends on the precision of the landslide spatial position.However,the spatial position error of landslide survey is inevitable,resulting in considerable ...The accuracy of landslide susceptibility prediction(LSP)mainly depends on the precision of the landslide spatial position.However,the spatial position error of landslide survey is inevitable,resulting in considerable uncertainties in LSP modeling.To overcome this drawback,this study explores the influence of positional errors of landslide spatial position on LSP uncertainties,and then innovatively proposes a semi-supervised machine learning model to reduce the landslide spatial position error.This paper collected 16 environmental factors and 337 landslides with accurate spatial positions taking Shangyou County of China as an example.The 30e110 m error-based multilayer perceptron(MLP)and random forest(RF)models for LSP are established by randomly offsetting the original landslide by 30,50,70,90 and 110 m.The LSP uncertainties are analyzed by the LSP accuracy and distribution characteristics.Finally,a semi-supervised model is proposed to relieve the LSP uncertainties.Results show that:(1)The LSP accuracies of error-based RF/MLP models decrease with the increase of landslide position errors,and are lower than those of original data-based models;(2)70 m error-based models can still reflect the overall distribution characteristics of landslide susceptibility indices,thus original landslides with certain position errors are acceptable for LSP;(3)Semi-supervised machine learning model can efficiently reduce the landslide position errors and thus improve the LSP accuracies.展开更多
文摘With the advancement of artificial intelligence,traffic forecasting is gaining more and more interest in optimizing route planning and enhancing service quality.Traffic volume is an influential parameter for planning and operating traffic structures.This study proposed an improved ensemble-based deep learning method to solve traffic volume prediction problems.A set of optimal hyperparameters is also applied for the suggested approach to improve the performance of the learning process.The fusion of these methodologies aims to harness ensemble empirical mode decomposition’s capacity to discern complex traffic patterns and long short-term memory’s proficiency in learning temporal relationships.Firstly,a dataset for automatic vehicle identification is obtained and utilized in the preprocessing stage of the ensemble empirical mode decomposition model.The second aspect involves predicting traffic volume using the long short-term memory algorithm.Next,the study employs a trial-and-error approach to select a set of optimal hyperparameters,including the lookback window,the number of neurons in the hidden layers,and the gradient descent optimization.Finally,the fusion of the obtained results leads to a final traffic volume prediction.The experimental results show that the proposed method outperforms other benchmarks regarding various evaluation measures,including mean absolute error,root mean squared error,mean absolute percentage error,and R-squared.The achieved R-squared value reaches an impressive 98%,while the other evaluation indices surpass the competing.These findings highlight the accuracy of traffic pattern prediction.Consequently,this offers promising prospects for enhancing transportation management systems and urban infrastructure planning.
基金This research work is supported by Sichuan Science and Technology Program(Grant No.2022YFS0586)the National Key R&D Program of China(Grant No.2019YFC1509301)the National Natural Science Foundation of China(Grant No.61976046).
文摘Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection in sliding window selection and seldom incorporate weather forecast data for displacement prediction,while a single structural model cannot handle input sequences of different lengths at the same time.In order to solve these limitations,in this study,a new approach is proposed that utilizes weather forecast data and incorporates the maximum information coefficient(MIC),long short-term memory network(LSTM),and attention mechanism to establish a teacher-student coupling model with parallel structure for short-term landslide displacement prediction.Through MIC,a suitable input sequence length is selected for the LSTM model.To investigate the influence of rainfall on landslides during different seasons,a parallel teacher-student coupling model is developed that is able to learn sequential information from various time series of different lengths.The teacher model learns sequence information from rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as China Meteorological Administration(CMA)and Reliable Prognosis(https://rp5.ru)to improve the model’s expression capability,and the student model learns sequence information from other time series.An attention module is then designed to integrate different sequence information to derive a context vector,representing seasonal temporal attention mode.Finally,the predicted displacement is obtained through a linear layer.The proposed method demonstrates superior prediction accuracies,surpassing those of the support vector machine(SVM),LSTM,recurrent neural network(RNN),temporal convolutional network(TCN),and LSTM-Attention models.It achieves a mean absolute error(MAE)of 0.072 mm,root mean square error(RMSE)of 0.096 mm,and pearson correlation coefficients(PCCS)of 0.85.Additionally,it exhibits enhanced prediction stability and interpretability,rendering it an indispensable tool for landslide disaster prevention and mitigation.
基金supported by the National Natural Science Foundation of China(62073330)。
文摘Natural events have had a significant impact on overall flight activity,and the aviation industry plays a vital role in helping society cope with the impact of these events.As one of the most impactful weather typhoon seasons appears and continues,airlines operating in threatened areas and passengers having travel plans during this time period will pay close attention to the development of tropical storms.This paper proposes a deep multimodal fusion and multitasking trajectory prediction model that can improve the reliability of typhoon trajectory prediction and reduce the quantity of flight scheduling cancellation.The deep multimodal fusion module is formed by deep fusion of the feature output by multiple submodal fusion modules,and the multitask generation module uses longitude and latitude as two related tasks for simultaneous prediction.With more dependable data accuracy,problems can be analysed rapidly and more efficiently,enabling better decision-making with a proactive versus reactive posture.When multiple modalities coexist,features can be extracted from them simultaneously to supplement each other’s information.An actual case study,the typhoon Lichma that swept China in 2019,has demonstrated that the algorithm can effectively reduce the number of unnecessary flight cancellations compared to existing flight scheduling and assist the new generation of flight scheduling systems under extreme weather.
基金the National Natural Science Foundation of China(Grant Nos.62272478,62202496,61872384).
文摘Among steganalysis techniques,detection against MV(motion vector)domain-based video steganography in the HEVC(High Efficiency Video Coding)standard remains a challenging issue.For the purpose of improving the detection performance,this paper proposes a steganalysis method that can perfectly detectMV-based steganography in HEVC.Firstly,we define the local optimality of MVP(Motion Vector Prediction)based on the technology of AMVP(Advanced Motion Vector Prediction).Secondly,we analyze that in HEVC video,message embedding either usingMVP index orMVD(Motion Vector Difference)may destroy the above optimality of MVP.And then,we define the optimal rate of MVP as a steganalysis feature.Finally,we conduct steganalysis detection experiments on two general datasets for three popular steganographymethods and compare the performance with four state-ofthe-art steganalysis methods.The experimental results demonstrate the effectiveness of the proposed feature set.Furthermore,our method stands out for its practical applicability,requiring no model training and exhibiting low computational complexity,making it a viable solution for real-world scenarios.
文摘The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine learning(ML)models effectively deal with such challenges.This research paper comprehensively analyses recent advancements in crop yield prediction from January 2016 to March 2024.In addition,it analyses the effectiveness of various input parameters considered in crop yield prediction models.We conducted an in-depth search and gathered studies that employed crop modeling and AI-based methods to predict crop yield.The total number of articles reviewed for crop yield prediction using ML,meta-modeling(Crop models coupled with ML/DL),and DL-based prediction models and input parameter selection is 125.We conduct the research by setting up five objectives for this research and discussing them after analyzing the selected research papers.Each study is assessed based on the crop type,input parameters employed for prediction,the modeling techniques adopted,and the evaluation metrics used for estimatingmodel performance.We also discuss the ethical and social impacts of AI on agriculture.However,various approaches presented in the scientific literature have delivered impressive predictions,they are complicateddue to intricate,multifactorial influences oncropgrowthand theneed for accuratedata-driven models.Therefore,thorough research is required to deal with challenges in predicting agricultural output.
基金funded by the Open Fund of National Key Laboratory of Renewable Energy Grid Integration(China Electric Power Research Institute)(No.NYB51202301624).
文摘The output of photovoltaic power stations is significantly affected by environmental factors,leading to intermittent and fluctuating power generation.With the increasing frequency of extreme weather events due to global warming,photovoltaic power stations may experience drastic reductions in power generation or even complete shutdowns during such conditions.The integration of these stations on a large scale into the power grid could potentially pose challenges to systemstability.To address this issue,in this study,we propose a network architecture based on VMDKELMfor predicting the power output of photovoltaic power plants during severe weather events.Initially,a grey relational analysis is conducted to identify key environmental factors influencing photovoltaic power generation.Subsequently,GMM clustering is utilized to classify meteorological data points based on their probabilities within different Gaussian distributions,enabling comprehensive meteorological clustering and extraction of significant extreme weather data.The data are decomposed using VMD to Fourier transform,followed by smoothing processing and signal reconstruction using KELM to forecast photovoltaic power output under major extreme weather conditions.The proposed prediction scheme is validated by establishing three prediction models,and the predicted photovoltaic output under four major extreme weather conditions is analyzed to assess the impact of severe weather on photovoltaic power station output.The experimental results show that the photovoltaic power output under conditions of dust storms,thunderstorms,solid hail precipitation,and snowstorms is reduced by 68.84%,42.70%,61.86%,and 49.92%,respectively,compared to that under clear day conditions.The photovoltaic power prediction accuracies,in descending order,are dust storms,solid hail precipitation,thunderstorms,and snowstorms.
基金supported by the National Key Research and Development Program of China(2018YFB1201500)
文摘This paper uses Gaussian interval type-2 fuzzy se theory on historical traffic volume data processing to obtain a 24-hour prediction of traffic volume with high precision. A K-means clustering method is used in this paper to get 5 minutes traffic volume variation as input data for the Gaussian interval type-2 fuzzy sets which can reflect the distribution of historical traffic volume in one statistical period. Moreover, the cluster with the largest collection of data obtained by K-means clustering method is calculated to get the key parameters of type-2 fuzzy sets, mean and standard deviation of the Gaussian membership function.Using the range of data as the input of Gaussian interval type-2 fuzzy sets leads to the range of traffic volume forecasting output with the ability of describing the possible range of the traffic volume as well as the traffic volume prediction data with high accuracy. The simulation results show that the average relative error is reduced to 8% based on the combined K-means Gaussian interval type-2 fuzzy sets forecasting method. The fluctuation range in terms of an upper and a lower forecasting traffic volume completely envelopes the actual traffic volume and reproduces the fluctuation range of traffic flow.
基金the National Natural Science Foundation ofChina (No.50764006)Young Foundation of Kunming University of Science and Tech-nology (No.KKZ200727021)the Applied Fundamental Research Foundation ofYunnan Province (Nos.2007E039M and 2006E0021M).
文摘The mixing enthalpies of 23 binary liquid alloys are calculated by molecular interaction volume model (MIVM), which is a two-parameter model with the partial molar infinite dilute mixing enthalpies. The predicted values are in agreement with the experimental data and then indicate that the model is reliable and convenient.
文摘Twitter sentiment has been shown to be useful in predicting whether Bitcoin’s price will increase or decrease.Yet the state-of-the-art is limited to predicting the price direction and not the magnitude of increase/decrease.In this paper,we seek to build on the state-of-the-art to not only predict the direction yet to also predict the magnitude of increase/decrease.We utilise not only sentiment extracted from tweets,but also the volume of tweets.We present results from experiments exploring the relation between sentiment and future price at different temporal granularities,with the goal of discovering the optimal time interval at which the sentiment expressed becomes a reliable indicator of price change.Two different neural network models are explored and evaluated,one based on recurrent nets and one based on convolutional networks.An additional model is presented to predict the magnitude of change,which is framed as a multi-class classification problem.It is shown that this model yields more reliable predictions when used alongside a price trend prediction model.The main research contribution from this paper is that we demonstrate that not only can price direction prediction be made but the magnitude in price change can be predicted with relative accuracy(63%).
基金National Natural Science Foundation Program of China[No.71971092],[No.71671073]and[71810107003].
文摘With the development of information and communication technologies,all public tertiary hospitals in China began to use online outpatient appointment systems.However,the phenomenon of patient no-shows in online outpatient appointments is becoming more serious.The objective of this study is to design a prediction model for patient no-shows,thereby assisting hospitals in making relevant decisions,and reducing the probability of patient no-show behavior.We used 382,004 original online outpatient appointment records,and divided the data set into a training set(N_(1)=286,503),and a validation set(N_(2)=95,501).We used machine learning algorithms such as logistic regression,k-nearest neighbor(KNN),boosting,decision tree(DT),random forest(RF)and bagging to design prediction models for patient no-show in online outpatient appointments.The patient no-show rate of online outpatient appointment was 11.1%(N=42,224).From the validation set,bagging had the highest area under the ROC curve and AUC value,which was 0.990,followed by random forest and boosting models,which were 0.987 and 0.976,respectively.In contrast,compared with the previous prediction models,the area under ROC and AUC values of the logistic regression,decision tree,and k-nearest neighbors were lower at 0.597,0.499 and 0.843,respectively.This study demonstrates the possibility of using data from multiple sources to predict patient no-shows.The prediction model results can provide decision basis for hospitals to reduce medical resource waste,develop effective outpatient appointment policies,and optimize operations.
文摘The current situation of the railway passenger traffic (RPT) and the traffic marketing is analyzed. The grey model theory is adopted to establish a prediction model for the railway passenger traffic volume (RPTV).The RPTV from 2001 to 2005 is predicted with the proposed model, and a few suggestions are put forward.
基金This work was supported by the National Science Foundation(Grant DMR-9812351)
文摘The structure type for the crystal of 4,4'-bis-(2-hydroxy-ethoxyl)-biphenyl 1 has been predicted by using the previously developed interfacial model for small organic molecules. Based on the calculated hydrophobic to hydrophilic volume of 1, this model predicts the crystal structure to be of lamellar or bicontinuous type, which has been confirmed by the X-ray single-crystal structure analysis (C20H26O6, monoclinic, P21/C, a = 16.084(1), b = 6.0103(4), c = 9.6410(7) A, β9 = 103.014(2)°, V= 908.1(1) A3, Z = 2, Dc= 1.325 g/cm3, F(000)=388,μ = 0.097 mm-1, MoKα radiation, λ = 0.71073 A, R = 0.0382 and wR = 0.0882 with I > 2σ(I) for 7121 reflections collected, 1852 unique reflections and 170 parameters). As predicted, the hydrophobic and hydrophilic portions of 1 form in the lamellae. The same interfacial model is applied to other amphilphilic small molecule organic systems for structural type prediction.
基金the National Key R&D Program of China(No.2021YFB3701705).
文摘This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while the corrosion rate as the output.6 dif-ferent ML algorithms were used to construct the proposed model.Through optimization and filtering,the eXtreme gradient boosting(XG-Boost)model exhibited good corrosion rate prediction accuracy.The features of material properties were then transformed into atomic and physical features using the proposed property transformation approach,and the dominant descriptors that affected the corrosion rate were filtered using the recursive feature elimination(RFE)as well as XGBoost methods.The established ML models exhibited better predic-tion performance and generalization ability via property transformation descriptors.In addition,the SHapley additive exPlanations(SHAP)method was applied to analyze the relationship between the descriptors and corrosion rate.The results showed that the property transformation model could effectively help with analyzing the corrosion behavior,thereby significantly improving the generalization ability of corrosion rate prediction models.
文摘The scientific community recognizes the seriousness of rockbursts and the need for effective mitigation measures.The literature reports various successful applications of machine learning(ML)models for rockburst assessment;however,a significant question remains unanswered:How reliable are these models,and at what confidence level are classifications made?Typically,ML models output single rockburst grade even in the face of intricate and out-of-distribution samples,without any associated confidence value.Given the susceptibility of ML models to errors,it becomes imperative to quantify their uncertainty to prevent consequential failures.To address this issue,we propose a conformal prediction(CP)framework built on traditional ML models(extreme gradient boosting and random forest)to generate valid classifications of rockburst while producing a measure of confidence for its output.The proposed framework guarantees marginal coverage and,in most cases,conditional coverage on the test dataset.The CP was evaluated on a rockburst case in the Sanshandao Gold Mine in China,where it achieved high coverage and efficiency at applicable confidence levels.Significantly,the CP identified several“confident”classifications from the traditional ML model as unreliable,necessitating expert verification for informed decision-making.The proposed framework improves the reliability and accuracy of rockburst assessments,with the potential to bolster user confidence.
基金supported by the National Key R&D Program of China(Grant No.2017YFC1501604)the National Natural Science Foundation of China(Grant Nos.41875114 and 41875057).
文摘Accurate prediction of tropical cyclone(TC)intensity is challenging due to the complex physical processes involved.Here,we introduce a new TC intensity prediction scheme for the western North Pacific(WNP)based on a time-dependent theory of TC intensification,termed the energetically based dynamical system(EBDS)model,together with the use of a long short-term memory(LSTM)neural network.In time-dependent theory,TC intensity change is controlled by both the internal dynamics of the TC system and various environmental factors,expressed as environmental dynamical efficiency.The LSTM neural network is used to predict the environmental dynamical efficiency in the EBDS model trained using besttrack TC data and global reanalysis data during 1982–2017.The transfer learning and ensemble methods are used to retrain the scheme using the environmental factors predicted by the Global Forecast System(GFS)of the National Centers for Environmental Prediction during 2017–21.The predicted environmental dynamical efficiency is finally iterated into the EBDS equations to predict TC intensity.The new scheme is evaluated for TC intensity prediction using both reanalysis data and the GFS prediction data.The intensity prediction by the new scheme shows better skill than the official prediction from the China Meteorological Administration(CMA)and those by other state-of-art statistical and dynamical forecast systems,except for the 72-h forecast.Particularly at the longer lead times of 96 h and 120 h,the new scheme has smaller forecast errors,with a more than 30%improvement over the official forecasts.
基金supported by the National Natural Science Foundation of China(81825009,82071505,81901358)the Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences(2021-I2MC&T-B-099,2019-I2M-5–006)+2 种基金the Program of Chinese Institute for Brain Research Beijing(2020-NKX-XM-12)the King’s College London-Peking University Health Science Center Joint Institute for Medical Research(BMU2020KCL001,BMU2019LCKXJ012)the National Key R&D Program of China(2021YFF1201103,2016YFC1307000).
文摘Background:Choosing the appropriate antipsychotic drug(APD)treatment for patients with schizophrenia(SCZ)can be challenging,as the treatment response to APD is highly variable and difficult to predict due to the lack of effective biomarkers.Previous studies have indicated the association between treatment response and genetic and epigenetic factors,but no effective biomarkers have been identified.Hence,further research is imperative to enhance precision medicine in SCZ treatment.Methods:Participants with SCZ were recruited from two randomized trials.The discovery cohort was recruited from the CAPOC trial(n=2307)involved 6 weeks of treatment and equally randomized the participants to the Olanzapine,Risperidone,Quetiapine,Aripiprazole,Ziprasidone,and Haloperidol/Perphenazine(subsequently equally assigned to one or the other)groups.The external validation cohort was recruited from the CAPEC trial(n=1379),which involved 8 weeks of treatment and equally randomized the participants to the Olanzapine,Risperidone,and Aripiprazole groups.Additionally,healthy controls(n=275)from the local community were utilized as a genetic/epigenetic reference.The genetic and epigenetic(DNA methylation)risks of SCZ were assessed using the polygenic risk score(PRS)and polymethylation score,respectively.The study also examined the genetic-epigenetic interactions with treatment response through differential methylation analysis,methylation quantitative trait loci,colocalization,and promoteranchored chromatin interaction.Machine learning was used to develop a prediction model for treatment response,which was evaluated for accuracy and clinical benefit using the area under curve(AUC)for classification,R^(2) for regression,and decision curve analysis.Results:Six risk genes for SCZ(LINC01795,DDHD2,SBNO1,KCNG2,SEMA7A,and RUFY1)involved in cortical morphology were identified as having a genetic-epigenetic interaction associated with treatment response.The developed and externally validated prediction model,which incorporated clinical information,PRS,genetic risk score(GRS),and proxy methylation level(proxyDNAm),demonstrated positive benefits for a wide range of patients receiving different APDs,regardless of sex[discovery cohort:AUC=0.874(95%CI 0.867-0.881),R^(2)=0.478;external validation cohort:AUC=0.851(95%CI 0.841-0.861),R^(2)=0.507].Conclusions:This study presents a promising precision medicine approach to evaluate treatment response,which has the potential to aid clinicians in making informed decisions about APD treatment for patients with SCZ.Trial registration Chinese Clinical Trial Registry(https://www.chictr.org.cn/),18 Aug 2009 retrospectively registered:CAPOC-ChiCTR-RNC-09000521(https://www.chictr.org.cn/showproj.aspx?proj=9014),CAPEC-ChiCTRRNC-09000522(https://www.chictr.org.cn/showproj.aspx?proj=9013).
基金supported by the National Natural Science Foundation of China(Grant No.42004030)Basic Scientific Fund for National Public Research Institutes of China(Grant No.2022S03)+1 种基金Science and Technology Innovation Project(LSKJ202205102)funded by Laoshan Laboratory,and the National Key Research and Development Program of China(2020YFB0505805).
文摘The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatial and temporal variability and it is highly relevant to oceanic research.In this study,we propose a new data-driven approach,leveraging deep learning techniques,for the prediction of sound velocity fields(SVFs).Our novel spatiotemporal prediction model,STLSTM-SA,combines Spatiotemporal Long Short-Term Memory(ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs.To circumvent the limited amount of observational data,we employ transfer learning by first training the model using reanalysis datasets,followed by fine-tuning it using in-situ analysis data to obtain the final prediction model.By utilizing the historical 12-month SVFs as input,our model predicts the SVFs for the subsequent three months.We compare the performance of five models:Artificial Neural Networks(ANN),Long ShortTerm Memory(LSTM),Convolutional LSTM(ConvLSTM),ST-LSTM,and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022.Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions.The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field(SVF),but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.
基金This work is funded by the National Natural Science Foundation of China(Grant Nos.42377164 and 52079062)the National Science Fund for Distinguished Young Scholars of China(Grant No.52222905).
文摘In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken as the model inputs,which brings uncertainties to LSP results.This study aims to reveal the influence rules of the different proportional random errors in conditioning factors on the LSP un-certainties,and further explore a method which can effectively reduce the random errors in conditioning factors.The original conditioning factors are firstly used to construct original factors-based LSP models,and then different random errors of 5%,10%,15% and 20%are added to these original factors for con-structing relevant errors-based LSP models.Secondly,low-pass filter-based LSP models are constructed by eliminating the random errors using low-pass filter method.Thirdly,the Ruijin County of China with 370 landslides and 16 conditioning factors are used as study case.Three typical machine learning models,i.e.multilayer perceptron(MLP),support vector machine(SVM)and random forest(RF),are selected as LSP models.Finally,the LSP uncertainties are discussed and results show that:(1)The low-pass filter can effectively reduce the random errors in conditioning factors to decrease the LSP uncertainties.(2)With the proportions of random errors increasing from 5%to 20%,the LSP uncertainty increases continuously.(3)The original factors-based models are feasible for LSP in the absence of more accurate conditioning factors.(4)The influence degrees of two uncertainty issues,machine learning models and different proportions of random errors,on the LSP modeling are large and basically the same.(5)The Shapley values effectively explain the internal mechanism of machine learning model predicting landslide sus-ceptibility.In conclusion,greater proportion of random errors in conditioning factors results in higher LSP uncertainty,and low-pass filter can effectively reduce these random errors.
基金supported by the National Natural Science Foundation of China(Grant Nos.41976193 and 42176243).
文摘In recent years,deep learning methods have gradually been applied to prediction tasks related to Arctic sea ice concentration,but relatively little research has been conducted for larger spatial and temporal scales,mainly due to the limited time coverage of observations and reanalysis data.Meanwhile,deep learning predictions of sea ice thickness(SIT)have yet to receive ample attention.In this study,two data-driven deep learning(DL)models are built based on the ConvLSTM and fully convolutional U-net(FC-Unet)algorithms and trained using CMIP6 historical simulations for transfer learning and fine-tuned using reanalysis/observations.These models enable monthly predictions of Arctic SIT without considering the complex physical processes involved.Through comprehensive assessments of prediction skills by season and region,the results suggest that using a broader set of CMIP6 data for transfer learning,as well as incorporating multiple climate variables as predictors,contribute to better prediction results,although both DL models can effectively predict the spatiotemporal features of SIT anomalies.Regarding the predicted SIT anomalies of the FC-Unet model,the spatial correlations with reanalysis reach an average level of 89%over all months,while the temporal anomaly correlation coefficients are close to unity in most cases.The models also demonstrate robust performances in predicting SIT and SIE during extreme events.The effectiveness and reliability of the proposed deep transfer learning models in predicting Arctic SIT can facilitate more accurate pan-Arctic predictions,aiding climate change research and real-time business applications.
基金the National Natural Science Foundation of China(Grant Nos.42377164 and 52079062)the Interdisciplinary Innovation Fund of Natural Science,Nanchang University(Grant No.9167-28220007-YB2107).
文摘The accuracy of landslide susceptibility prediction(LSP)mainly depends on the precision of the landslide spatial position.However,the spatial position error of landslide survey is inevitable,resulting in considerable uncertainties in LSP modeling.To overcome this drawback,this study explores the influence of positional errors of landslide spatial position on LSP uncertainties,and then innovatively proposes a semi-supervised machine learning model to reduce the landslide spatial position error.This paper collected 16 environmental factors and 337 landslides with accurate spatial positions taking Shangyou County of China as an example.The 30e110 m error-based multilayer perceptron(MLP)and random forest(RF)models for LSP are established by randomly offsetting the original landslide by 30,50,70,90 and 110 m.The LSP uncertainties are analyzed by the LSP accuracy and distribution characteristics.Finally,a semi-supervised model is proposed to relieve the LSP uncertainties.Results show that:(1)The LSP accuracies of error-based RF/MLP models decrease with the increase of landslide position errors,and are lower than those of original data-based models;(2)70 m error-based models can still reflect the overall distribution characteristics of landslide susceptibility indices,thus original landslides with certain position errors are acceptable for LSP;(3)Semi-supervised machine learning model can efficiently reduce the landslide position errors and thus improve the LSP accuracies.