Using series of daily average temperature observations over the period of 1961-1999 of 701 meteorological stations in China, and simulated results of 20 global climate models (such as BCCR_BCM2.0, CGCM3T47) during t...Using series of daily average temperature observations over the period of 1961-1999 of 701 meteorological stations in China, and simulated results of 20 global climate models (such as BCCR_BCM2.0, CGCM3T47) during the same period as the observation, we validate and analyze the simulated results of the models by using three factor statistical method, achieve the results of mul- ti-model ensemble, test and verify the results of multi-model ensemble by using the observation data during the period of 1991-1999. Finally, we analyze changes of the annual mean temperature result of multi-mode ensemble prediction for the period of 2011-2040 under the emission scenarios A2, A1B and B 1. Analyzed results show that: (1) Global climate models can repro- duce Chinese regional spatial distribution of annual mean temperature, especially in low latitudes and eastern China. (2) With the factor of the trend of annual mean temperature changes in reference period, there is an obvious bias between the model and the observation. (3) Testing the result of multi-model ensemble during the period of 1991-1999, we can simulate the trend of temper- ature increase. Compared to observation, the result of different weighing multi-model ensemble prediction is better than the same weighing ensemble. (4) For the period of 20ll-2040, the growth of the annual mean temperature in China, which results from multi-mode ensemble prediction, is above 1℃. In the spatial distribution of annual mean temperature, under the emission scenarios of A2, A1B and B 1, the trend of growth in South China region is the smallest, the increment is less than or equals to 0.8℃; the trends in the northwestern region and south of the Qinghai-Tibet Plateau are the largest, the increment is more than 1℃.展开更多
The 21-yr ensemble predictions of model precipitation and circulation in the East Asian and western North Pacific (Asia-Pacific) summer monsoon region (0°-50°N, 100° 150°E) were evaluated in ni...The 21-yr ensemble predictions of model precipitation and circulation in the East Asian and western North Pacific (Asia-Pacific) summer monsoon region (0°-50°N, 100° 150°E) were evaluated in nine different AGCM, used in the Asia-Pacific Economic Cooperation Climate Center (APCC) multi-model ensemble seasonal prediction system. The analysis indicates that the precipitation anomaly patterns of model ensemble predictions are substantially different from the observed counterparts in this region, but the summer monsoon circulations are reasonably predicted. For example, all models can well produce the interannual variability of the western North Pacific monsoon index (WNPMI) defined by 850 hPa winds, but they failed to predict the relationship between WNPMI and precipitation anomalies. The interannual variability of the 500 hPa geopotential height (GPH) can be well predicted by the models in contrast to precipitation anomalies. On the basis of such model performances and the relationship between the interannual variations of 500 hPa GPH and precipitation anomalies, we developed a statistical scheme used to downscale the summer monsoon precipitation anomaly on the basis of EOF and singular value decomposition (SVD). In this scheme, the three leading EOF modes of 500 hPa GPH anomaly fields predicted by the models are firstly corrected by the linear regression between the principal components in each model and observation, respectively. Then, the corrected model GPH is chosen as the predictor to downscale the precipitation anomaly field, which is assembled by the forecasted expansion coefficients of model 500 hPa GPH and the three leading SVD modes of observed precipitation anomaly corresponding to the prediction of model 500 hPa GPH during a 19-year training period. The cross-validated forecasts suggest that this downscaling scheme may have a potential to improve the forecast skill of the precipitation anomaly in the South China Sea, western North Pacific and the East Asia Pacific regions, where the anomaly correlation coefficient (ACC) has been improved by 0.14, corresponding to the reduced RMSE of 10.4% in the conventional multi-model ensemble (MME) forecast.展开更多
Seasonal prediction of summer rainfall over the Yangtze River valley(YRV) is valuable for agricultural and industrial production and freshwater resource management in China, but remains a major challenge. Earlier mu...Seasonal prediction of summer rainfall over the Yangtze River valley(YRV) is valuable for agricultural and industrial production and freshwater resource management in China, but remains a major challenge. Earlier multi-model ensemble(MME) prediction schemes for summer rainfall over China focus on single-value prediction, which cannot provide the necessary uncertainty information, while commonly-used ensemble schemes for probability density function(PDF) prediction are not adapted to YRV summer rainfall prediction. In the present study, an MME PDF prediction scheme is proposed based on the ENSEMBLES hindcasts. It is similar to the earlier Bayesian ensemble prediction scheme, but with optimization of ensemble members and a revision of the variance modeling of the likelihood function. The optimized ensemble members are regressed YRV summer rainfall with factors selected from model outputs of synchronous 500-h Pa geopotential height as predictors. The revised variance modeling of the likelihood function is a simple linear regression with ensemble spread as the predictor. The cross-validation skill of 1960–2002 YRV summer rainfall prediction shows that the new scheme produces a skillful PDF prediction, and is much better-calibrated, sharper, and more accurate than the earlier Bayesian ensemble and raw ensemble.展开更多
A Bayesian probabilistic prediction scheme of the Yangtze River Valley (YRV) summer rainfall is proposed to combine forecast information from multi-model ensemble dataset provided by ENSEMBLES project.Due to the low f...A Bayesian probabilistic prediction scheme of the Yangtze River Valley (YRV) summer rainfall is proposed to combine forecast information from multi-model ensemble dataset provided by ENSEMBLES project.Due to the low forecast skill of rainfall in dynamic models,the time series of regressed YRV summer rainfall are selected as ensemble members in the new scheme,instead of commonly-used YRV summer rainfall simulated by models.Each time series of regressed YRV summer rainfall is derived from a simple linear regression.The predictor in each simple linear regression is the skillfully simulated circulation or surface temperature factor which is highly linear with the observed YRV summer rainfall in the training set.The high correlation between the ensemble mean of these regressed YRV summer rainfall and observation benefit extracting more sample information from the ensemble system.The results show that the cross-validated skill of the new scheme over the period of 1960 to 2002 is much higher than equally-weighted ensemble,multiple linear regression,and Bayesian ensemble with simulated YRV summer rainfall as ensemble members.In addition,the new scheme is also more skillful than reference forecasts (random forecast at a 0.01 significance level for ensemble mean and climatology forecast for probability density function).展开更多
Dissolved oxygen(DO)is an important indicator of aquaculture,and its accurate forecasting can effectively improve the quality of aquatic products.In this paper,a new DO hybrid forecasting model is proposed that includ...Dissolved oxygen(DO)is an important indicator of aquaculture,and its accurate forecasting can effectively improve the quality of aquatic products.In this paper,a new DO hybrid forecasting model is proposed that includes three stages:multi-factor analysis,adaptive decomposition,and an optimizationbased ensemble.First,considering the complex factors affecting DO,the grey relational(GR)degree method is used to screen out the environmental factors most closely related to DO.The consideration of multiple factors makes model fusion more effective.Second,the series of DO,water temperature,salinity,and oxygen saturation are decomposed adaptively into sub-series by means of the empirical wavelet transform(EWT)method.Then,five benchmark models are utilized to forecast the sub-series of EWT decomposition.The ensemble weights of these five sub-forecasting models are calculated by particle swarm optimization and gravitational search algorithm(PSOGSA).Finally,a multi-factor ensemble model for DO is obtained by weighted allocation.The performance of the proposed model is verified by timeseries data collected by the pacific islands ocean observing system(PacIOOS)from the WQB04 station at Hilo.The evaluation indicators involved in the experiment include the Nash–Sutcliffe efficiency(NSE),Kling–Gupta efficiency(KGE),mean absolute percent error(MAPE),standard deviation of error(SDE),and coefficient of determination(R^(2)).Example analysis demonstrates that:①The proposed model can obtain excellent DO forecasting results;②the proposed model is superior to other comparison models;and③the forecasting model can be used to analyze the trend of DO and enable managers to make better management decisions.展开更多
This study investigates multi-model ensemble forecasts of track and intensity of tropical cyclones over the western Pacific, based on forecast outputs from the China Meteorological Administration, European Centre for ...This study investigates multi-model ensemble forecasts of track and intensity of tropical cyclones over the western Pacific, based on forecast outputs from the China Meteorological Administration, European Centre for Medium-Range Weather Forecasts, Japan Meteorological Agency and National Centers for Environmental Prediction in the THORPEX Interactive Grand Global Ensemble(TIGGE) datasets. The multi-model ensemble schemes, namely the bias-removed ensemble mean(BREM) and superensemble(SUP), are compared with the ensemble mean(EMN) and single-model forecasts. Moreover, a new model bias estimation scheme is investigated and applied to the BREM and SUP schemes. The results showed that, compared with single-model forecasts and EMN, the multi-model ensembles of the BREM and SUP schemes can have smaller errors in most cases. However, there were also circumstances where BREM was less skillful than EMN, indicating that using a time-averaged error as model bias is not optimal. A new model bias estimation scheme of the biweight mean is introduced. Through minimizing the negative influence of singular errors, this scheme can obtain a more accurate model bias estimation and improve the BREM forecast skill. The application of the biweight mean in the bias calculation of SUP also resulted in improved skill. The results indicate that the modification of multi-model ensemble schemes through this bias estimation method is feasible.展开更多
Climate change adaptation and relevant policy-making need reliable projections of future climate.Methods based on multi-model ensemble are generally considered as the most efficient way to achieve the goal.However,the...Climate change adaptation and relevant policy-making need reliable projections of future climate.Methods based on multi-model ensemble are generally considered as the most efficient way to achieve the goal.However,their efficiency varies and inter-comparison is a challenging task,as they use a variety of target variables,geographic regions,time periods,or model pools.Here,we construct and use a consistent framework to evaluate the performance of five ensemble-processing methods,i.e.,multimodel ensemble mean(MME),rank-based weighting(RANK),reliability ensemble averaging(REA),climate model weighting by independence and performance(ClimWIP),and Bayesian model averaging(BMA).We investigate the annual mean temperature(Tav)and total precipitation(Prcptot)changes(relative to 1995–2014)over China and its seven subregions at 1.5 and 2℃warming levels(relative to pre-industrial).All ensemble-processing methods perform better than MME,and achieve generally consistent results in terms of median values.But they show different results in terms of inter-model spread,served as a measure of uncertainty,and signal-to-noise ratio(SNR).ClimWIP is the most optimal method with its good performance in simulating current climate and in providing credible future projections.The uncertainty,measured by the range of 10th–90th percentiles,is reduced by about 30%for Tav,and 15%for Prcptot in China,with a certain variation among subregions.Based on ClimWIP,and averaged over whole China under 1.5/2℃global warming levels,Tav increases by about 1.1/1.8℃(relative to 1995–2014),while Prcptot increases by about 5.4%/11.2%,respectively.Reliability of projections is found dependent on investigated regions and indices.The projection for Tav is credible across all regions,as its SNR is generally larger than 2,while the SNR is lower than 1 for Prcptot over most regions under 1.5℃warming.The largest warming is found in northeastern China,with increase of 1.3(0.6–1.7)/2.0(1.4–2.6)℃(ensemble’s median and range of the 10th–90th percentiles)under 1.5/2℃warming,followed by northern and northwestern China.The smallest but the most robust warming is in southwestern China,with values exceeding 0.9(0.6–1.1)/1.5(1.1–1.7)℃.The most robust projection and largest increase is achieved in northwestern China for Prcptot,with increase of 9.1%(–1.6–24.7%)/17.9%(0.5–36.4%)under 1.5/2℃warming.Followed by northern China,where the increase is 6.0%(–2.6–17.8%)/11.8%(2.4–25.1%),respectively.The precipitation projection is of large uncertainty in southwestern China,even with uncertain sign of variation.For the additional half-degree warming,Tav increases more than 0.5℃throughout China.Almost all regions witness an increase of Prcptot,with the largest increase in northwestern China.展开更多
Central Asia(CA)is highly sensitive and vulnerable to changes in precipitation due to global warming,so the projection of precipitation extremes is essential for local climate risk assessment.However,global and region...Central Asia(CA)is highly sensitive and vulnerable to changes in precipitation due to global warming,so the projection of precipitation extremes is essential for local climate risk assessment.However,global and regional climate models often fail to reproduce the observed daily precipitation distribution and hence extremes,especially in areas with complex terrain.In this study,we proposed a statistical downscaling(SD)model based on quantile delta mapping to assess and project eight precipitation indices at 73 meteorological stations across CA driven by ERA5 reanalysis data and simulations of 10 global climate models(GCMs)for present and future(2081-2100)periods under two shared socioeconomic pathways(SSP245 and SSP585).The reanalysis data and raw GCM outputs clearly underestimate mean precipitation intensity(SDII)and maximum 1-day precipitation(RX1DAY)and overestimate the number of wet days(R1MM)and maximum consecutive wet days(CWD)at stations across CA.However,the SD model effectively reduces the biases and RMSEs of the modeled precipitation indices compared to the observations.Also it effectively adjusts the distributional biases in the downscaled daily precipitation and indices at the stations across CA.In addition,it is skilled in capturing the spatial patterns of the observed precipitation indices.Obviously,SDII and RX1DAY are improved by the SD model,especially in the southeastern mountainous area.Under the intermediate scenario(SSP245),our SD multi-model ensemble projections project significant and robust increases in SDII and total extreme precipitation(R95PTOT)of 0.5 mm d^(-1) and 19.7 mm,respectively,over CA at the end of the 21st century(2081-2100)compared to the present values(1995-2014).More pronounced increases in indices R95PTOT,SDII,number of very wet days(R10MM),and RX1DAY are projected under the higher emission scenario(SSP585),particularly in the mountainous southeastern region.The SD model suggested that SDII and RX1DAY will likely rise more rapidly than those projected by previous model simulations over CA during the period 2081-2100.The SD projection of the possible future changes in precipitation and extremes improves the knowledge base for local risk management and climate change adaptation in CA.展开更多
Dear Editor,This letter presents a novel process monitoring model based on ensemble structure analysis(ESA).The ESA model takes advantage of principal component analysis(PCA),locality preserving projections(LPP),and m...Dear Editor,This letter presents a novel process monitoring model based on ensemble structure analysis(ESA).The ESA model takes advantage of principal component analysis(PCA),locality preserving projections(LPP),and multi-manifold projections(MMP)models,and then combines the multiple solutions within an ensemble result through Bayesian inference.In the developed ESA model,different structure features of the given dataset are taken into account simultaneously,the suitability and reliability of the ESA-based monitoring model are then illustrated through comparison.Introduction:The requirement for ensuring safe operation and improving process efficiency has led to increased research activity in the field of process monitoring.展开更多
Identifying rare patterns for medical diagnosis is a challenging task due to heterogeneity and the volume of data.Data summarization can create a concise version of the original data that can be used for effective dia...Identifying rare patterns for medical diagnosis is a challenging task due to heterogeneity and the volume of data.Data summarization can create a concise version of the original data that can be used for effective diagnosis.In this paper,we propose an ensemble summarization method that combines clustering and sampling to create a summary of the original data to ensure the inclusion of rare patterns.To the best of our knowledge,there has been no such technique available to augment the performance of anomaly detection techniques and simultaneously increase the efficiency of medical diagnosis.The performance of popular anomaly detection algorithms increases significantly in terms of accuracy and computational complexity when the summaries are used.Therefore,the medical diagnosis becomes more effective,and our experimental results reflect that the combination of the proposed summarization scheme and all underlying algorithms used in this paper outperforms the most popular anomaly detection techniques.展开更多
We present a formulation of the single-trajectory entropy using the trajectories ensemble. The single-trajectory entropy is affected by its surrounding trajectories via the distribution function. The single-trajectory...We present a formulation of the single-trajectory entropy using the trajectories ensemble. The single-trajectory entropy is affected by its surrounding trajectories via the distribution function. The single-trajectory entropies are studied in two typical potentials, i.e., harmonic potential and double-well potential, and in viscous environment by interacting trajectory method. The results of the trajectory methods are in agreement well with the numerical methods(Monte Carlo simulation and difference equation). The single-trajectory entropies increasing(decreasing) could be caused by absorption(emission) heat from(to) the thermal environment. Also, some interesting trajectories, which correspond to the rare evens in the processes, are demonstrated.展开更多
A redundant-subspace-weighting(RSW)-based approach is proposed to enhance the frequency stability on a time scale of a clock ensemble.In this method,multiple overlapping subspaces are constructed in the clock ensemble...A redundant-subspace-weighting(RSW)-based approach is proposed to enhance the frequency stability on a time scale of a clock ensemble.In this method,multiple overlapping subspaces are constructed in the clock ensemble,and the weight of each clock in this ensemble is defined by using the spatial covariance matrix.The superimposition average of covariances in different subspaces reduces the correlations between clocks in the same laboratory to some extent.After optimizing the parameters of this weighting procedure,the frequency stabilities of virtual clock ensembles are significantly improved in most cases.展开更多
Data security assurance is crucial due to the increasing prevalence of cloud computing and its widespread use across different industries,especially in light of the growing number of cybersecurity threats.A major and ...Data security assurance is crucial due to the increasing prevalence of cloud computing and its widespread use across different industries,especially in light of the growing number of cybersecurity threats.A major and everpresent threat is Ransomware-as-a-Service(RaaS)assaults,which enable even individuals with minimal technical knowledge to conduct ransomware operations.This study provides a new approach for RaaS attack detection which uses an ensemble of deep learning models.For this purpose,the network intrusion detection dataset“UNSWNB15”from the Intelligent Security Group of the University of New South Wales,Australia is analyzed.In the initial phase,the rectified linear unit-,scaled exponential linear unit-,and exponential linear unit-based three separate Multi-Layer Perceptron(MLP)models are developed.Later,using the combined predictive power of these three MLPs,the RansoDetect Fusion ensemble model is introduced in the suggested methodology.The proposed ensemble technique outperforms previous studieswith impressive performance metrics results,including 98.79%accuracy and recall,98.85%precision,and 98.80%F1-score.The empirical results of this study validate the ensemble model’s ability to improve cybersecurity defenses by showing that it outperforms individual MLPmodels.In expanding the field of cybersecurity strategy,this research highlights the significance of combined deep learning models in strengthening intrusion detection systems against sophisticated cyber threats.展开更多
With the widespread use of machine learning(ML)technology,the operational efficiency and responsiveness of power grids have been significantly enhanced,allowing smart grids to achieve high levels of automation and int...With the widespread use of machine learning(ML)technology,the operational efficiency and responsiveness of power grids have been significantly enhanced,allowing smart grids to achieve high levels of automation and intelligence.However,tree ensemble models commonly used in smart grids are vulnerable to adversarial attacks,making it urgent to enhance their robustness.To address this,we propose a robustness enhancement method that incorporates physical constraints into the node-splitting decisions of tree ensembles.Our algorithm improves robustness by developing a dataset of adversarial examples that comply with physical laws,ensuring training data accurately reflects possible attack scenarios while adhering to physical rules.In our experiments,the proposed method increased robustness against adversarial attacks by 100%when applied to real grid data under physical constraints.These results highlight the advantages of our method in maintaining efficient and secure operation of smart grids under adversarial conditions.展开更多
Ensemble prediction is widely used to represent the uncertainty of single deterministic Numerical Weather Prediction(NWP) caused by errors in initial conditions(ICs). The traditional Singular Vector(SV) initial pertur...Ensemble prediction is widely used to represent the uncertainty of single deterministic Numerical Weather Prediction(NWP) caused by errors in initial conditions(ICs). The traditional Singular Vector(SV) initial perturbation method tends only to capture synoptic scale initial uncertainty rather than mesoscale uncertainty in global ensemble prediction. To address this issue, a multiscale SV initial perturbation method based on the China Meteorological Administration Global Ensemble Prediction System(CMA-GEPS) is proposed to quantify multiscale initial uncertainty. The multiscale SV initial perturbation approach entails calculating multiscale SVs at different resolutions with multiple linearized physical processes to capture fast-growing perturbations from mesoscale to synoptic scale in target areas and combining these SVs by using a Gaussian sampling method with amplitude coefficients to generate initial perturbations. Following that, the energy norm,energy spectrum, and structure of multiscale SVs and their impact on GEPS are analyzed based on a batch experiment in different seasons. The results show that the multiscale SV initial perturbations can possess more energy and capture more mesoscale uncertainties than the traditional single-SV method. Meanwhile, multiscale SV initial perturbations can reflect the strongest dynamical instability in target areas. Their performances in global ensemble prediction when compared to single-scale SVs are shown to(i) improve the relationship between the ensemble spread and the root-mean-square error and(ii) provide a better probability forecast skill for atmospheric circulation during the late forecast period and for short-to medium-range precipitation. This study provides scientific evidence and application foundations for the design and development of a multiscale SV initial perturbation method for the GEPS.展开更多
The software development process mostly depends on accurately identifying both essential and optional features.Initially,user needs are typically expressed in free-form language,requiring significant time and human re...The software development process mostly depends on accurately identifying both essential and optional features.Initially,user needs are typically expressed in free-form language,requiring significant time and human resources to translate these into clear functional and non-functional requirements.To address this challenge,various machine learning(ML)methods have been explored to automate the understanding of these requirements,aiming to reduce time and human effort.However,existing techniques often struggle with complex instructions and large-scale projects.In our study,we introduce an innovative approach known as the Functional and Non-functional Requirements Classifier(FNRC).By combining the traditional random forest algorithm with the Accuracy Sliding Window(ASW)technique,we develop optimal sub-ensembles that surpass the initial classifier’s accuracy while using fewer trees.Experimental results demonstrate that our FNRC methodology performs robustly across different datasets,achieving a balanced Precision of 75%on the PROMISE dataset and an impressive Recall of 85%on the CCHIT dataset.Both datasets consistently maintain an F-measure around 64%,highlighting FNRC’s ability to effectively balance precision and recall in diverse scenarios.These findings contribute to more accurate and efficient software development processes,increasing the probability of achieving successful project outcomes.展开更多
Geotechnical engineering data are usually small-sample and high-dimensional,which brings a lot of challenges in predictive modeling.This paper uses a typical high-dimensional and small-sample swell pressure(P_(s))data...Geotechnical engineering data are usually small-sample and high-dimensional,which brings a lot of challenges in predictive modeling.This paper uses a typical high-dimensional and small-sample swell pressure(P_(s))dataset to explore the possibility of using multi-algorithm hybrid ensemble and dimensionality reduction methods to mitigate the uncertainty of soil parameter prediction.Based on six machine learning(ML)algorithms,the base learner pool is constructed,and four ensemble methods,Stacking(SG),Blending(BG),Voting regression(VR),and Feature weight linear stacking(FWL),are used for the multi-algorithm ensemble.Furthermore,the importance of permutation is used for feature dimensionality reduction to mitigate the impact of weakly correlated variables on predictive modeling.The results show that the proposed methods are superior to traditional prediction models and base ML models,where FWL is more suitable for modeling with small-sample datasets,and dimensionality reduction can simplify the data structure and reduce the adverse impact of the small-sample effect,which points the way to feature selection for predictive modeling.Based on the ensemble methods,the feature importance of the five primary factors affecting P_(s) is the maximum dry density(31.145%),clay fraction(15.876%),swell percent(15.289%),plasticity index(14%),and optimum moisture content(13.69%),the influence of input parameters on P_(s) is also investigated,in line with the findings of the existing literature.展开更多
基金supported by Adapting Climate Change in China (ACCC) Project:Climate Science (Project No.ACCC/003)
文摘Using series of daily average temperature observations over the period of 1961-1999 of 701 meteorological stations in China, and simulated results of 20 global climate models (such as BCCR_BCM2.0, CGCM3T47) during the same period as the observation, we validate and analyze the simulated results of the models by using three factor statistical method, achieve the results of mul- ti-model ensemble, test and verify the results of multi-model ensemble by using the observation data during the period of 1991-1999. Finally, we analyze changes of the annual mean temperature result of multi-mode ensemble prediction for the period of 2011-2040 under the emission scenarios A2, A1B and B 1. Analyzed results show that: (1) Global climate models can repro- duce Chinese regional spatial distribution of annual mean temperature, especially in low latitudes and eastern China. (2) With the factor of the trend of annual mean temperature changes in reference period, there is an obvious bias between the model and the observation. (3) Testing the result of multi-model ensemble during the period of 1991-1999, we can simulate the trend of temper- ature increase. Compared to observation, the result of different weighing multi-model ensemble prediction is better than the same weighing ensemble. (4) For the period of 20ll-2040, the growth of the annual mean temperature in China, which results from multi-mode ensemble prediction, is above 1℃. In the spatial distribution of annual mean temperature, under the emission scenarios of A2, A1B and B 1, the trend of growth in South China region is the smallest, the increment is less than or equals to 0.8℃; the trends in the northwestern region and south of the Qinghai-Tibet Plateau are the largest, the increment is more than 1℃.
基金The National Nat-ural Science Foundation of China (NSFC), Grant Nos.90711003, 40375014the program of GYHY200706005, and the APCC Visiting Scientist Program jointly supportedthis work.
文摘The 21-yr ensemble predictions of model precipitation and circulation in the East Asian and western North Pacific (Asia-Pacific) summer monsoon region (0°-50°N, 100° 150°E) were evaluated in nine different AGCM, used in the Asia-Pacific Economic Cooperation Climate Center (APCC) multi-model ensemble seasonal prediction system. The analysis indicates that the precipitation anomaly patterns of model ensemble predictions are substantially different from the observed counterparts in this region, but the summer monsoon circulations are reasonably predicted. For example, all models can well produce the interannual variability of the western North Pacific monsoon index (WNPMI) defined by 850 hPa winds, but they failed to predict the relationship between WNPMI and precipitation anomalies. The interannual variability of the 500 hPa geopotential height (GPH) can be well predicted by the models in contrast to precipitation anomalies. On the basis of such model performances and the relationship between the interannual variations of 500 hPa GPH and precipitation anomalies, we developed a statistical scheme used to downscale the summer monsoon precipitation anomaly on the basis of EOF and singular value decomposition (SVD). In this scheme, the three leading EOF modes of 500 hPa GPH anomaly fields predicted by the models are firstly corrected by the linear regression between the principal components in each model and observation, respectively. Then, the corrected model GPH is chosen as the predictor to downscale the precipitation anomaly field, which is assembled by the forecasted expansion coefficients of model 500 hPa GPH and the three leading SVD modes of observed precipitation anomaly corresponding to the prediction of model 500 hPa GPH during a 19-year training period. The cross-validated forecasts suggest that this downscaling scheme may have a potential to improve the forecast skill of the precipitation anomaly in the South China Sea, western North Pacific and the East Asia Pacific regions, where the anomaly correlation coefficient (ACC) has been improved by 0.14, corresponding to the reduced RMSE of 10.4% in the conventional multi-model ensemble (MME) forecast.
基金co-supported by the National Natural Science Foundation (Grant Nos. 41005052 and 41375086)the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA05110201)the National Basic Research Program of China (Grant No. 2010CB950403)
文摘Seasonal prediction of summer rainfall over the Yangtze River valley(YRV) is valuable for agricultural and industrial production and freshwater resource management in China, but remains a major challenge. Earlier multi-model ensemble(MME) prediction schemes for summer rainfall over China focus on single-value prediction, which cannot provide the necessary uncertainty information, while commonly-used ensemble schemes for probability density function(PDF) prediction are not adapted to YRV summer rainfall prediction. In the present study, an MME PDF prediction scheme is proposed based on the ENSEMBLES hindcasts. It is similar to the earlier Bayesian ensemble prediction scheme, but with optimization of ensemble members and a revision of the variance modeling of the likelihood function. The optimized ensemble members are regressed YRV summer rainfall with factors selected from model outputs of synchronous 500-h Pa geopotential height as predictors. The revised variance modeling of the likelihood function is a simple linear regression with ensemble spread as the predictor. The cross-validation skill of 1960–2002 YRV summer rainfall prediction shows that the new scheme produces a skillful PDF prediction, and is much better-calibrated, sharper, and more accurate than the earlier Bayesian ensemble and raw ensemble.
基金supported by the Knowledge Innovation Key Project of Chinese Academy of Sciences (CAS) under Grant No.KZCX2-YW-217Doctor Research Startup Project at the Institute of Atmospheric Physics,the CAS under Grant No.7-098300
文摘A Bayesian probabilistic prediction scheme of the Yangtze River Valley (YRV) summer rainfall is proposed to combine forecast information from multi-model ensemble dataset provided by ENSEMBLES project.Due to the low forecast skill of rainfall in dynamic models,the time series of regressed YRV summer rainfall are selected as ensemble members in the new scheme,instead of commonly-used YRV summer rainfall simulated by models.Each time series of regressed YRV summer rainfall is derived from a simple linear regression.The predictor in each simple linear regression is the skillfully simulated circulation or surface temperature factor which is highly linear with the observed YRV summer rainfall in the training set.The high correlation between the ensemble mean of these regressed YRV summer rainfall and observation benefit extracting more sample information from the ensemble system.The results show that the cross-validated skill of the new scheme over the period of 1960 to 2002 is much higher than equally-weighted ensemble,multiple linear regression,and Bayesian ensemble with simulated YRV summer rainfall as ensemble members.In addition,the new scheme is also more skillful than reference forecasts (random forecast at a 0.01 significance level for ensemble mean and climatology forecast for probability density function).
基金the National Natural Science Foundation of China(61873283)the Changsha Science&Technology Project(KQ1707017)the innovation-driven project of the Central South University(2019CX005).
文摘Dissolved oxygen(DO)is an important indicator of aquaculture,and its accurate forecasting can effectively improve the quality of aquatic products.In this paper,a new DO hybrid forecasting model is proposed that includes three stages:multi-factor analysis,adaptive decomposition,and an optimizationbased ensemble.First,considering the complex factors affecting DO,the grey relational(GR)degree method is used to screen out the environmental factors most closely related to DO.The consideration of multiple factors makes model fusion more effective.Second,the series of DO,water temperature,salinity,and oxygen saturation are decomposed adaptively into sub-series by means of the empirical wavelet transform(EWT)method.Then,five benchmark models are utilized to forecast the sub-series of EWT decomposition.The ensemble weights of these five sub-forecasting models are calculated by particle swarm optimization and gravitational search algorithm(PSOGSA).Finally,a multi-factor ensemble model for DO is obtained by weighted allocation.The performance of the proposed model is verified by timeseries data collected by the pacific islands ocean observing system(PacIOOS)from the WQB04 station at Hilo.The evaluation indicators involved in the experiment include the Nash–Sutcliffe efficiency(NSE),Kling–Gupta efficiency(KGE),mean absolute percent error(MAPE),standard deviation of error(SDE),and coefficient of determination(R^(2)).Example analysis demonstrates that:①The proposed model can obtain excellent DO forecasting results;②the proposed model is superior to other comparison models;and③the forecasting model can be used to analyze the trend of DO and enable managers to make better management decisions.
基金Special Research Program for Public Welfare(Meteorology)of China(GYHY200906009,GYHY201006015,GYHY200906007)National Natural Science Foundation of China(4107503541475044)
文摘This study investigates multi-model ensemble forecasts of track and intensity of tropical cyclones over the western Pacific, based on forecast outputs from the China Meteorological Administration, European Centre for Medium-Range Weather Forecasts, Japan Meteorological Agency and National Centers for Environmental Prediction in the THORPEX Interactive Grand Global Ensemble(TIGGE) datasets. The multi-model ensemble schemes, namely the bias-removed ensemble mean(BREM) and superensemble(SUP), are compared with the ensemble mean(EMN) and single-model forecasts. Moreover, a new model bias estimation scheme is investigated and applied to the BREM and SUP schemes. The results showed that, compared with single-model forecasts and EMN, the multi-model ensembles of the BREM and SUP schemes can have smaller errors in most cases. However, there were also circumstances where BREM was less skillful than EMN, indicating that using a time-averaged error as model bias is not optimal. A new model bias estimation scheme of the biweight mean is introduced. Through minimizing the negative influence of singular errors, this scheme can obtain a more accurate model bias estimation and improve the BREM forecast skill. The application of the biweight mean in the bias calculation of SUP also resulted in improved skill. The results indicate that the modification of multi-model ensemble schemes through this bias estimation method is feasible.
基金supported by the National Natural Science Foundation of China(Grant No.42275184)the National Key Research and Development Program of China(Grant No.2017YFA0603804)the Postgraduate Research and Practice Innovation Program of Government of Jiangsu Province(Grant No.KYCX22_1135).
文摘Climate change adaptation and relevant policy-making need reliable projections of future climate.Methods based on multi-model ensemble are generally considered as the most efficient way to achieve the goal.However,their efficiency varies and inter-comparison is a challenging task,as they use a variety of target variables,geographic regions,time periods,or model pools.Here,we construct and use a consistent framework to evaluate the performance of five ensemble-processing methods,i.e.,multimodel ensemble mean(MME),rank-based weighting(RANK),reliability ensemble averaging(REA),climate model weighting by independence and performance(ClimWIP),and Bayesian model averaging(BMA).We investigate the annual mean temperature(Tav)and total precipitation(Prcptot)changes(relative to 1995–2014)over China and its seven subregions at 1.5 and 2℃warming levels(relative to pre-industrial).All ensemble-processing methods perform better than MME,and achieve generally consistent results in terms of median values.But they show different results in terms of inter-model spread,served as a measure of uncertainty,and signal-to-noise ratio(SNR).ClimWIP is the most optimal method with its good performance in simulating current climate and in providing credible future projections.The uncertainty,measured by the range of 10th–90th percentiles,is reduced by about 30%for Tav,and 15%for Prcptot in China,with a certain variation among subregions.Based on ClimWIP,and averaged over whole China under 1.5/2℃global warming levels,Tav increases by about 1.1/1.8℃(relative to 1995–2014),while Prcptot increases by about 5.4%/11.2%,respectively.Reliability of projections is found dependent on investigated regions and indices.The projection for Tav is credible across all regions,as its SNR is generally larger than 2,while the SNR is lower than 1 for Prcptot over most regions under 1.5℃warming.The largest warming is found in northeastern China,with increase of 1.3(0.6–1.7)/2.0(1.4–2.6)℃(ensemble’s median and range of the 10th–90th percentiles)under 1.5/2℃warming,followed by northern and northwestern China.The smallest but the most robust warming is in southwestern China,with values exceeding 0.9(0.6–1.1)/1.5(1.1–1.7)℃.The most robust projection and largest increase is achieved in northwestern China for Prcptot,with increase of 9.1%(–1.6–24.7%)/17.9%(0.5–36.4%)under 1.5/2℃warming.Followed by northern China,where the increase is 6.0%(–2.6–17.8%)/11.8%(2.4–25.1%),respectively.The precipitation projection is of large uncertainty in southwestern China,even with uncertain sign of variation.For the additional half-degree warming,Tav increases more than 0.5℃throughout China.Almost all regions witness an increase of Prcptot,with the largest increase in northwestern China.
基金research was jointly sponsored by the Strategic Priority Research Program of the Chinese Academy of Sciences(XDA20020201 and XDA19030402)the National Natural Science Foundation of China(41775077).
文摘Central Asia(CA)is highly sensitive and vulnerable to changes in precipitation due to global warming,so the projection of precipitation extremes is essential for local climate risk assessment.However,global and regional climate models often fail to reproduce the observed daily precipitation distribution and hence extremes,especially in areas with complex terrain.In this study,we proposed a statistical downscaling(SD)model based on quantile delta mapping to assess and project eight precipitation indices at 73 meteorological stations across CA driven by ERA5 reanalysis data and simulations of 10 global climate models(GCMs)for present and future(2081-2100)periods under two shared socioeconomic pathways(SSP245 and SSP585).The reanalysis data and raw GCM outputs clearly underestimate mean precipitation intensity(SDII)and maximum 1-day precipitation(RX1DAY)and overestimate the number of wet days(R1MM)and maximum consecutive wet days(CWD)at stations across CA.However,the SD model effectively reduces the biases and RMSEs of the modeled precipitation indices compared to the observations.Also it effectively adjusts the distributional biases in the downscaled daily precipitation and indices at the stations across CA.In addition,it is skilled in capturing the spatial patterns of the observed precipitation indices.Obviously,SDII and RX1DAY are improved by the SD model,especially in the southeastern mountainous area.Under the intermediate scenario(SSP245),our SD multi-model ensemble projections project significant and robust increases in SDII and total extreme precipitation(R95PTOT)of 0.5 mm d^(-1) and 19.7 mm,respectively,over CA at the end of the 21st century(2081-2100)compared to the present values(1995-2014).More pronounced increases in indices R95PTOT,SDII,number of very wet days(R10MM),and RX1DAY are projected under the higher emission scenario(SSP585),particularly in the mountainous southeastern region.The SD model suggested that SDII and RX1DAY will likely rise more rapidly than those projected by previous model simulations over CA during the period 2081-2100.The SD projection of the possible future changes in precipitation and extremes improves the knowledge base for local risk management and climate change adaptation in CA.
基金supported by the National Natural Science Foundation of China(61503204)the Natural Science Foundation of Zhejiang Province(Y16F030001)the Nature Science Foundation of Ningbo City(2016A610092).
文摘Dear Editor,This letter presents a novel process monitoring model based on ensemble structure analysis(ESA).The ESA model takes advantage of principal component analysis(PCA),locality preserving projections(LPP),and multi-manifold projections(MMP)models,and then combines the multiple solutions within an ensemble result through Bayesian inference.In the developed ESA model,different structure features of the given dataset are taken into account simultaneously,the suitability and reliability of the ESA-based monitoring model are then illustrated through comparison.Introduction:The requirement for ensuring safe operation and improving process efficiency has led to increased research activity in the field of process monitoring.
文摘Identifying rare patterns for medical diagnosis is a challenging task due to heterogeneity and the volume of data.Data summarization can create a concise version of the original data that can be used for effective diagnosis.In this paper,we propose an ensemble summarization method that combines clustering and sampling to create a summary of the original data to ensure the inclusion of rare patterns.To the best of our knowledge,there has been no such technique available to augment the performance of anomaly detection techniques and simultaneously increase the efficiency of medical diagnosis.The performance of popular anomaly detection algorithms increases significantly in terms of accuracy and computational complexity when the summaries are used.Therefore,the medical diagnosis becomes more effective,and our experimental results reflect that the combination of the proposed summarization scheme and all underlying algorithms used in this paper outperforms the most popular anomaly detection techniques.
基金supported by the National Natural Science Foundation of China (Grant No. 12234013)the Natural Science Foundation of Shandong Province (Grant No. ZR2021LLZ009)。
文摘We present a formulation of the single-trajectory entropy using the trajectories ensemble. The single-trajectory entropy is affected by its surrounding trajectories via the distribution function. The single-trajectory entropies are studied in two typical potentials, i.e., harmonic potential and double-well potential, and in viscous environment by interacting trajectory method. The results of the trajectory methods are in agreement well with the numerical methods(Monte Carlo simulation and difference equation). The single-trajectory entropies increasing(decreasing) could be caused by absorption(emission) heat from(to) the thermal environment. Also, some interesting trajectories, which correspond to the rare evens in the processes, are demonstrated.
基金Project supported by the National Key Research and Development Program of China (Grant No.2021YFB3900701)the Science and Technology Plan Project of the State Administration for Market Regulation of China (Grant No.2023MK178)the National Natural Science Foundation of China (Grant No.42227802)。
文摘A redundant-subspace-weighting(RSW)-based approach is proposed to enhance the frequency stability on a time scale of a clock ensemble.In this method,multiple overlapping subspaces are constructed in the clock ensemble,and the weight of each clock in this ensemble is defined by using the spatial covariance matrix.The superimposition average of covariances in different subspaces reduces the correlations between clocks in the same laboratory to some extent.After optimizing the parameters of this weighting procedure,the frequency stabilities of virtual clock ensembles are significantly improved in most cases.
基金the Deanship of Scientific Research,Najran University,Kingdom of Saudi Arabia,for funding this work under the Research Groups Funding Program Grant Code Number(NU/RG/SERC/12/43).
文摘Data security assurance is crucial due to the increasing prevalence of cloud computing and its widespread use across different industries,especially in light of the growing number of cybersecurity threats.A major and everpresent threat is Ransomware-as-a-Service(RaaS)assaults,which enable even individuals with minimal technical knowledge to conduct ransomware operations.This study provides a new approach for RaaS attack detection which uses an ensemble of deep learning models.For this purpose,the network intrusion detection dataset“UNSWNB15”from the Intelligent Security Group of the University of New South Wales,Australia is analyzed.In the initial phase,the rectified linear unit-,scaled exponential linear unit-,and exponential linear unit-based three separate Multi-Layer Perceptron(MLP)models are developed.Later,using the combined predictive power of these three MLPs,the RansoDetect Fusion ensemble model is introduced in the suggested methodology.The proposed ensemble technique outperforms previous studieswith impressive performance metrics results,including 98.79%accuracy and recall,98.85%precision,and 98.80%F1-score.The empirical results of this study validate the ensemble model’s ability to improve cybersecurity defenses by showing that it outperforms individual MLPmodels.In expanding the field of cybersecurity strategy,this research highlights the significance of combined deep learning models in strengthening intrusion detection systems against sophisticated cyber threats.
基金This work was supported by Natural Science Foundation of China(Nos.62303126,62362008,62066006,authors Zhenyong Zhang and Bin Hu,https://www.nsfc.gov.cn/,accessed on 25 July 2024)Guizhou Provincial Science and Technology Projects(No.ZK[2022]149,author Zhenyong Zhang,https://kjt.guizhou.gov.cn/,accessed on 25 July 2024)+1 种基金Guizhou Provincial Research Project(Youth)forUniversities(No.[2022]104,author Zhenyong Zhang,https://jyt.guizhou.gov.cn/,accessed on 25 July 2024)GZU Cultivation Project of NSFC(No.[2020]80,author Zhenyong Zhang,https://www.gzu.edu.cn/,accessed on 25 July 2024).
文摘With the widespread use of machine learning(ML)technology,the operational efficiency and responsiveness of power grids have been significantly enhanced,allowing smart grids to achieve high levels of automation and intelligence.However,tree ensemble models commonly used in smart grids are vulnerable to adversarial attacks,making it urgent to enhance their robustness.To address this,we propose a robustness enhancement method that incorporates physical constraints into the node-splitting decisions of tree ensembles.Our algorithm improves robustness by developing a dataset of adversarial examples that comply with physical laws,ensuring training data accurately reflects possible attack scenarios while adhering to physical rules.In our experiments,the proposed method increased robustness against adversarial attacks by 100%when applied to real grid data under physical constraints.These results highlight the advantages of our method in maintaining efficient and secure operation of smart grids under adversarial conditions.
基金supported by the Joint Funds of the Chinese National Natural Science Foundation (NSFC)(Grant No.U2242213)the National Key Research and Development (R&D)Program of the Ministry of Science and Technology of China(Grant No. 2021YFC3000902)the National Science Foundation for Young Scholars (Grant No. 42205166)。
文摘Ensemble prediction is widely used to represent the uncertainty of single deterministic Numerical Weather Prediction(NWP) caused by errors in initial conditions(ICs). The traditional Singular Vector(SV) initial perturbation method tends only to capture synoptic scale initial uncertainty rather than mesoscale uncertainty in global ensemble prediction. To address this issue, a multiscale SV initial perturbation method based on the China Meteorological Administration Global Ensemble Prediction System(CMA-GEPS) is proposed to quantify multiscale initial uncertainty. The multiscale SV initial perturbation approach entails calculating multiscale SVs at different resolutions with multiple linearized physical processes to capture fast-growing perturbations from mesoscale to synoptic scale in target areas and combining these SVs by using a Gaussian sampling method with amplitude coefficients to generate initial perturbations. Following that, the energy norm,energy spectrum, and structure of multiscale SVs and their impact on GEPS are analyzed based on a batch experiment in different seasons. The results show that the multiscale SV initial perturbations can possess more energy and capture more mesoscale uncertainties than the traditional single-SV method. Meanwhile, multiscale SV initial perturbations can reflect the strongest dynamical instability in target areas. Their performances in global ensemble prediction when compared to single-scale SVs are shown to(i) improve the relationship between the ensemble spread and the root-mean-square error and(ii) provide a better probability forecast skill for atmospheric circulation during the late forecast period and for short-to medium-range precipitation. This study provides scientific evidence and application foundations for the design and development of a multiscale SV initial perturbation method for the GEPS.
基金This work is supported by EIAS(Emerging Intelligent Autonomous Systems)Data Science Lab,Prince Sultan University,Kingdom of Saudi Arabia,by paying the APC.
文摘The software development process mostly depends on accurately identifying both essential and optional features.Initially,user needs are typically expressed in free-form language,requiring significant time and human resources to translate these into clear functional and non-functional requirements.To address this challenge,various machine learning(ML)methods have been explored to automate the understanding of these requirements,aiming to reduce time and human effort.However,existing techniques often struggle with complex instructions and large-scale projects.In our study,we introduce an innovative approach known as the Functional and Non-functional Requirements Classifier(FNRC).By combining the traditional random forest algorithm with the Accuracy Sliding Window(ASW)technique,we develop optimal sub-ensembles that surpass the initial classifier’s accuracy while using fewer trees.Experimental results demonstrate that our FNRC methodology performs robustly across different datasets,achieving a balanced Precision of 75%on the PROMISE dataset and an impressive Recall of 85%on the CCHIT dataset.Both datasets consistently maintain an F-measure around 64%,highlighting FNRC’s ability to effectively balance precision and recall in diverse scenarios.These findings contribute to more accurate and efficient software development processes,increasing the probability of achieving successful project outcomes.
基金great gratitude to National Key Research and Development Project(Grant No.2019YFC1509800)for their financial supportNational Nature Science Foundation of China(Grant No.12172211)for their financial support.
文摘Geotechnical engineering data are usually small-sample and high-dimensional,which brings a lot of challenges in predictive modeling.This paper uses a typical high-dimensional and small-sample swell pressure(P_(s))dataset to explore the possibility of using multi-algorithm hybrid ensemble and dimensionality reduction methods to mitigate the uncertainty of soil parameter prediction.Based on six machine learning(ML)algorithms,the base learner pool is constructed,and four ensemble methods,Stacking(SG),Blending(BG),Voting regression(VR),and Feature weight linear stacking(FWL),are used for the multi-algorithm ensemble.Furthermore,the importance of permutation is used for feature dimensionality reduction to mitigate the impact of weakly correlated variables on predictive modeling.The results show that the proposed methods are superior to traditional prediction models and base ML models,where FWL is more suitable for modeling with small-sample datasets,and dimensionality reduction can simplify the data structure and reduce the adverse impact of the small-sample effect,which points the way to feature selection for predictive modeling.Based on the ensemble methods,the feature importance of the five primary factors affecting P_(s) is the maximum dry density(31.145%),clay fraction(15.876%),swell percent(15.289%),plasticity index(14%),and optimum moisture content(13.69%),the influence of input parameters on P_(s) is also investigated,in line with the findings of the existing literature.