Xinjiang Uygur Autonomous Region is a typical inland arid area in China with a sparse and uneven distribution of meteorological stations,limited access to precipitation data,and significant water scarcity.Evaluating a...Xinjiang Uygur Autonomous Region is a typical inland arid area in China with a sparse and uneven distribution of meteorological stations,limited access to precipitation data,and significant water scarcity.Evaluating and integrating precipitation datasets from different sources to accurately characterize precipitation patterns has become a challenge to provide more accurate and alternative precipitation information for the region,which can even improve the performance of hydrological modelling.This study evaluated the applicability of widely used five satellite-based precipitation products(Climate Hazards Group InfraRed Precipitation with Station(CHIRPS),China Meteorological Forcing Dataset(CMFD),Climate Prediction Center morphing method(CMORPH),Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record(PERSIANN-CDR),and Tropical Rainfall Measuring Mission Multi-satellite Precipitation Analysis(TMPA))and a reanalysis precipitation dataset(ECMWF Reanalysis v5-Land Dataset(ERA5-Land))in Xinjiang using ground-based observational precipitation data from a limited number of meteorological stations.Based on this assessment,we proposed a framework that integrated different precipitation datasets with varying spatial resolutions using a dynamic Bayesian model averaging(DBMA)approach,the expectation-maximization method,and the ordinary Kriging interpolation method.The daily precipitation data merged using the DBMA approach exhibited distinct spatiotemporal variability,with an outstanding performance,as indicated by low root mean square error(RMSE=1.40 mm/d)and high Person's correlation coefficient(CC=0.67).Compared with the traditional simple model averaging(SMA)and individual product data,although the DBMA-fused precipitation data were slightly lower than the best precipitation product(CMFD),the overall performance of DBMA was more robust.The error analysis between DBMA-fused precipitation dataset and the more advanced Integrated Multi-satellite Retrievals for Global Precipitation Measurement Final(IMERG-F)precipitation product,as well as hydrological simulations in the Ebinur Lake Basin,further demonstrated the superior performance of DBMA-fused precipitation dataset in the entire Xinjiang region.The proposed framework for solving the fusion problem of multi-source precipitation data with different spatial resolutions is feasible for application in inland arid areas,and aids in obtaining more accurate regional hydrological information and improving regional water resources management capabilities and meteorological research in these regions.展开更多
To ensure agreement between theoretical calculations and experimental data,parameters to selected nuclear physics models are perturbed and fine-tuned in nuclear data evaluations.This approach assumes that the chosen s...To ensure agreement between theoretical calculations and experimental data,parameters to selected nuclear physics models are perturbed and fine-tuned in nuclear data evaluations.This approach assumes that the chosen set of models accurately represents the‘true’distribution of considered observables.Furthermore,the models are chosen globally,indicating their applicability across the entire energy range of interest.However,this approach overlooks uncertainties inherent in the models themselves.In this work,we propose that instead of selecting globally a winning model set and proceeding with it as if it was the‘true’model set,we,instead,take a weighted average over multiple models within a Bayesian model averaging(BMA)framework,each weighted by its posterior probability.The method involves executing a set of TALYS calculations by randomly varying multiple nuclear physics models and their parameters to yield a vector of calculated observables.Next,computed likelihood function values at each incident energy point were then combined with the prior distributions to obtain updated posterior distributions for selected cross sections and the elastic angular distributions.As the cross sections and elastic angular distributions were updated locally on a per-energy-point basis,the approach typically results in discontinuities or“kinks”in the cross section curves,and these were addressed using spline interpolation.The proposed BMA method was applied to the evaluation of proton-induced reactions on ^(58)Ni between 1 and 100 MeV.The results demonstrated a favorable comparison with experimental data as well as with the TENDL-2023 evaluation.展开更多
Statistical biases may be introduced by imprecisely quantifying background radiation reference levels. It is, therefore, imperative to devise a simple, adaptable approach for precisely describing the reference backgro...Statistical biases may be introduced by imprecisely quantifying background radiation reference levels. It is, therefore, imperative to devise a simple, adaptable approach for precisely describing the reference background levels of naturally occurring radionuclides (NOR) in mining sites. As a substitute statistical method, we suggest using Bayesian modeling in this work to examine the spatial distribution of NOR. For naturally occurring gamma-induced radionuclides like 232Th, 40K, and 238U, statistical parameters are inferred using the Markov Chain Monte Carlo (MCMC) method. After obtaining an accurate subsample using bootstrapping, we exclude any possible outliers that fall outside of the Highest Density Interval (HDI). We use MCMC to build a Bayesian model with the resampled data and make predictions about the posterior distribution of radionuclides produced by gamma irradiation. This method offers a strong and dependable way to describe NOR reference background values, which is important for managing and evaluating radiation risks in mining contexts.展开更多
In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functi...In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functional magnetic resonance imaging (fMRI). Then a Bayesian voxel- wised method, graphical-model-based multivariate analysis (GAMMA), is used to explore the associations between abnormal functional integration within DMN and clinical variable. Without any prior knowledge, five machine learning methods, namely, support vector machines (SVMs), classification and regression trees ( CART ), logistic regression, the Bayesian network, and C4.5, are applied to the classification. The functional integration patterns were alternative within DMN, which have the power to predict MHE with an accuracy of 98%. The GAMMA method generating functional integration patterns within DMN can become a simple, objective, and common imaging biomarker for detecting MIIE and can serve as a supplement to the existing diagnostic methods.展开更多
It is quite common in statistical modeling to select a model and make inference as if the model had been known in advance;i.e. ignoring model selection uncertainty. The resulted estimator is called post-model selectio...It is quite common in statistical modeling to select a model and make inference as if the model had been known in advance;i.e. ignoring model selection uncertainty. The resulted estimator is called post-model selection estimator (PMSE) whose properties are hard to derive. Conditioning on data at hand (as it is usually the case), Bayesian model selection is free of this phenomenon. This paper is concerned with the properties of Bayesian estimator obtained after model selection when the frequentist (long run) performances of the resulted Bayesian estimator are of interest. The proposed method, using Bayesian decision theory, is based on the well known Bayesian model averaging (BMA)’s machinery;and outperforms PMSE and BMA. It is shown that if the unconditional model selection probability is equal to model prior, then the proposed approach reduces BMA. The method is illustrated using Bernoulli trials.展开更多
Bayesian model averaging (BMA) is a popular and powerful statistical method of taking account of uncertainty about model form or assumption. Usually the long run (frequentist) performances of the resulted estimator ar...Bayesian model averaging (BMA) is a popular and powerful statistical method of taking account of uncertainty about model form or assumption. Usually the long run (frequentist) performances of the resulted estimator are hard to derive. This paper proposes a mixture of priors and sampling distributions as a basic of a Bayes estimator. The frequentist properties of the new Bayes estimator are automatically derived from Bayesian decision theory. It is shown that if all competing models have the same parametric form, the new Bayes estimator reduces to BMA estimator. The method is applied to the daily exchange rate Euro to US Dollar.展开更多
Gross primary production(GPP) plays a crucial part in the carbon cycle of terrestrial ecosystems.A set of validated monthly GPP data from 1957 to 2010 in 0.5°× 0.5° grids of China was weighted from the ...Gross primary production(GPP) plays a crucial part in the carbon cycle of terrestrial ecosystems.A set of validated monthly GPP data from 1957 to 2010 in 0.5°× 0.5° grids of China was weighted from the Multi-scale Terrestrial Model Intercomparison Project using Bayesian model averaging(BMA).The spatial anomalies of detrended BMA GPP during the growing seasons of typical El Nino years indicated that GPP response to El Nino varies with Pacific Decadal Oscillation(PDO) phases: when the PDO was in the cool phase,it was likely that GPP was greater in northern China(32°–38°N,111°–122°E) and less in the Yangtze River valley(28°–32°N,111°–122°E);in contrast,when PDO was in the warm phase,the GPP anomalies were usually reversed in these two regions.The consistent spatiotemporal pattern and high partial correlation revealed that rainfall dominated this phenomenon.The previously published findings on how El Nino during different phases of PDO affecting rainfall in eastern China make the statistical relationship between GPP and El Nino in this study theoretically credible.This paper not only introduces an effective way to use BMA in grids that have mixed plant function types,but also makes it possible to evaluate the carbon cycle in eastern China based on the prediction of El Nino and PDO.展开更多
This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study are...This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study area were analyzed with the HB model.The generalized extreme value(GEV)distribution was selected as the extreme flood distribution,and the GEV distribution location and scale parameters were spatially modeled through a regression approach with the drainage area as a covariate.The Markov chain Monte Carlo(MCMC)method with Gibbs sampling was employed to calculate the posterior distribution in the HB model.The results showed that the proposed HB model provided satisfactory Bayesian credible intervals for flood quantiles,while the traditional delta method could not provide reliable uncertainty estimations for large flood quantiles,due to the fact that the lower confidence bounds tended to decrease as the return periods increased.Furthermore,the HB model for regional analysis allowed for a reduction in the value of some restrictive assumptions in the traditional index flood method,such as the homogeneity region assumption and the scale invariance assumption.The HB model can also provide an uncertainty band of flood quantile prediction at a poorly gauged or ungauged site,but the index flood method with L-moments does not demonstrate this uncertainty directly.Therefore,the HB model is an effective method of implementing the flexible local and regional frequency analysis scheme,and of quantifying the associated predictive uncertainty.展开更多
The choices of the parameterizations for each component in a microwave emission model have significant effects on the quality of brightness temperature (Tb) sim- ulation. How to reduce the uncertainty in the Tb simu...The choices of the parameterizations for each component in a microwave emission model have significant effects on the quality of brightness temperature (Tb) sim- ulation. How to reduce the uncertainty in the Tb simulation is investigated by adopting a statistical post-processing procedure with the Bayesian model averaging (BMA) ensemble approach. The simulations by the community microwave emission model (CMEM) cou- pled with the community land model version 4.5 (CLM4.5) over China's Mainland are con- ducted by the 24 configurations from four vegetation opacity parameterizations (VOPs), three soil dielectric constant parameterizations (SDCPs), and two soil roughness param- eterizations (SRPs). Compared with the simple arithmetical averaging (SAA) method, the BMA reconstructions have a higher spatial correlation coefficient (larger than 0.99) than the C-band satellite observations of the advanced microwave scanning radiometer on the Earth observing system (AMSR-E) at the vertical polarization. Moreover, the BMA product performs the best among the ensemble members for all vegetation classes, with a mean root-mean-square difference (RMSD) of 4 K and a temporal correlation coefficient of 0.64.展开更多
Climate change in mountainous regions has significant impacts on hydrological and ecological systems. This research studied the future temperature, precipitation and snowfall in the 21^(st) century for the Tianshan ...Climate change in mountainous regions has significant impacts on hydrological and ecological systems. This research studied the future temperature, precipitation and snowfall in the 21^(st) century for the Tianshan and northern Kunlun Mountains(TKM) based on the general circulation model(GCM) simulation ensemble from the coupled model intercomparison project phase 5(CMIP5) under the representative concentration pathway(RCP) lower emission scenario RCP4.5 and higher emission scenario RCP8.5 using the Bayesian model averaging(BMA) technique. Results show that(1) BMA significantly outperformed the simple ensemble analysis and BMA mean matches all the three observed climate variables;(2) at the end of the 21^(st) century(2070–2099) under RCP8.5, compared to the control period(1976–2005), annual mean temperature and mean annual precipitation will rise considerably by 4.8°C and 5.2%, respectively, while mean annual snowfall will dramatically decrease by 26.5%;(3) precipitation will increase in the northern Tianshan region while decrease in the Amu Darya Basin. Snowfall will significantly decrease in the western TKM. Mean annual snowfall fraction will also decrease from 0.56 of 1976–2005 to 0.42 of 2070–2099 under RCP8.5; and(4) snowfall shows a high sensitivity to temperature in autumn and spring while a low sensitivity in winter, with the highest sensitivity values occurring at the edge areas of TKM. The projections mean that flood risk will increase and solid water storage will decrease.展开更多
Real-world study is valuable for traditional Chinese medicine.However,there are no gold standards of statistical approaches for analyzing data from real-world study of traditional Chinese medicine.With the development...Real-world study is valuable for traditional Chinese medicine.However,there are no gold standards of statistical approaches for analyzing data from real-world study of traditional Chinese medicine.With the development of computer technology,researchers have increasingly paid attention to Bayesian statistics in the biomedical field.In present study,real-world study and Bayesian statistics were introduced.It was discussed that why and when to use Bayesian analysis and the challenge in the real-world study of traditional Chinese medicine.展开更多
The ability to estimate terrestrial water storage(TWS)is essential for monitoring hydrological extremes(e.g.,droughts and floods)and predicting future changes in the hydrological cycle.However,inadequacies in model ph...The ability to estimate terrestrial water storage(TWS)is essential for monitoring hydrological extremes(e.g.,droughts and floods)and predicting future changes in the hydrological cycle.However,inadequacies in model physics and parameters,as well as uncertainties in meteorological forcing data,commonly limit the ability of land surface models(LSMs)to accurately simulate TWS.In this study,the authors show how simulations of TWS anomalies(TWSAs)from multiple meteorological forcings and multiple LSMs can be combined in a Bayesian model averaging(BMA)ensemble approach to improve monitoring and predictions.Simulations using three forcing datasets and two LSMs were conducted over China's Mainland for the period 1979–2008.All the simulations showed good temporal correlations with satellite observations from the Gravity Recovery and Climate Experiment during 2004–08.The correlation coefficient ranged between 0.5 and 0.8 in the humid regions(e.g.,the Yangtze river basin,Huaihe basin,and Zhujiang basin),but was much lower in the arid regions(e.g.,the Heihe basin and Tarim river basin).The BMA ensemble approach performed better than all individual member simulations.It captured the spatial distribution and temporal variations of TWSAs over China's Mainland and the eight major river basins very well;plus,it showed the highest R value(>0.5)over most basins and the lowest root-mean-square error value(<40 mm)in all basins of China.The good performance of the BMA ensemble approach shows that it is a promising way to reproduce long-term,high-resolution spatial and temporal TWSA data.展开更多
Electrical Source Imaging (ESI) is a non-invasive technique of reconstructing brain activities using EEG data. This technique has been applied to evaluate epilepsy patients being evaluated for epilepsy surgery, showin...Electrical Source Imaging (ESI) is a non-invasive technique of reconstructing brain activities using EEG data. This technique has been applied to evaluate epilepsy patients being evaluated for epilepsy surgery, showing encouraging results for mapping interictal epileptiform discharges (IED). However, ESI is underused in planning epilepsy surgery. This is basically due to the wide availability of methods for solving the electromagnetism inverse problem (e-IP) associated to few studies using EEG setups similar to those most commonly used in clinical setting. In this study, we applied six different methods of solving the e-IP based on IEDs of 20 focal epilepsy patients that presented abnormalities in their MRI. We compared the ESI maps obtained by each method with the location of the abnormality, calculating the Euclidian distances from the center of the lesion to the closest border of the method solution (CL-BM) and also to the solution’s maxima (CL-MM). We also applied a score system in order to allow us to evaluate the sensitivity of each method for temporal and extra temporal patients. In our patients, the Bayesian Model Averaging method had a sensitivity of 86% and the shortest CL-MM. This method also had more restricted solutions that were more representative of epileptogenic activities than those obtained by the other methods.展开更多
Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator....Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator. Hierarchical Bayesian approach to SAE problems offers several advantages over traditional SAE models including the ability of appropriately accounting for the type of surveyed variable. In this paper, a number of model specifications for estimating small area counts are discussed and their relative merits are illustrated. We conducted a simulation study by reproducing in a simplified form the Italian Labour Force Survey and taking the Local Labor Markets as target areas. Simulated data were generated by assuming population characteristics of interest as well as survey sampling design as known. In one set of experiments, numbers of employment/unemployment from census data were utilized, in others population characteristics were varied. Results show persistent model failures for some standard Fay-Herriot specifications and for generalized linear Poisson models with (log-)normal sampling stage, whilst either unmatched or nonnormal sampling stage models get the best performance in terms of bias, accuracy and reliability. Though, the study also found that any model noticeably improves on its performance by letting sampling variances be stochastically determined rather than assumed as known as is the general practice. Moreover, we address the issue of model determination to point out limits and possible deceptions of commonly used criteria for model selection and checking in SAE context.展开更多
The successful experience of adopting distributed development models in such open source projects includes GNU/Linux operating system, Apache HTTP server, Android, BusyBox, and so on. The open source project contains ...The successful experience of adopting distributed development models in such open source projects includes GNU/Linux operating system, Apache HTTP server, Android, BusyBox, and so on. The open source project contains special features so-called software composition by which several geographically-dispersed compo-nents are developed in all parts of the world. We propose a method of component-oriented reliability as-sessment based on hierarchical Bayesian model and Markov chain Monte Carlo methods. Especially, we fo-cus on the fault-detection rate for each component reported to the bug tracking system. We can assess the reliability for the whole open source software system by using the confidence interval for each component. Also, we analyze actual software fault-count data to show numerical examples of reliability assessment for OSS.展开更多
Computations involved in Bayesian approach to practical model selection problems are usually very difficult. Computational simplifications are sometimes possible, but are not generally applicable. There is a large lit...Computations involved in Bayesian approach to practical model selection problems are usually very difficult. Computational simplifications are sometimes possible, but are not generally applicable. There is a large literature available on a methodology based on information theory called Minimum Description Length (MDL). It is described here how many of these techniques are either directly Bayesian in nature, or are very good objective approximations to Bayesian solutions. First, connections between the Bayesian approach and MDL are theoretically explored;thereafter a few illustrations are provided to describe how MDL can give useful computational simplifications.展开更多
Stable water isotopes are natural tracers quantifying the contribution of moisture recycling to local precipitation,i.e.,the moisture recycling ratio,but various isotope-based models usually lead to different results,...Stable water isotopes are natural tracers quantifying the contribution of moisture recycling to local precipitation,i.e.,the moisture recycling ratio,but various isotope-based models usually lead to different results,which affects the accuracy of local moisture recycling.In this study,a total of 18 stations from four typical areas in China were selected to compare the performance of isotope-based linear and Bayesian mixing models and to determine local moisture recycling ratio.Among the three vapor sources including advection,transpiration,and surface evaporation,the advection vapor usually played a dominant role,and the contribution of surface evaporation was less than that of transpiration.When the abnormal values were ignored,the arithmetic averages of differences between isotope-based linear and the Bayesian mixing models were 0.9%for transpiration,0.2%for surface evaporation,and–1.1%for advection,respectively,and the medians were 0.5%,0.2%,and–0.8%,respectively.The importance of transpiration was slightly less for most cases when the Bayesian mixing model was applied,and the contribution of advection was relatively larger.The Bayesian mixing model was found to perform better in determining an efficient solution since linear model sometimes resulted in negative contribution ratios.Sensitivity test with two isotope scenarios indicated that the Bayesian model had a relatively low sensitivity to the changes in isotope input,and it was important to accurately estimate the isotopes in precipitation vapor.Generally,the Bayesian mixing model should be recommended instead of a linear model.The findings are useful for understanding the performance of isotope-based linear and Bayesian mixing models under various climate backgrounds.展开更多
The Bayesian structural equation model integrates the principles of Bayesian statistics, providing a more flexible and comprehensive modeling framework. In exploring complex relationships between variables, handling u...The Bayesian structural equation model integrates the principles of Bayesian statistics, providing a more flexible and comprehensive modeling framework. In exploring complex relationships between variables, handling uncertainty, and dealing with missing data, the Bayesian structural equation model demonstrates unique advantages. Therefore, Bayesian methods are used in this paper to establish a structural equation model of innovative talent cognition, with the measurement of college students’ cognition of innovative talent being studied. An in-depth analysis is conducted on the effects of innovative self-efficacy, social resources, innovative personality traits, and school education, aiming to explore the factors influencing college students’ innovative talent. The results indicate that innovative self-efficacy plays a key role in perception, social resources are significantly positively correlated with the perception of innovative talents, innovative personality tendencies and school education are positively correlated with the perception of innovative talents, but the impact is not significant.展开更多
A probabilistic precipitation forecasting model using generalized additive models (GAMs) and Bayesian model averaging (BMA) was proposed in this paper. GAMs were used to fit the spatial-temporal precipitation mode...A probabilistic precipitation forecasting model using generalized additive models (GAMs) and Bayesian model averaging (BMA) was proposed in this paper. GAMs were used to fit the spatial-temporal precipitation models to individual ensemble member forecasts. The distributions of the precipitation occurrence and the cumulative precipitation amount were represented simultaneously by a single Tweedie distribution. BMA was then used as a post-processing method to combine the individual models to form a more skillful probabilistic forecasting model. The mixing weights were estimated using the expectation-maximization algorithm. The residual diagnostics was used to examine if the fitted BMA forecasting model had fully captured the spatial and temporal variations of precipitation. The proposed method was applied to daily observations at the Yishusi River basin for July 2007 using the National Centers for Environmental Prediction ensemble forecasts. By applying scoring rules, the BMA forecasts were verified and showed better performances compared with the empirical probabilistic ensemble forecasts, particularly for extreme precipitation. Finally, possible improvements and a^plication of this method to the downscaling of climate change scenarios were discussed.展开更多
Bayesian model averaging(BMA) is a recently proposed statistical method for calibrating forecast ensembles from numerical weather models.However,successful implementation of BMA requires accurate estimates of the weig...Bayesian model averaging(BMA) is a recently proposed statistical method for calibrating forecast ensembles from numerical weather models.However,successful implementation of BMA requires accurate estimates of the weights and variances of the individual competing models in the ensemble.Two methods,namely the Expectation-Maximization(EM) and the Markov Chain Monte Carlo(MCMC) algorithms,are widely used for BMA model training.Both methods have their own respective strengths and weaknesses.In this paper,we first modify the BMA log-likelihood function with the aim of removing the addi-tional limitation that requires that the BMA weights add to one,and then use a limited memory quasi-Newtonian algorithm for solving the nonlinear optimization problem,thereby formulating a new approach for BMA(referred to as BMA-BFGS).Several groups of multi-model soil moisture simulation experiments from three land surface models show that the performance of BMA-BFGS is similar to the MCMC method in terms of simulation accuracy,and that both are superior to the EM algo-rithm.On the other hand,the computational cost of the BMA-BFGS algorithm is substantially less than for MCMC and is al-most equivalent to that for EM.展开更多
基金supported by The Technology Innovation Team(Tianshan Innovation Team),Innovative Team for Efficient Utilization of Water Resources in Arid Regions(2022TSYCTD0001)the National Natural Science Foundation of China(42171269)the Xinjiang Academician Workstation Cooperative Research Project(2020.B-001).
文摘Xinjiang Uygur Autonomous Region is a typical inland arid area in China with a sparse and uneven distribution of meteorological stations,limited access to precipitation data,and significant water scarcity.Evaluating and integrating precipitation datasets from different sources to accurately characterize precipitation patterns has become a challenge to provide more accurate and alternative precipitation information for the region,which can even improve the performance of hydrological modelling.This study evaluated the applicability of widely used five satellite-based precipitation products(Climate Hazards Group InfraRed Precipitation with Station(CHIRPS),China Meteorological Forcing Dataset(CMFD),Climate Prediction Center morphing method(CMORPH),Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record(PERSIANN-CDR),and Tropical Rainfall Measuring Mission Multi-satellite Precipitation Analysis(TMPA))and a reanalysis precipitation dataset(ECMWF Reanalysis v5-Land Dataset(ERA5-Land))in Xinjiang using ground-based observational precipitation data from a limited number of meteorological stations.Based on this assessment,we proposed a framework that integrated different precipitation datasets with varying spatial resolutions using a dynamic Bayesian model averaging(DBMA)approach,the expectation-maximization method,and the ordinary Kriging interpolation method.The daily precipitation data merged using the DBMA approach exhibited distinct spatiotemporal variability,with an outstanding performance,as indicated by low root mean square error(RMSE=1.40 mm/d)and high Person's correlation coefficient(CC=0.67).Compared with the traditional simple model averaging(SMA)and individual product data,although the DBMA-fused precipitation data were slightly lower than the best precipitation product(CMFD),the overall performance of DBMA was more robust.The error analysis between DBMA-fused precipitation dataset and the more advanced Integrated Multi-satellite Retrievals for Global Precipitation Measurement Final(IMERG-F)precipitation product,as well as hydrological simulations in the Ebinur Lake Basin,further demonstrated the superior performance of DBMA-fused precipitation dataset in the entire Xinjiang region.The proposed framework for solving the fusion problem of multi-source precipitation data with different spatial resolutions is feasible for application in inland arid areas,and aids in obtaining more accurate regional hydrological information and improving regional water resources management capabilities and meteorological research in these regions.
基金funding from the Paul ScherrerInstitute,Switzerland through the NES/GFA-ABE Cross Project。
文摘To ensure agreement between theoretical calculations and experimental data,parameters to selected nuclear physics models are perturbed and fine-tuned in nuclear data evaluations.This approach assumes that the chosen set of models accurately represents the‘true’distribution of considered observables.Furthermore,the models are chosen globally,indicating their applicability across the entire energy range of interest.However,this approach overlooks uncertainties inherent in the models themselves.In this work,we propose that instead of selecting globally a winning model set and proceeding with it as if it was the‘true’model set,we,instead,take a weighted average over multiple models within a Bayesian model averaging(BMA)framework,each weighted by its posterior probability.The method involves executing a set of TALYS calculations by randomly varying multiple nuclear physics models and their parameters to yield a vector of calculated observables.Next,computed likelihood function values at each incident energy point were then combined with the prior distributions to obtain updated posterior distributions for selected cross sections and the elastic angular distributions.As the cross sections and elastic angular distributions were updated locally on a per-energy-point basis,the approach typically results in discontinuities or“kinks”in the cross section curves,and these were addressed using spline interpolation.The proposed BMA method was applied to the evaluation of proton-induced reactions on ^(58)Ni between 1 and 100 MeV.The results demonstrated a favorable comparison with experimental data as well as with the TENDL-2023 evaluation.
文摘Statistical biases may be introduced by imprecisely quantifying background radiation reference levels. It is, therefore, imperative to devise a simple, adaptable approach for precisely describing the reference background levels of naturally occurring radionuclides (NOR) in mining sites. As a substitute statistical method, we suggest using Bayesian modeling in this work to examine the spatial distribution of NOR. For naturally occurring gamma-induced radionuclides like 232Th, 40K, and 238U, statistical parameters are inferred using the Markov Chain Monte Carlo (MCMC) method. After obtaining an accurate subsample using bootstrapping, we exclude any possible outliers that fall outside of the Highest Density Interval (HDI). We use MCMC to build a Bayesian model with the resampled data and make predictions about the posterior distribution of radionuclides produced by gamma irradiation. This method offers a strong and dependable way to describe NOR reference background values, which is important for managing and evaluating radiation risks in mining contexts.
基金The National Natural Science Foundation of China(No.8123003481271739+2 种基金81501453)the Special Program of Medical Science of Jiangsu Province(No.BL2013029)the Natural Science Foundation of Jiangsu Province(No.BK20141342)
文摘In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functional magnetic resonance imaging (fMRI). Then a Bayesian voxel- wised method, graphical-model-based multivariate analysis (GAMMA), is used to explore the associations between abnormal functional integration within DMN and clinical variable. Without any prior knowledge, five machine learning methods, namely, support vector machines (SVMs), classification and regression trees ( CART ), logistic regression, the Bayesian network, and C4.5, are applied to the classification. The functional integration patterns were alternative within DMN, which have the power to predict MHE with an accuracy of 98%. The GAMMA method generating functional integration patterns within DMN can become a simple, objective, and common imaging biomarker for detecting MIIE and can serve as a supplement to the existing diagnostic methods.
文摘It is quite common in statistical modeling to select a model and make inference as if the model had been known in advance;i.e. ignoring model selection uncertainty. The resulted estimator is called post-model selection estimator (PMSE) whose properties are hard to derive. Conditioning on data at hand (as it is usually the case), Bayesian model selection is free of this phenomenon. This paper is concerned with the properties of Bayesian estimator obtained after model selection when the frequentist (long run) performances of the resulted Bayesian estimator are of interest. The proposed method, using Bayesian decision theory, is based on the well known Bayesian model averaging (BMA)’s machinery;and outperforms PMSE and BMA. It is shown that if the unconditional model selection probability is equal to model prior, then the proposed approach reduces BMA. The method is illustrated using Bernoulli trials.
文摘Bayesian model averaging (BMA) is a popular and powerful statistical method of taking account of uncertainty about model form or assumption. Usually the long run (frequentist) performances of the resulted estimator are hard to derive. This paper proposes a mixture of priors and sampling distributions as a basic of a Bayes estimator. The frequentist properties of the new Bayes estimator are automatically derived from Bayesian decision theory. It is shown that if all competing models have the same parametric form, the new Bayes estimator reduces to BMA estimator. The method is applied to the daily exchange rate Euro to US Dollar.
基金supported by the National Key Research and Development Program of China (Grant Nos.2016YFA0602501 and 2018YFA0606004)the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant Nos.XDA20040301 and XDA20020201)。
文摘Gross primary production(GPP) plays a crucial part in the carbon cycle of terrestrial ecosystems.A set of validated monthly GPP data from 1957 to 2010 in 0.5°× 0.5° grids of China was weighted from the Multi-scale Terrestrial Model Intercomparison Project using Bayesian model averaging(BMA).The spatial anomalies of detrended BMA GPP during the growing seasons of typical El Nino years indicated that GPP response to El Nino varies with Pacific Decadal Oscillation(PDO) phases: when the PDO was in the cool phase,it was likely that GPP was greater in northern China(32°–38°N,111°–122°E) and less in the Yangtze River valley(28°–32°N,111°–122°E);in contrast,when PDO was in the warm phase,the GPP anomalies were usually reversed in these two regions.The consistent spatiotemporal pattern and high partial correlation revealed that rainfall dominated this phenomenon.The previously published findings on how El Nino during different phases of PDO affecting rainfall in eastern China make the statistical relationship between GPP and El Nino in this study theoretically credible.This paper not only introduces an effective way to use BMA in grids that have mixed plant function types,but also makes it possible to evaluate the carbon cycle in eastern China based on the prediction of El Nino and PDO.
基金supported by the National Natural Science Foundation of China(Grants No.51779074 and 41371052)the Special Fund for the Public Welfare Industry of the Ministry of Water Resources of China(Grant No.201501059)+3 种基金the National Key Research and Development Program of China(Grant No.2017YFC0404304)the Jiangsu Water Conservancy Science and Technology Project(Grant No.2017027)the Program for Outstanding Young Talents in Colleges and Universities of Anhui Province(Grant No.gxyq2018143)the Natural Science Foundation of Wanjiang University of Technology(Grant No.WG18030)
文摘This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study area were analyzed with the HB model.The generalized extreme value(GEV)distribution was selected as the extreme flood distribution,and the GEV distribution location and scale parameters were spatially modeled through a regression approach with the drainage area as a covariate.The Markov chain Monte Carlo(MCMC)method with Gibbs sampling was employed to calculate the posterior distribution in the HB model.The results showed that the proposed HB model provided satisfactory Bayesian credible intervals for flood quantiles,while the traditional delta method could not provide reliable uncertainty estimations for large flood quantiles,due to the fact that the lower confidence bounds tended to decrease as the return periods increased.Furthermore,the HB model for regional analysis allowed for a reduction in the value of some restrictive assumptions in the traditional index flood method,such as the homogeneity region assumption and the scale invariance assumption.The HB model can also provide an uncertainty band of flood quantile prediction at a poorly gauged or ungauged site,but the index flood method with L-moments does not demonstrate this uncertainty directly.Therefore,the HB model is an effective method of implementing the flexible local and regional frequency analysis scheme,and of quantifying the associated predictive uncertainty.
基金Project supported by the China Special Fund for Meteorological Research in the Public Interest(No.GYHY201306045)the National Natural Science Foundation of China(Nos.41305066 and41575096)
文摘The choices of the parameterizations for each component in a microwave emission model have significant effects on the quality of brightness temperature (Tb) sim- ulation. How to reduce the uncertainty in the Tb simulation is investigated by adopting a statistical post-processing procedure with the Bayesian model averaging (BMA) ensemble approach. The simulations by the community microwave emission model (CMEM) cou- pled with the community land model version 4.5 (CLM4.5) over China's Mainland are con- ducted by the 24 configurations from four vegetation opacity parameterizations (VOPs), three soil dielectric constant parameterizations (SDCPs), and two soil roughness param- eterizations (SRPs). Compared with the simple arithmetical averaging (SAA) method, the BMA reconstructions have a higher spatial correlation coefficient (larger than 0.99) than the C-band satellite observations of the advanced microwave scanning radiometer on the Earth observing system (AMSR-E) at the vertical polarization. Moreover, the BMA product performs the best among the ensemble members for all vegetation classes, with a mean root-mean-square difference (RMSD) of 4 K and a temporal correlation coefficient of 0.64.
基金supported by the Thousand Youth Talents Plan(Xinjiang Project)the National Natural Science Foundation of China(41630859)the West Light Foundation of Chinese Academy of Sciences(2016QNXZB12)
文摘Climate change in mountainous regions has significant impacts on hydrological and ecological systems. This research studied the future temperature, precipitation and snowfall in the 21^(st) century for the Tianshan and northern Kunlun Mountains(TKM) based on the general circulation model(GCM) simulation ensemble from the coupled model intercomparison project phase 5(CMIP5) under the representative concentration pathway(RCP) lower emission scenario RCP4.5 and higher emission scenario RCP8.5 using the Bayesian model averaging(BMA) technique. Results show that(1) BMA significantly outperformed the simple ensemble analysis and BMA mean matches all the three observed climate variables;(2) at the end of the 21^(st) century(2070–2099) under RCP8.5, compared to the control period(1976–2005), annual mean temperature and mean annual precipitation will rise considerably by 4.8°C and 5.2%, respectively, while mean annual snowfall will dramatically decrease by 26.5%;(3) precipitation will increase in the northern Tianshan region while decrease in the Amu Darya Basin. Snowfall will significantly decrease in the western TKM. Mean annual snowfall fraction will also decrease from 0.56 of 1976–2005 to 0.42 of 2070–2099 under RCP8.5; and(4) snowfall shows a high sensitivity to temperature in autumn and spring while a low sensitivity in winter, with the highest sensitivity values occurring at the edge areas of TKM. The projections mean that flood risk will increase and solid water storage will decrease.
基金the project of National Natural Science Foundation of China(grant numbers 81273935,81303093,81602930).
文摘Real-world study is valuable for traditional Chinese medicine.However,there are no gold standards of statistical approaches for analyzing data from real-world study of traditional Chinese medicine.With the development of computer technology,researchers have increasingly paid attention to Bayesian statistics in the biomedical field.In present study,real-world study and Bayesian statistics were introduced.It was discussed that why and when to use Bayesian analysis and the challenge in the real-world study of traditional Chinese medicine.
基金supported by the National Natural Science Foundation of China(Grant Nos.41405083 and 91437220)the Natural Science Foundation of Hunan Province,China(Grant No.2015JJ3098)+1 种基金the Key Research Program of Frontier Sciences,CAS(QYZDY-SSW-DQC012)the Fund Project for The Education Department of Hunan Province(Grant No.16A234)
文摘The ability to estimate terrestrial water storage(TWS)is essential for monitoring hydrological extremes(e.g.,droughts and floods)and predicting future changes in the hydrological cycle.However,inadequacies in model physics and parameters,as well as uncertainties in meteorological forcing data,commonly limit the ability of land surface models(LSMs)to accurately simulate TWS.In this study,the authors show how simulations of TWS anomalies(TWSAs)from multiple meteorological forcings and multiple LSMs can be combined in a Bayesian model averaging(BMA)ensemble approach to improve monitoring and predictions.Simulations using three forcing datasets and two LSMs were conducted over China's Mainland for the period 1979–2008.All the simulations showed good temporal correlations with satellite observations from the Gravity Recovery and Climate Experiment during 2004–08.The correlation coefficient ranged between 0.5 and 0.8 in the humid regions(e.g.,the Yangtze river basin,Huaihe basin,and Zhujiang basin),but was much lower in the arid regions(e.g.,the Heihe basin and Tarim river basin).The BMA ensemble approach performed better than all individual member simulations.It captured the spatial distribution and temporal variations of TWSAs over China's Mainland and the eight major river basins very well;plus,it showed the highest R value(>0.5)over most basins and the lowest root-mean-square error value(<40 mm)in all basins of China.The good performance of the BMA ensemble approach shows that it is a promising way to reproduce long-term,high-resolution spatial and temporal TWSA data.
文摘Electrical Source Imaging (ESI) is a non-invasive technique of reconstructing brain activities using EEG data. This technique has been applied to evaluate epilepsy patients being evaluated for epilepsy surgery, showing encouraging results for mapping interictal epileptiform discharges (IED). However, ESI is underused in planning epilepsy surgery. This is basically due to the wide availability of methods for solving the electromagnetism inverse problem (e-IP) associated to few studies using EEG setups similar to those most commonly used in clinical setting. In this study, we applied six different methods of solving the e-IP based on IEDs of 20 focal epilepsy patients that presented abnormalities in their MRI. We compared the ESI maps obtained by each method with the location of the abnormality, calculating the Euclidian distances from the center of the lesion to the closest border of the method solution (CL-BM) and also to the solution’s maxima (CL-MM). We also applied a score system in order to allow us to evaluate the sensitivity of each method for temporal and extra temporal patients. In our patients, the Bayesian Model Averaging method had a sensitivity of 86% and the shortest CL-MM. This method also had more restricted solutions that were more representative of epileptogenic activities than those obtained by the other methods.
文摘Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator. Hierarchical Bayesian approach to SAE problems offers several advantages over traditional SAE models including the ability of appropriately accounting for the type of surveyed variable. In this paper, a number of model specifications for estimating small area counts are discussed and their relative merits are illustrated. We conducted a simulation study by reproducing in a simplified form the Italian Labour Force Survey and taking the Local Labor Markets as target areas. Simulated data were generated by assuming population characteristics of interest as well as survey sampling design as known. In one set of experiments, numbers of employment/unemployment from census data were utilized, in others population characteristics were varied. Results show persistent model failures for some standard Fay-Herriot specifications and for generalized linear Poisson models with (log-)normal sampling stage, whilst either unmatched or nonnormal sampling stage models get the best performance in terms of bias, accuracy and reliability. Though, the study also found that any model noticeably improves on its performance by letting sampling variances be stochastically determined rather than assumed as known as is the general practice. Moreover, we address the issue of model determination to point out limits and possible deceptions of commonly used criteria for model selection and checking in SAE context.
文摘The successful experience of adopting distributed development models in such open source projects includes GNU/Linux operating system, Apache HTTP server, Android, BusyBox, and so on. The open source project contains special features so-called software composition by which several geographically-dispersed compo-nents are developed in all parts of the world. We propose a method of component-oriented reliability as-sessment based on hierarchical Bayesian model and Markov chain Monte Carlo methods. Especially, we fo-cus on the fault-detection rate for each component reported to the bug tracking system. We can assess the reliability for the whole open source software system by using the confidence interval for each component. Also, we analyze actual software fault-count data to show numerical examples of reliability assessment for OSS.
文摘Computations involved in Bayesian approach to practical model selection problems are usually very difficult. Computational simplifications are sometimes possible, but are not generally applicable. There is a large literature available on a methodology based on information theory called Minimum Description Length (MDL). It is described here how many of these techniques are either directly Bayesian in nature, or are very good objective approximations to Bayesian solutions. First, connections between the Bayesian approach and MDL are theoretically explored;thereafter a few illustrations are provided to describe how MDL can give useful computational simplifications.
基金This study was supported by the National Natural Science Foundation of China(42261008,41971034)the Natural Science Foundation of Gansu Province,China(22JR5RA074).
文摘Stable water isotopes are natural tracers quantifying the contribution of moisture recycling to local precipitation,i.e.,the moisture recycling ratio,but various isotope-based models usually lead to different results,which affects the accuracy of local moisture recycling.In this study,a total of 18 stations from four typical areas in China were selected to compare the performance of isotope-based linear and Bayesian mixing models and to determine local moisture recycling ratio.Among the three vapor sources including advection,transpiration,and surface evaporation,the advection vapor usually played a dominant role,and the contribution of surface evaporation was less than that of transpiration.When the abnormal values were ignored,the arithmetic averages of differences between isotope-based linear and the Bayesian mixing models were 0.9%for transpiration,0.2%for surface evaporation,and–1.1%for advection,respectively,and the medians were 0.5%,0.2%,and–0.8%,respectively.The importance of transpiration was slightly less for most cases when the Bayesian mixing model was applied,and the contribution of advection was relatively larger.The Bayesian mixing model was found to perform better in determining an efficient solution since linear model sometimes resulted in negative contribution ratios.Sensitivity test with two isotope scenarios indicated that the Bayesian model had a relatively low sensitivity to the changes in isotope input,and it was important to accurately estimate the isotopes in precipitation vapor.Generally,the Bayesian mixing model should be recommended instead of a linear model.The findings are useful for understanding the performance of isotope-based linear and Bayesian mixing models under various climate backgrounds.
文摘The Bayesian structural equation model integrates the principles of Bayesian statistics, providing a more flexible and comprehensive modeling framework. In exploring complex relationships between variables, handling uncertainty, and dealing with missing data, the Bayesian structural equation model demonstrates unique advantages. Therefore, Bayesian methods are used in this paper to establish a structural equation model of innovative talent cognition, with the measurement of college students’ cognition of innovative talent being studied. An in-depth analysis is conducted on the effects of innovative self-efficacy, social resources, innovative personality traits, and school education, aiming to explore the factors influencing college students’ innovative talent. The results indicate that innovative self-efficacy plays a key role in perception, social resources are significantly positively correlated with the perception of innovative talents, innovative personality tendencies and school education are positively correlated with the perception of innovative talents, but the impact is not significant.
基金Supported by the National Basic Research and Development (973) Program of China (2010CB428402)China Meteorological Administration Special Public Welfare Research Fund (GYHY200706001)
文摘A probabilistic precipitation forecasting model using generalized additive models (GAMs) and Bayesian model averaging (BMA) was proposed in this paper. GAMs were used to fit the spatial-temporal precipitation models to individual ensemble member forecasts. The distributions of the precipitation occurrence and the cumulative precipitation amount were represented simultaneously by a single Tweedie distribution. BMA was then used as a post-processing method to combine the individual models to form a more skillful probabilistic forecasting model. The mixing weights were estimated using the expectation-maximization algorithm. The residual diagnostics was used to examine if the fitted BMA forecasting model had fully captured the spatial and temporal variations of precipitation. The proposed method was applied to daily observations at the Yishusi River basin for July 2007 using the National Centers for Environmental Prediction ensemble forecasts. By applying scoring rules, the BMA forecasts were verified and showed better performances compared with the empirical probabilistic ensemble forecasts, particularly for extreme precipitation. Finally, possible improvements and a^plication of this method to the downscaling of climate change scenarios were discussed.
基金supported by National Basic Research Program of China (Grant No. 2010CB428403)National Natural Science Foundation of China (Grant No.41075076)Knowledge Innovation Program of the Chinese Academy of Sciences (Grant No.KZCX2-EW-QN207)
文摘Bayesian model averaging(BMA) is a recently proposed statistical method for calibrating forecast ensembles from numerical weather models.However,successful implementation of BMA requires accurate estimates of the weights and variances of the individual competing models in the ensemble.Two methods,namely the Expectation-Maximization(EM) and the Markov Chain Monte Carlo(MCMC) algorithms,are widely used for BMA model training.Both methods have their own respective strengths and weaknesses.In this paper,we first modify the BMA log-likelihood function with the aim of removing the addi-tional limitation that requires that the BMA weights add to one,and then use a limited memory quasi-Newtonian algorithm for solving the nonlinear optimization problem,thereby formulating a new approach for BMA(referred to as BMA-BFGS).Several groups of multi-model soil moisture simulation experiments from three land surface models show that the performance of BMA-BFGS is similar to the MCMC method in terms of simulation accuracy,and that both are superior to the EM algo-rithm.On the other hand,the computational cost of the BMA-BFGS algorithm is substantially less than for MCMC and is al-most equivalent to that for EM.