Hydrocracking is a catalytic reaction process in the petroleum refineries for converting the higher boiling temperature residue of crude oil into a lighter fraction of hydrocarbons such as gasoline and diesel. In this...Hydrocracking is a catalytic reaction process in the petroleum refineries for converting the higher boiling temperature residue of crude oil into a lighter fraction of hydrocarbons such as gasoline and diesel. In this study, a modified continuous lumping kinetic approach is applied to model the hydro-cracking of vacuum gas oil. The model is modified to take into consideration the reactor temperature on the reaction yield distribution. The model is calibrated by maximizing the likelihood function between the modeled and measured data at four different reactor temperatures. Bayesian approach parameter estimation is also applied to obtain the confidence interval of model parameters by considering the uncertainty associated with the measured errors and the model structural errors. Then Monte Carlo simulation is applied to the posterior range of the model parameters to obtain the 95% confidence interval of the model outputs for each individual fraction of the hydrocracking products. A good agreement is observed between the output of the calibrated model and the measured data points. The Bayesian approach based on the Markov Chain Monte Carlo simulation is shown to be efficient to quantify the uncertainty associated with the parameter values of the continuous lumping model.展开更多
Most previous land-surface model calibration studies have defined globalranges for their parameters to search for optimal parameter sets. Little work has been conducted tostudy the impacts of realistic versus global r...Most previous land-surface model calibration studies have defined globalranges for their parameters to search for optimal parameter sets. Little work has been conducted tostudy the impacts of realistic versus global ranges as well as model complexities on the calibrationand uncertainty estimates. The primary purpose of this paper is to investigate these impacts byemploying Bayesian Stochastic Inversion (BSI) to the Chameleon Surface Model (CHASM). The CHASM wasdesigned to explore the general aspects of land-surface energy balance representation within acommon modeling framework that can be run from a simple energy balance formulation to a complexmosaic type structure. The BSI is an uncertainty estimation technique based on Bayes theorem,importance sampling, and very fast simulated annealing. The model forcing data and surface flux datawere collected at seven sites representing a wide range of climate and vegetation conditions. Foreach site, four experiments were performed with simple and complex CHASM formulations as well asrealistic and global parameter ranges. Twenty eight experiments were conducted and 50 000 parametersets were used for each run. The results show that the use of global and realistic ranges givessimilar simulations for both modes for most sites, but the global ranges tend to produce someunreasonable optimal parameter values. Comparison of simple and complex modes shows that the simplemode has more parameters with unreasonable optimal values. Use of parameter ranges and modelcomplexities have significant impacts on frequency distribution of parameters, marginal posteriorprobability density functions, and estimates of uncertainty of simulated sensible and latent heatfluxes. Comparison between model complexity and parameter ranges shows that the former has moresignificant impacts on parameter and uncertainty estimations.展开更多
We consider state and parameter estimation for compartmental models having both timevarying and time-invariant parameters.In this manuscript,we first detail a general Bayesian computational framework as a continuation...We consider state and parameter estimation for compartmental models having both timevarying and time-invariant parameters.In this manuscript,we first detail a general Bayesian computational framework as a continuation of our previous work.Subsequently,this framework is specifically tailored to the susceptible-infectious-removed(SIR)model which describes a basic mechanism for the spread of infectious diseases through a system of coupled nonlinear differential equations.The SIR model consists of three states,namely,the susceptible,infectious,and removed compartments.The coupling among these states is controlled by two parameters,the infection rate and the recovery rate.The simplicity of the SIR model and similar compartmental models make them applicable to many classes of infectious diseases.However,the combined assumption of a deterministic model and time-invariance among the model parameters are two significant impediments which critically limit their use for long-term predictions.The tendency of certain model parameters to vary in time due to seasonal trends,non-pharmaceutical interventions,and other random effects necessitates a model that structurally permits the incorporation of such time-varying effects.Complementary to this,is the need for a robust mechanism for the estimation of the parameters of the resulting model from data.To this end,we consider an augmented state vector,which appends the time-varying parameters to the original system states whereby the time evolution of the time-varying parameters are driven by an artificial noise process in a standard manner.Distinguishing between time-varying and time-invariant parameters in this fashion limits the introduction of artificial dynamics into the system,and provides a robust,fully Bayesian approach for estimating the timeinvariant system parameters as well as the elements of the process noise covariance matrix.This computational framework is implemented by leveraging the robustness of the Markov chain Monte Carlo algorithm permits the estimation of time-invariant parameters while nested nonlinear filters concurrently perform the joint estimation of the system states and time-varying parameters.We demonstrate performance of the framework by first considering a series of examples using synthetic data,followed by an exposition on public health data collected in the province of Ontario.展开更多
To evaluate the performance of a photovoltaic panel, several parameters must be extracted from the photovoltaic. These parameters are very important for the evaluation, monitoring and optimization of photovoltaic. Amo...To evaluate the performance of a photovoltaic panel, several parameters must be extracted from the photovoltaic. These parameters are very important for the evaluation, monitoring and optimization of photovoltaic. Among the methods developed to extract photovoltaic parameters from current-voltage (I-V) characteristic curve, metaheuristic algorithms are the most used nowadays. A new metaheuristic algorithm namely enhanced vibrating particles system algorithm is presented here to extract the best values of parameters of a photovoltaic cell. Five recent algorithms (grey wolf optimization (GWO), moth-flame optimization algorithm (MFOA), multi-verse optimizer (MVO), whale optimization algorithm (WAO), salp swarm-inspired algorithm (SSA)) are also implemented on the same computer. Enhanced vibrating particles system is inspired by the free vibration of the single degree of freedom systems with viscous damping. To extract the photovoltaic parameters using enhanced vibrating particles system algorithm, the problem can be set as an optimization problem with the objective to minimize the difference between measured and estimated current. Four case studies have been implemented here. The results and comparison with other methods exhibit high accuracy and validity of the proposed enhanced vibrating particles system algorithm to extract parameters of a photovoltaic cell and module.展开更多
Parameter estimation is always a difficult issue for crop model users, and inaccurate parameter values will result in deceptive model predictions. Parameter values may vary with different inversion methods due to equi...Parameter estimation is always a difficult issue for crop model users, and inaccurate parameter values will result in deceptive model predictions. Parameter values may vary with different inversion methods due to equifinality and differences in the estimating processes. Therefore, it is of great importance to evaluate the factors which may influence parameter estimates and to make a comparison of the current widely-used methods. In this study, three popular frequentist methods(SCE-UA, GA and PEST) and two Bayesian-based methods(GLUE and MCMC-AM) were applied to estimate nine cultivar parameters using the ORYZA(v3) Model. The results showed that there were substantial differences between the parameter estimates derived by the different methods, and they had strong effects on model predictions. The parameter estimates given by the frequentist methods were obviously sensitive to initial values, and the extent of the sensitivity varied with algorithms and objective functions. Among the frequentist methods, the SCE-UA was recommended due to the balance between stable convergence and high efficiency. All the parameter estimates remarkably improved the goodness of model-fit, and the parameter estimates derived from the Bayesian-based methods had relatively worse performance compared to the frequentist methods. In particular, the parameter estimates with the highest probability density of posterior distributions derived from the MCMC-AM method(MCMC_P_(max)) led to results equivalent to those derived from the frequentist methods, and even better in some situations. Additionally, model accuracy was greatly influenced by the values of phenology parameters in validation.展开更多
Nowadays a common problem when processing data sets with the large number of covariates compared to small sample sizes (fat data sets) is to estimate the parameters associated with each covariate. When the number of c...Nowadays a common problem when processing data sets with the large number of covariates compared to small sample sizes (fat data sets) is to estimate the parameters associated with each covariate. When the number of covariates far exceeds the number of samples, the parameter estimation becomes very difficult. Researchers in many fields such as text categorization deal with the burden of finding and estimating important covariates without overfitting the model. In this study, we developed a Sparse Probit Bayesian Model (SPBM) based on Gibbs sampling which utilizes double exponentials prior to induce shrinkage and reduce the number of covariates in the model. The method was evaluated using ten domains such as mathematics, the corpuses of which were downloaded from Wikipedia. From the downloaded corpuses, we created the TFIDF matrix corresponding to all domains and divided the whole data set randomly into training and testing groups of size 300. To make the model more robust we performed 50 re-samplings on selection of training and test groups. The model was implemented in R and the Gibbs sampler ran for 60 k iterations and the first 20 k was discarded as burn in. We performed classification on training and test groups by calculating P (yi = 1) and according to [1] [2] the threshold of 0.5 was used as decision rule. Our model’s performance was compared to Support Vector Machines (SVM) using average sensitivity and specificity across 50 runs. The SPBM achieved high classification accuracy and outperformed SVM in almost all domains analyzed.展开更多
In single-frequency precise-point positioning of a satellite,ionosphere delay is one of the most important factors impacting the accuracy. Because of the instability of the ionosphere and uncertainty of its physical p...In single-frequency precise-point positioning of a satellite,ionosphere delay is one of the most important factors impacting the accuracy. Because of the instability of the ionosphere and uncertainty of its physical properties, the positioning accuracy is seriously limited when using a precision-limited model for correction. In order to reduce the error, we propose to introduce some ionosphere parameter for real-time ionosphere-delay estimation by applying various mapping functions. Through calculation with data from the IGS( International GPS Service) tracking station and comparison among results of using several different models and mapping functions, the feasibility and effectiveness of the new method are verified.展开更多
为了准确监测和客观评估稻纵卷叶螟对水稻生长发育和产量形成的危害,利用ASD Field Spec3地物波谱仪和SPAD-502叶绿素仪分别采集控制大田试验(2015年和2019年)和自然大田试验(2020年)在各生育期(拔节期、孕穗期、灌浆期、成熟期)水稻的...为了准确监测和客观评估稻纵卷叶螟对水稻生长发育和产量形成的危害,利用ASD Field Spec3地物波谱仪和SPAD-502叶绿素仪分别采集控制大田试验(2015年和2019年)和自然大田试验(2020年)在各生育期(拔节期、孕穗期、灌浆期、成熟期)水稻的冠层高光谱数据和SPAD值,调查采集样点的虫量和水稻卷叶率,对比分析两种试验中稻纵卷叶螟的虫害发生特征、水稻冠层光谱特征和水稻生理生态参数特征,建立基于高光谱参数的水稻受稻纵卷叶螟危害的生理生态参数估算模型。结果表明,(1)两种试验的水稻SPAD值和冠层的红边至近红外波段的反射率均随着稻纵卷叶螟虫害程度的加重而降低,而可见光波段的反射率则相反;(2)自然大田试验的SPAD值和红光至近红外波段的冠层反射率在水稻生长发育前期要显著低于控制大田试验,而到了后期则反而要略高于控制大田试验;(3)综合分析筛选出自然大田试验和控制大田试验中的多个虫害特征参数和植被指数分别构建出了SPAD的单因子和多因子估算模型,各模型均达到了较好的估算效果,在单因子模型中EVI的二项式函数模拟效果最好,而多因子线性回归估测模型的模拟效果优于所有的单因子模型;(4)通过2021年对这些模型的应用检验发现:这些模型中基于虫量、卷叶率、OSAVI、EVI和DVI的单因子估算模型的SPAD估算值与实测值拟合度很高,其Rv 2均超过了0.8,达到了比较理想的估算效果,这为稻纵卷叶螟危害下的水稻SPAD值估测提供了一种精度较高且可行的估算方法。展开更多
文摘Hydrocracking is a catalytic reaction process in the petroleum refineries for converting the higher boiling temperature residue of crude oil into a lighter fraction of hydrocarbons such as gasoline and diesel. In this study, a modified continuous lumping kinetic approach is applied to model the hydro-cracking of vacuum gas oil. The model is modified to take into consideration the reactor temperature on the reaction yield distribution. The model is calibrated by maximizing the likelihood function between the modeled and measured data at four different reactor temperatures. Bayesian approach parameter estimation is also applied to obtain the confidence interval of model parameters by considering the uncertainty associated with the measured errors and the model structural errors. Then Monte Carlo simulation is applied to the posterior range of the model parameters to obtain the 95% confidence interval of the model outputs for each individual fraction of the hydrocracking products. A good agreement is observed between the output of the calibrated model and the measured data points. The Bayesian approach based on the Markov Chain Monte Carlo simulation is shown to be efficient to quantify the uncertainty associated with the parameter values of the continuous lumping model.
文摘Most previous land-surface model calibration studies have defined globalranges for their parameters to search for optimal parameter sets. Little work has been conducted tostudy the impacts of realistic versus global ranges as well as model complexities on the calibrationand uncertainty estimates. The primary purpose of this paper is to investigate these impacts byemploying Bayesian Stochastic Inversion (BSI) to the Chameleon Surface Model (CHASM). The CHASM wasdesigned to explore the general aspects of land-surface energy balance representation within acommon modeling framework that can be run from a simple energy balance formulation to a complexmosaic type structure. The BSI is an uncertainty estimation technique based on Bayes theorem,importance sampling, and very fast simulated annealing. The model forcing data and surface flux datawere collected at seven sites representing a wide range of climate and vegetation conditions. Foreach site, four experiments were performed with simple and complex CHASM formulations as well asrealistic and global parameter ranges. Twenty eight experiments were conducted and 50 000 parametersets were used for each run. The results show that the use of global and realistic ranges givessimilar simulations for both modes for most sites, but the global ranges tend to produce someunreasonable optimal parameter values. Comparison of simple and complex modes shows that the simplemode has more parameters with unreasonable optimal values. Use of parameter ranges and modelcomplexities have significant impacts on frequency distribution of parameters, marginal posteriorprobability density functions, and estimates of uncertainty of simulated sensible and latent heatfluxes. Comparison between model complexity and parameter ranges shows that the former has moresignificant impacts on parameter and uncertainty estimations.
基金the funding from the New Frontiers in Research Fund(NFRF)2022 Special Call e Research for Postpandemic Recovery(Grant no:NFRFR-2022-00395).
文摘We consider state and parameter estimation for compartmental models having both timevarying and time-invariant parameters.In this manuscript,we first detail a general Bayesian computational framework as a continuation of our previous work.Subsequently,this framework is specifically tailored to the susceptible-infectious-removed(SIR)model which describes a basic mechanism for the spread of infectious diseases through a system of coupled nonlinear differential equations.The SIR model consists of three states,namely,the susceptible,infectious,and removed compartments.The coupling among these states is controlled by two parameters,the infection rate and the recovery rate.The simplicity of the SIR model and similar compartmental models make them applicable to many classes of infectious diseases.However,the combined assumption of a deterministic model and time-invariance among the model parameters are two significant impediments which critically limit their use for long-term predictions.The tendency of certain model parameters to vary in time due to seasonal trends,non-pharmaceutical interventions,and other random effects necessitates a model that structurally permits the incorporation of such time-varying effects.Complementary to this,is the need for a robust mechanism for the estimation of the parameters of the resulting model from data.To this end,we consider an augmented state vector,which appends the time-varying parameters to the original system states whereby the time evolution of the time-varying parameters are driven by an artificial noise process in a standard manner.Distinguishing between time-varying and time-invariant parameters in this fashion limits the introduction of artificial dynamics into the system,and provides a robust,fully Bayesian approach for estimating the timeinvariant system parameters as well as the elements of the process noise covariance matrix.This computational framework is implemented by leveraging the robustness of the Markov chain Monte Carlo algorithm permits the estimation of time-invariant parameters while nested nonlinear filters concurrently perform the joint estimation of the system states and time-varying parameters.We demonstrate performance of the framework by first considering a series of examples using synthetic data,followed by an exposition on public health data collected in the province of Ontario.
文摘To evaluate the performance of a photovoltaic panel, several parameters must be extracted from the photovoltaic. These parameters are very important for the evaluation, monitoring and optimization of photovoltaic. Among the methods developed to extract photovoltaic parameters from current-voltage (I-V) characteristic curve, metaheuristic algorithms are the most used nowadays. A new metaheuristic algorithm namely enhanced vibrating particles system algorithm is presented here to extract the best values of parameters of a photovoltaic cell. Five recent algorithms (grey wolf optimization (GWO), moth-flame optimization algorithm (MFOA), multi-verse optimizer (MVO), whale optimization algorithm (WAO), salp swarm-inspired algorithm (SSA)) are also implemented on the same computer. Enhanced vibrating particles system is inspired by the free vibration of the single degree of freedom systems with viscous damping. To extract the photovoltaic parameters using enhanced vibrating particles system algorithm, the problem can be set as an optimization problem with the objective to minimize the difference between measured and estimated current. Four case studies have been implemented here. The results and comparison with other methods exhibit high accuracy and validity of the proposed enhanced vibrating particles system algorithm to extract parameters of a photovoltaic cell and module.
基金supported by the National Natural Science Foundation of China(NSFC 51909004)。
文摘Parameter estimation is always a difficult issue for crop model users, and inaccurate parameter values will result in deceptive model predictions. Parameter values may vary with different inversion methods due to equifinality and differences in the estimating processes. Therefore, it is of great importance to evaluate the factors which may influence parameter estimates and to make a comparison of the current widely-used methods. In this study, three popular frequentist methods(SCE-UA, GA and PEST) and two Bayesian-based methods(GLUE and MCMC-AM) were applied to estimate nine cultivar parameters using the ORYZA(v3) Model. The results showed that there were substantial differences between the parameter estimates derived by the different methods, and they had strong effects on model predictions. The parameter estimates given by the frequentist methods were obviously sensitive to initial values, and the extent of the sensitivity varied with algorithms and objective functions. Among the frequentist methods, the SCE-UA was recommended due to the balance between stable convergence and high efficiency. All the parameter estimates remarkably improved the goodness of model-fit, and the parameter estimates derived from the Bayesian-based methods had relatively worse performance compared to the frequentist methods. In particular, the parameter estimates with the highest probability density of posterior distributions derived from the MCMC-AM method(MCMC_P_(max)) led to results equivalent to those derived from the frequentist methods, and even better in some situations. Additionally, model accuracy was greatly influenced by the values of phenology parameters in validation.
文摘Nowadays a common problem when processing data sets with the large number of covariates compared to small sample sizes (fat data sets) is to estimate the parameters associated with each covariate. When the number of covariates far exceeds the number of samples, the parameter estimation becomes very difficult. Researchers in many fields such as text categorization deal with the burden of finding and estimating important covariates without overfitting the model. In this study, we developed a Sparse Probit Bayesian Model (SPBM) based on Gibbs sampling which utilizes double exponentials prior to induce shrinkage and reduce the number of covariates in the model. The method was evaluated using ten domains such as mathematics, the corpuses of which were downloaded from Wikipedia. From the downloaded corpuses, we created the TFIDF matrix corresponding to all domains and divided the whole data set randomly into training and testing groups of size 300. To make the model more robust we performed 50 re-samplings on selection of training and test groups. The model was implemented in R and the Gibbs sampler ran for 60 k iterations and the first 20 k was discarded as burn in. We performed classification on training and test groups by calculating P (yi = 1) and according to [1] [2] the threshold of 0.5 was used as decision rule. Our model’s performance was compared to Support Vector Machines (SVM) using average sensitivity and specificity across 50 runs. The SPBM achieved high classification accuracy and outperformed SVM in almost all domains analyzed.
基金supported by the National Natural Science Foundation of China(40902081,40774001,40841021)
文摘In single-frequency precise-point positioning of a satellite,ionosphere delay is one of the most important factors impacting the accuracy. Because of the instability of the ionosphere and uncertainty of its physical properties, the positioning accuracy is seriously limited when using a precision-limited model for correction. In order to reduce the error, we propose to introduce some ionosphere parameter for real-time ionosphere-delay estimation by applying various mapping functions. Through calculation with data from the IGS( International GPS Service) tracking station and comparison among results of using several different models and mapping functions, the feasibility and effectiveness of the new method are verified.
文摘为了准确监测和客观评估稻纵卷叶螟对水稻生长发育和产量形成的危害,利用ASD Field Spec3地物波谱仪和SPAD-502叶绿素仪分别采集控制大田试验(2015年和2019年)和自然大田试验(2020年)在各生育期(拔节期、孕穗期、灌浆期、成熟期)水稻的冠层高光谱数据和SPAD值,调查采集样点的虫量和水稻卷叶率,对比分析两种试验中稻纵卷叶螟的虫害发生特征、水稻冠层光谱特征和水稻生理生态参数特征,建立基于高光谱参数的水稻受稻纵卷叶螟危害的生理生态参数估算模型。结果表明,(1)两种试验的水稻SPAD值和冠层的红边至近红外波段的反射率均随着稻纵卷叶螟虫害程度的加重而降低,而可见光波段的反射率则相反;(2)自然大田试验的SPAD值和红光至近红外波段的冠层反射率在水稻生长发育前期要显著低于控制大田试验,而到了后期则反而要略高于控制大田试验;(3)综合分析筛选出自然大田试验和控制大田试验中的多个虫害特征参数和植被指数分别构建出了SPAD的单因子和多因子估算模型,各模型均达到了较好的估算效果,在单因子模型中EVI的二项式函数模拟效果最好,而多因子线性回归估测模型的模拟效果优于所有的单因子模型;(4)通过2021年对这些模型的应用检验发现:这些模型中基于虫量、卷叶率、OSAVI、EVI和DVI的单因子估算模型的SPAD估算值与实测值拟合度很高,其Rv 2均超过了0.8,达到了比较理想的估算效果,这为稻纵卷叶螟危害下的水稻SPAD值估测提供了一种精度较高且可行的估算方法。