The zero_failure data research is a new field in the recent years, but it is required urgently in practical projects, so the work has more theory and practical values. In this paper, for zero_failure data (t i,n i...The zero_failure data research is a new field in the recent years, but it is required urgently in practical projects, so the work has more theory and practical values. In this paper, for zero_failure data (t i,n i) at moment t i , if the prior distribution of the failure probability p i=p{T【t i} is quasi_exponential distribution, the author gives the p i Bayesian estimation and hierarchical Bayesian estimation and the reliability under zero_failure date condition is also obtained.展开更多
Measurement error of unbalance's vibration response plays a crucial role in calibration and on-line updating of influence coefficient(IC). Focusing on the two problems that the moment estimator of data used in cali...Measurement error of unbalance's vibration response plays a crucial role in calibration and on-line updating of influence coefficient(IC). Focusing on the two problems that the moment estimator of data used in calibration process cannot fulfill the accuracy requirement under small sample and the disturbance of measurement error cannot be effectively suppressed in updating process, an IC calibration and on-line updating method based on hierarchical Bayesian method for automatic dynamic balancing machine was proposed. During calibration process, for the repeatedly-measured data obtained from experiments with different trial weights, according to the fact that measurement error of each sensor had the same statistical characteristics, the joint posterior distribution model for the true values of the vibration response under all trial weights and measurement error was established. During the updating process, information obtained from calibration was regarded as prior information, which was utilized to update the posterior distribution of IC combined with the real-time reference information to implement online updating. Moreover, Gibbs sampling method of Markov Chain Monte Carlo(MCMC) was adopted to obtain the maximum posterior estimation of parameters to be estimated. On the independent developed dynamic balancing testbed, prediction was carried out for multiple groups of data through the proposed method and the traditional method respectively, the result indicated that estimator of influence coefficient obtained through the proposed method had higher accuracy; the proposed updating method more effectively guaranteed the measurement accuracy during the whole producing process, and meantime more reasonably compromised between the sensitivity of IC change and suppression of randomness of vibration response.展开更多
In this paper, we provide a Word Emotion Topic (WET) model to predict the complex word e- motion information from text, and discover the dis- trbution of emotions among different topics. A complex emotion is defined...In this paper, we provide a Word Emotion Topic (WET) model to predict the complex word e- motion information from text, and discover the dis- trbution of emotions among different topics. A complex emotion is defined as the combination of one or more singular emotions from following 8 basic emotion categories: joy, love, expectation, sur- prise, anxiety, sorrow, anger and hate. We use a hi- erarchical Bayesian network to model the emotions and topics in the text. Both the complex emotions and topics are drawn from raw texts, without con- sidering any complicated language features. Our ex- periment shows promising results of word emotion prediction, which outperforms the traditional parsing methods such as the Hidden Markov Model and the Conditional Random Fields(CRFs) on raw text. We also explore the topic distribution by examining the emotion topic variation in an emotion topic diagram.展开更多
This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study are...This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study area were analyzed with the HB model.The generalized extreme value(GEV)distribution was selected as the extreme flood distribution,and the GEV distribution location and scale parameters were spatially modeled through a regression approach with the drainage area as a covariate.The Markov chain Monte Carlo(MCMC)method with Gibbs sampling was employed to calculate the posterior distribution in the HB model.The results showed that the proposed HB model provided satisfactory Bayesian credible intervals for flood quantiles,while the traditional delta method could not provide reliable uncertainty estimations for large flood quantiles,due to the fact that the lower confidence bounds tended to decrease as the return periods increased.Furthermore,the HB model for regional analysis allowed for a reduction in the value of some restrictive assumptions in the traditional index flood method,such as the homogeneity region assumption and the scale invariance assumption.The HB model can also provide an uncertainty band of flood quantile prediction at a poorly gauged or ungauged site,but the index flood method with L-moments does not demonstrate this uncertainty directly.Therefore,the HB model is an effective method of implementing the flexible local and regional frequency analysis scheme,and of quantifying the associated predictive uncertainty.展开更多
Global spread of infectious disease threatens the well-being of human, domestic, and wildlife health. A proper understanding of global distribution of these diseases is an important part of disease management and poli...Global spread of infectious disease threatens the well-being of human, domestic, and wildlife health. A proper understanding of global distribution of these diseases is an important part of disease management and policy making. However, data are subject to complexities by heterogeneity across host classes. The use of frequentist methods in biostatistics and epidemiology is common and is therefore extensively utilized in answering varied research questions. In this paper, we applied the hierarchical Bayesian approach to study the spatial distribution of tuberculosis in Kenya. The focus was to identify best fitting model for modeling TB relative risk in Kenya. The Markov Chain Monte Carlo (MCMC) method via WinBUGS and R packages was used for simulations. The Deviance Information Criterion (DIC) proposed by [1] was used for models comparison and selection. Among the models considered, unstructured heterogeneity model perfumes better in terms of modeling and mapping TB RR in Kenya. Variation in TB risk is observed among Kenya counties and clustering among counties with high TB Relative Risk (RR). HIV prevalence is identified as the dominant determinant of TB. We find clustering and heterogeneity of risk among high rate counties. Although the approaches are less than ideal, we hope that our formulations provide a useful stepping stone in the development of spatial methodology for the statistical analysis of risk from TB in Kenya.展开更多
The state of in situ stress is a crucial parameter in subsurface engineering,especially for critical projects like nuclear waste repository.As one of the two ISRM suggested methods,the overcoring(OC)method is widely u...The state of in situ stress is a crucial parameter in subsurface engineering,especially for critical projects like nuclear waste repository.As one of the two ISRM suggested methods,the overcoring(OC)method is widely used to estimate the full stress tensors in rocks by independent regression analysis of the data from each OC test.However,such customary independent analysis of individual OC tests,known as no pooling,is liable to yield unreliable test-specific stress estimates due to various uncertainty sources involved in the OC method.To address this problem,a practical and no-cost solution is considered by incorporating into OC data analysis additional information implied within adjacent OC tests,which are usually available in OC measurement campaigns.Hence,this paper presents a Bayesian partial pooling(hierarchical)model for combined analysis of adjacent OC tests.We performed five case studies using OC test data made at a nuclear waste repository research site of Sweden.The results demonstrate that partial pooling of adjacent OC tests indeed allows borrowing of information across adjacent tests,and yields improved stress tensor estimates with reduced uncertainties simultaneously for all individual tests than they are independently analysed as no pooling,particularly for those unreliable no pooling stress estimates.A further model comparison shows that the partial pooling model also gives better predictive performance,and thus confirms that the information borrowed across adjacent OC tests is relevant and effective.展开更多
A concept map is a diagram depicting relationships among concepts which is used as a knowledge representation tool in many knowledge domains. In this paper, we build on the modeling framework of Hui et al. (2008) in o...A concept map is a diagram depicting relationships among concepts which is used as a knowledge representation tool in many knowledge domains. In this paper, we build on the modeling framework of Hui et al. (2008) in order to develop a concept map suitable for testing the empirical evidence of theories. We identify a theory by a set of core tenets each asserting that one set of independent variables affects one dependent variable, moreover every variable can have several operational definitions. Data consist of a selected sample of scientific articles from the empirical literature on the theory under investigation. Our “tenet map” features a number of complexities more than the original version. First the links are two-layer: first-layer links connect variables which are related in the test of the theory at issue;second-layer links represent connections which are found statistically significant. Besides, either layer matrix of link-formation probabilities is block-symmetric. In addition to a form of censoring which resembles the Hui et al. pruning step, observed maps are subject to a further censoring related to second-layer links. Still, we perform a full Bayesian analysis instead of adopting the empirical Bayes approach. Lastly, we develop a three-stage model which accounts for dependence either of data or of parameters. The investigation of the empirical support and consensus degree of new economic theories of the firm motivated the proposed methodology. In this paper, the Transaction Cost Economics view is tested by a tenet map analysis. Both the two-stage and the multilevel models identify the same tenets as the most corroborated by empirical evidence though the latter provides a more comprehensive and complex insight of relationships between constructs.展开更多
Proper understanding of global distribution of infectious diseases is an important part of disease management and policy making. However, data are subject to complexities caused by heterogeneities across host classes ...Proper understanding of global distribution of infectious diseases is an important part of disease management and policy making. However, data are subject to complexities caused by heterogeneities across host classes and space-time epidemic processes. This paper seeks to suggest or propose Bayesian spatio-temporal model for modeling and mapping tuberculosis relative risks in space and time as well identify risks factors associated with the tuberculosis and counties in Kenya with high tuberculosis relative risks. In this paper, we used spatio-temporal Bayesian hierarchical models to study the pattern of tuberculosis relative risks in Kenya. The Markov Chain Monte Carlo method via WinBUGS and R packages were used for simulations and estimation of the parameter estimates. The best fitting model is selected using the Deviance Information Criterion proposed by Spiegelhalter and colleagues. Among the spatio-temporal models used, the Knorr-Held model with space-time interaction type III and IV fit the data well but type IV appears better than type III. Variation in tuberculosis risk is observed among Kenya counties and clustering among counties with high tuberculosis relative risks. The prevalence of HIV is identified as the determinant of TB. We found clustering and heterogeneity of TB risk among high rate counties and the overall tuberculosis risk is slightly decreasing from 2002-2009. We proposed that the Knorr-Held model with interaction type IV should be used to model and map Kenyan tuberculosis relative risks. Interaction of TB relative risk in space and time increases among rural counties that share boundaries with urban counties with high tuberculosis risk. This is due to the ability of models to borrow strength from neighboring counties, such that nearby counties have similar risk. Although the approaches are less than ideal, we hope that our study provide a useful stepping stone in the development of spatial and spatio-temporal methodology for the statistical analysis of risk from tuberculosis in Kenya.展开更多
Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator....Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator. Hierarchical Bayesian approach to SAE problems offers several advantages over traditional SAE models including the ability of appropriately accounting for the type of surveyed variable. In this paper, a number of model specifications for estimating small area counts are discussed and their relative merits are illustrated. We conducted a simulation study by reproducing in a simplified form the Italian Labour Force Survey and taking the Local Labor Markets as target areas. Simulated data were generated by assuming population characteristics of interest as well as survey sampling design as known. In one set of experiments, numbers of employment/unemployment from census data were utilized, in others population characteristics were varied. Results show persistent model failures for some standard Fay-Herriot specifications and for generalized linear Poisson models with (log-)normal sampling stage, whilst either unmatched or nonnormal sampling stage models get the best performance in terms of bias, accuracy and reliability. Though, the study also found that any model noticeably improves on its performance by letting sampling variances be stochastically determined rather than assumed as known as is the general practice. Moreover, we address the issue of model determination to point out limits and possible deceptions of commonly used criteria for model selection and checking in SAE context.展开更多
<p> <span><span style="font-family:""><span style="font-family:Verdana;">Simulation (stochastic) methods are based on obtaining random samples </span><spa...<p> <span><span style="font-family:""><span style="font-family:Verdana;">Simulation (stochastic) methods are based on obtaining random samples </span><span style="color:#4F4F4F;font-family:Simsun;white-space:normal;background-color:#FFFFFF;"><span style="font-family:Verdana;">θ</span><sup><span style="font-family:Verdana;">5</span></sup></span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;"> </span><span><span style="font-family:Verdana;"> </span><span><span style="font-family:Verdana;">from the desired distribution </span><em><span style="font-family:Verdana;">p</span></em><span style="font-family:Verdana;">(</span><span style="color:#4F4F4F;font-family:Verdana;white-space:normal;background-color:#FFFFFF;">θ</span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;">)</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">and estimating the expectation of any </span></span><span><span style="font-family:Verdana;">function </span><em><span style="font-family:Verdana;">h</span></em><span style="font-family:Verdana;">(</span><span style="color:#4F4F4F;font-family:Verdana;white-space:normal;background-color:#FFFFFF;">θ</span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;">)</span><span style="font-family:Verdana;">. Simulation methods can be used for high-dimensional dis</span></span><span style="font-family:Verdana;">tributions, and there are general algorithms which work for a wide variety of models. Markov chain Monte Carlo (MCMC) methods have been important </span><span style="font-family:Verdana;">in making Bayesian inference practical for generic hierarchical models in</span><span style="font-family:Verdana;"> small area estimation. Small area estimation is a method for producing reliable estimates for small areas. Model based Bayesian small area estimation methods are becoming popular for their ability to combine information from several sources as well as taking account of spatial prediction of spatial data. In this study, detailed simulation algorithm is given and the performance of a non-trivial extension of hierarchical Bayesian model for binary data under spatial misalignment is assessed. Both areal level and unit level latent processes were considered in modeling. The process models generated from the predictors were used to construct the basis so as to alleviate the problem of collinearity </span><span style="font-family:Verdana;">between the true predictor variables and the spatial random process. The</span><span style="font-family:Verdana;"> performance of the proposed model was assessed using MCMC simulation studies. The performance was evaluated with respect to root mean square error </span><span style="font-family:Verdana;">(RMSE), Mean absolute error (MAE) and coverage probability of corres</span><span style="font-family:Verdana;">ponding 95% CI of the estimate. The estimates from the proposed model perform better than the direct estimate.</span></span></span></span> </p> <p> <span></span> </p>展开更多
This paper considers the Bayes and hierarchical Bayes approaches for analyzing clinical data on response times with available values for one or more concomitant variables. Response times are assumed to follow simple e...This paper considers the Bayes and hierarchical Bayes approaches for analyzing clinical data on response times with available values for one or more concomitant variables. Response times are assumed to follow simple exponential distributions, with a different parameter for each patient. The analyses are carried out in case of progressive censoring assuming squared error loss function and gamma distribution as priors and hyperpriors. The possibilities of using the methodology in more general situations like dose- response modeling have also been explored. Bayesian estimators derived in this paper are applied to lung cancer data set with concomitant variables.展开更多
Ore sorting is a preconcentration technology and can dramatically reduce energy and water usage to improve the sustainability and profitability of a mining operation.In porphyry Cu deposits,Cu is the primary target,wi...Ore sorting is a preconcentration technology and can dramatically reduce energy and water usage to improve the sustainability and profitability of a mining operation.In porphyry Cu deposits,Cu is the primary target,with ores usually containing secondary‘pay’metals such as Au,Mo and gangue elements such as Fe and As.Due to sensing technology limitations,secondary and deleterious materials vary in correlation type and strength with Cu but cannot be detected simultaneously via magnetic resonance(MR)ore sorting.Inferring the relationships between Cu and other elemental abundances is particularly critical for mineral processing.The variations in metal grade relationships occur due to the transition into different geological domains.This raises two questions-how to define these geological domains and how the metal grade relationship is influenced by these geological domains.In this paper,linear relationship is assumed between Cu grade and other metal grades.We applies a Bayesian hierarchical(partial-pooling)model to quantify the linear relationships between Cu,Au,and Fe grades from geochemical bore core data.The hierarchical model was compared with two other models-‘complete-pooling’model and‘nopooling’model.Mining blocks were split based on spatial domain to construct hierarchical model.Geochemical bore core data records metal grades measured from laboratory assay with spatial coordinates of sample location.Two case studies from different porphyry Cu deposits were used to evaluate the performance of the hierarchical model.Markov chain Monte Carlo(MCMC)was used to sample the posterior parameters.Our results show that the Bayesian hierarchical model dramatically reduced the posterior predictive variance for metal grades regression compared to the no-pooling model.In addition,the posterior inference in the hierarchical model is insensitive to the choice of prior.The data is wellrepresented in the posterior which indicates a robust model.The results show that the spatial domain can be successfully utilised for metal grade regression.Uncertainty in estimating the relationship between pay metals and both secondary and gangue elements is quantified and shown to be reduced with partial-pooling.Thus,the proposed Bayesian hierarchical model can offer a reliable and stable way to monitor the relationship between metal grades for ore sorting and other mineral processing options.展开更多
大多数操作系统的安全防护主要依赖基于签名或基于规则的方法,因此现有大多数的异常检测方法精度较低。因此,利用贝叶斯模型为同类群体建模,并结合时间效应与分层原则,为用户实体行为分析(User and Entity Behavior Analytics,UEBA)研...大多数操作系统的安全防护主要依赖基于签名或基于规则的方法,因此现有大多数的异常检测方法精度较低。因此,利用贝叶斯模型为同类群体建模,并结合时间效应与分层原则,为用户实体行为分析(User and Entity Behavior Analytics,UEBA)研究提供精度更高的数据集。然后,将基于实际记录的用户行为数据与贝叶斯层级图模型推测出的数据进行比较,降低模型中的误报率。该方法主要分为两个阶段:在第1阶段,基于数据驱动的方法形成用户行为聚类,定义用户的个人身份验证模式;在第2阶段,同时考虑到周期性因素和分层原则,并通过泊松分布建模。研究表明,数据驱动的聚类方法在减少误报方面能够取得更好的结果,并减轻网络安全管理的负担,进一步减少误报数量。展开更多
文摘The zero_failure data research is a new field in the recent years, but it is required urgently in practical projects, so the work has more theory and practical values. In this paper, for zero_failure data (t i,n i) at moment t i , if the prior distribution of the failure probability p i=p{T【t i} is quasi_exponential distribution, the author gives the p i Bayesian estimation and hierarchical Bayesian estimation and the reliability under zero_failure date condition is also obtained.
基金supported by National Hi-tech Research and Development Program of China (863 Program, Grant No. 2008 AA04Z114)
文摘Measurement error of unbalance's vibration response plays a crucial role in calibration and on-line updating of influence coefficient(IC). Focusing on the two problems that the moment estimator of data used in calibration process cannot fulfill the accuracy requirement under small sample and the disturbance of measurement error cannot be effectively suppressed in updating process, an IC calibration and on-line updating method based on hierarchical Bayesian method for automatic dynamic balancing machine was proposed. During calibration process, for the repeatedly-measured data obtained from experiments with different trial weights, according to the fact that measurement error of each sensor had the same statistical characteristics, the joint posterior distribution model for the true values of the vibration response under all trial weights and measurement error was established. During the updating process, information obtained from calibration was regarded as prior information, which was utilized to update the posterior distribution of IC combined with the real-time reference information to implement online updating. Moreover, Gibbs sampling method of Markov Chain Monte Carlo(MCMC) was adopted to obtain the maximum posterior estimation of parameters to be estimated. On the independent developed dynamic balancing testbed, prediction was carried out for multiple groups of data through the proposed method and the traditional method respectively, the result indicated that estimator of influence coefficient obtained through the proposed method had higher accuracy; the proposed updating method more effectively guaranteed the measurement accuracy during the whole producing process, and meantime more reasonably compromised between the sensitivity of IC change and suppression of randomness of vibration response.
基金supported by the Ministry of Education,Science,Sports and Culture,Grant-in-Aid for Scientific Research under Grant No.22240021the Grant-in-Aid for Challenging Exploratory Research under Grant No.21650030
文摘In this paper, we provide a Word Emotion Topic (WET) model to predict the complex word e- motion information from text, and discover the dis- trbution of emotions among different topics. A complex emotion is defined as the combination of one or more singular emotions from following 8 basic emotion categories: joy, love, expectation, sur- prise, anxiety, sorrow, anger and hate. We use a hi- erarchical Bayesian network to model the emotions and topics in the text. Both the complex emotions and topics are drawn from raw texts, without con- sidering any complicated language features. Our ex- periment shows promising results of word emotion prediction, which outperforms the traditional parsing methods such as the Hidden Markov Model and the Conditional Random Fields(CRFs) on raw text. We also explore the topic distribution by examining the emotion topic variation in an emotion topic diagram.
基金supported by the National Natural Science Foundation of China(Grants No.51779074 and 41371052)the Special Fund for the Public Welfare Industry of the Ministry of Water Resources of China(Grant No.201501059)+3 种基金the National Key Research and Development Program of China(Grant No.2017YFC0404304)the Jiangsu Water Conservancy Science and Technology Project(Grant No.2017027)the Program for Outstanding Young Talents in Colleges and Universities of Anhui Province(Grant No.gxyq2018143)the Natural Science Foundation of Wanjiang University of Technology(Grant No.WG18030)
文摘This study developed a hierarchical Bayesian(HB)model for local and regional flood frequency analysis in the Dongting Lake Basin,in China.The annual maximum daily flows from 15 streamflow-gauged sites in the study area were analyzed with the HB model.The generalized extreme value(GEV)distribution was selected as the extreme flood distribution,and the GEV distribution location and scale parameters were spatially modeled through a regression approach with the drainage area as a covariate.The Markov chain Monte Carlo(MCMC)method with Gibbs sampling was employed to calculate the posterior distribution in the HB model.The results showed that the proposed HB model provided satisfactory Bayesian credible intervals for flood quantiles,while the traditional delta method could not provide reliable uncertainty estimations for large flood quantiles,due to the fact that the lower confidence bounds tended to decrease as the return periods increased.Furthermore,the HB model for regional analysis allowed for a reduction in the value of some restrictive assumptions in the traditional index flood method,such as the homogeneity region assumption and the scale invariance assumption.The HB model can also provide an uncertainty band of flood quantile prediction at a poorly gauged or ungauged site,but the index flood method with L-moments does not demonstrate this uncertainty directly.Therefore,the HB model is an effective method of implementing the flexible local and regional frequency analysis scheme,and of quantifying the associated predictive uncertainty.
文摘Global spread of infectious disease threatens the well-being of human, domestic, and wildlife health. A proper understanding of global distribution of these diseases is an important part of disease management and policy making. However, data are subject to complexities by heterogeneity across host classes. The use of frequentist methods in biostatistics and epidemiology is common and is therefore extensively utilized in answering varied research questions. In this paper, we applied the hierarchical Bayesian approach to study the spatial distribution of tuberculosis in Kenya. The focus was to identify best fitting model for modeling TB relative risk in Kenya. The Markov Chain Monte Carlo (MCMC) method via WinBUGS and R packages was used for simulations. The Deviance Information Criterion (DIC) proposed by [1] was used for models comparison and selection. Among the models considered, unstructured heterogeneity model perfumes better in terms of modeling and mapping TB RR in Kenya. Variation in TB risk is observed among Kenya counties and clustering among counties with high TB Relative Risk (RR). HIV prevalence is identified as the dominant determinant of TB. We find clustering and heterogeneity of risk among high rate counties. Although the approaches are less than ideal, we hope that our formulations provide a useful stepping stone in the development of spatial methodology for the statistical analysis of risk from TB in Kenya.
基金supported by the Guangdong Basic and Applied Basic Research Foundation(2023A1515011244).
文摘The state of in situ stress is a crucial parameter in subsurface engineering,especially for critical projects like nuclear waste repository.As one of the two ISRM suggested methods,the overcoring(OC)method is widely used to estimate the full stress tensors in rocks by independent regression analysis of the data from each OC test.However,such customary independent analysis of individual OC tests,known as no pooling,is liable to yield unreliable test-specific stress estimates due to various uncertainty sources involved in the OC method.To address this problem,a practical and no-cost solution is considered by incorporating into OC data analysis additional information implied within adjacent OC tests,which are usually available in OC measurement campaigns.Hence,this paper presents a Bayesian partial pooling(hierarchical)model for combined analysis of adjacent OC tests.We performed five case studies using OC test data made at a nuclear waste repository research site of Sweden.The results demonstrate that partial pooling of adjacent OC tests indeed allows borrowing of information across adjacent tests,and yields improved stress tensor estimates with reduced uncertainties simultaneously for all individual tests than they are independently analysed as no pooling,particularly for those unreliable no pooling stress estimates.A further model comparison shows that the partial pooling model also gives better predictive performance,and thus confirms that the information borrowed across adjacent OC tests is relevant and effective.
文摘A concept map is a diagram depicting relationships among concepts which is used as a knowledge representation tool in many knowledge domains. In this paper, we build on the modeling framework of Hui et al. (2008) in order to develop a concept map suitable for testing the empirical evidence of theories. We identify a theory by a set of core tenets each asserting that one set of independent variables affects one dependent variable, moreover every variable can have several operational definitions. Data consist of a selected sample of scientific articles from the empirical literature on the theory under investigation. Our “tenet map” features a number of complexities more than the original version. First the links are two-layer: first-layer links connect variables which are related in the test of the theory at issue;second-layer links represent connections which are found statistically significant. Besides, either layer matrix of link-formation probabilities is block-symmetric. In addition to a form of censoring which resembles the Hui et al. pruning step, observed maps are subject to a further censoring related to second-layer links. Still, we perform a full Bayesian analysis instead of adopting the empirical Bayes approach. Lastly, we develop a three-stage model which accounts for dependence either of data or of parameters. The investigation of the empirical support and consensus degree of new economic theories of the firm motivated the proposed methodology. In this paper, the Transaction Cost Economics view is tested by a tenet map analysis. Both the two-stage and the multilevel models identify the same tenets as the most corroborated by empirical evidence though the latter provides a more comprehensive and complex insight of relationships between constructs.
文摘Proper understanding of global distribution of infectious diseases is an important part of disease management and policy making. However, data are subject to complexities caused by heterogeneities across host classes and space-time epidemic processes. This paper seeks to suggest or propose Bayesian spatio-temporal model for modeling and mapping tuberculosis relative risks in space and time as well identify risks factors associated with the tuberculosis and counties in Kenya with high tuberculosis relative risks. In this paper, we used spatio-temporal Bayesian hierarchical models to study the pattern of tuberculosis relative risks in Kenya. The Markov Chain Monte Carlo method via WinBUGS and R packages were used for simulations and estimation of the parameter estimates. The best fitting model is selected using the Deviance Information Criterion proposed by Spiegelhalter and colleagues. Among the spatio-temporal models used, the Knorr-Held model with space-time interaction type III and IV fit the data well but type IV appears better than type III. Variation in tuberculosis risk is observed among Kenya counties and clustering among counties with high tuberculosis relative risks. The prevalence of HIV is identified as the determinant of TB. We found clustering and heterogeneity of TB risk among high rate counties and the overall tuberculosis risk is slightly decreasing from 2002-2009. We proposed that the Knorr-Held model with interaction type IV should be used to model and map Kenyan tuberculosis relative risks. Interaction of TB relative risk in space and time increases among rural counties that share boundaries with urban counties with high tuberculosis risk. This is due to the ability of models to borrow strength from neighboring counties, such that nearby counties have similar risk. Although the approaches are less than ideal, we hope that our study provide a useful stepping stone in the development of spatial and spatio-temporal methodology for the statistical analysis of risk from tuberculosis in Kenya.
文摘Small area estimation (SAE) tackles the problem of providing reliable estimates for small areas, i.e., subsets of the population for which sample information is not sufficient to warrant the use of a direct estimator. Hierarchical Bayesian approach to SAE problems offers several advantages over traditional SAE models including the ability of appropriately accounting for the type of surveyed variable. In this paper, a number of model specifications for estimating small area counts are discussed and their relative merits are illustrated. We conducted a simulation study by reproducing in a simplified form the Italian Labour Force Survey and taking the Local Labor Markets as target areas. Simulated data were generated by assuming population characteristics of interest as well as survey sampling design as known. In one set of experiments, numbers of employment/unemployment from census data were utilized, in others population characteristics were varied. Results show persistent model failures for some standard Fay-Herriot specifications and for generalized linear Poisson models with (log-)normal sampling stage, whilst either unmatched or nonnormal sampling stage models get the best performance in terms of bias, accuracy and reliability. Though, the study also found that any model noticeably improves on its performance by letting sampling variances be stochastically determined rather than assumed as known as is the general practice. Moreover, we address the issue of model determination to point out limits and possible deceptions of commonly used criteria for model selection and checking in SAE context.
文摘<p> <span><span style="font-family:""><span style="font-family:Verdana;">Simulation (stochastic) methods are based on obtaining random samples </span><span style="color:#4F4F4F;font-family:Simsun;white-space:normal;background-color:#FFFFFF;"><span style="font-family:Verdana;">θ</span><sup><span style="font-family:Verdana;">5</span></sup></span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;"> </span><span><span style="font-family:Verdana;"> </span><span><span style="font-family:Verdana;">from the desired distribution </span><em><span style="font-family:Verdana;">p</span></em><span style="font-family:Verdana;">(</span><span style="color:#4F4F4F;font-family:Verdana;white-space:normal;background-color:#FFFFFF;">θ</span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;">)</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">and estimating the expectation of any </span></span><span><span style="font-family:Verdana;">function </span><em><span style="font-family:Verdana;">h</span></em><span style="font-family:Verdana;">(</span><span style="color:#4F4F4F;font-family:Verdana;white-space:normal;background-color:#FFFFFF;">θ</span><span style="font-family:Verdana;"></span><span style="font-family:Verdana;">)</span><span style="font-family:Verdana;">. Simulation methods can be used for high-dimensional dis</span></span><span style="font-family:Verdana;">tributions, and there are general algorithms which work for a wide variety of models. Markov chain Monte Carlo (MCMC) methods have been important </span><span style="font-family:Verdana;">in making Bayesian inference practical for generic hierarchical models in</span><span style="font-family:Verdana;"> small area estimation. Small area estimation is a method for producing reliable estimates for small areas. Model based Bayesian small area estimation methods are becoming popular for their ability to combine information from several sources as well as taking account of spatial prediction of spatial data. In this study, detailed simulation algorithm is given and the performance of a non-trivial extension of hierarchical Bayesian model for binary data under spatial misalignment is assessed. Both areal level and unit level latent processes were considered in modeling. The process models generated from the predictors were used to construct the basis so as to alleviate the problem of collinearity </span><span style="font-family:Verdana;">between the true predictor variables and the spatial random process. The</span><span style="font-family:Verdana;"> performance of the proposed model was assessed using MCMC simulation studies. The performance was evaluated with respect to root mean square error </span><span style="font-family:Verdana;">(RMSE), Mean absolute error (MAE) and coverage probability of corres</span><span style="font-family:Verdana;">ponding 95% CI of the estimate. The estimates from the proposed model perform better than the direct estimate.</span></span></span></span> </p> <p> <span></span> </p>
文摘This paper considers the Bayes and hierarchical Bayes approaches for analyzing clinical data on response times with available values for one or more concomitant variables. Response times are assumed to follow simple exponential distributions, with a different parameter for each patient. The analyses are carried out in case of progressive censoring assuming squared error loss function and gamma distribution as priors and hyperpriors. The possibilities of using the methodology in more general situations like dose- response modeling have also been explored. Bayesian estimators derived in this paper are applied to lung cancer data set with concomitant variables.
基金This research was funded by the CSIRO ResearchPlus Science Leader Grant Program.
文摘Ore sorting is a preconcentration technology and can dramatically reduce energy and water usage to improve the sustainability and profitability of a mining operation.In porphyry Cu deposits,Cu is the primary target,with ores usually containing secondary‘pay’metals such as Au,Mo and gangue elements such as Fe and As.Due to sensing technology limitations,secondary and deleterious materials vary in correlation type and strength with Cu but cannot be detected simultaneously via magnetic resonance(MR)ore sorting.Inferring the relationships between Cu and other elemental abundances is particularly critical for mineral processing.The variations in metal grade relationships occur due to the transition into different geological domains.This raises two questions-how to define these geological domains and how the metal grade relationship is influenced by these geological domains.In this paper,linear relationship is assumed between Cu grade and other metal grades.We applies a Bayesian hierarchical(partial-pooling)model to quantify the linear relationships between Cu,Au,and Fe grades from geochemical bore core data.The hierarchical model was compared with two other models-‘complete-pooling’model and‘nopooling’model.Mining blocks were split based on spatial domain to construct hierarchical model.Geochemical bore core data records metal grades measured from laboratory assay with spatial coordinates of sample location.Two case studies from different porphyry Cu deposits were used to evaluate the performance of the hierarchical model.Markov chain Monte Carlo(MCMC)was used to sample the posterior parameters.Our results show that the Bayesian hierarchical model dramatically reduced the posterior predictive variance for metal grades regression compared to the no-pooling model.In addition,the posterior inference in the hierarchical model is insensitive to the choice of prior.The data is wellrepresented in the posterior which indicates a robust model.The results show that the spatial domain can be successfully utilised for metal grade regression.Uncertainty in estimating the relationship between pay metals and both secondary and gangue elements is quantified and shown to be reduced with partial-pooling.Thus,the proposed Bayesian hierarchical model can offer a reliable and stable way to monitor the relationship between metal grades for ore sorting and other mineral processing options.
文摘大多数操作系统的安全防护主要依赖基于签名或基于规则的方法,因此现有大多数的异常检测方法精度较低。因此,利用贝叶斯模型为同类群体建模,并结合时间效应与分层原则,为用户实体行为分析(User and Entity Behavior Analytics,UEBA)研究提供精度更高的数据集。然后,将基于实际记录的用户行为数据与贝叶斯层级图模型推测出的数据进行比较,降低模型中的误报率。该方法主要分为两个阶段:在第1阶段,基于数据驱动的方法形成用户行为聚类,定义用户的个人身份验证模式;在第2阶段,同时考虑到周期性因素和分层原则,并通过泊松分布建模。研究表明,数据驱动的聚类方法在减少误报方面能够取得更好的结果,并减轻网络安全管理的负担,进一步减少误报数量。