Objective: In our previous work, we incorporated complete blood count (CBC) into TNM stage to develop a new prognostic score model, which was validated to improve prediction efficiency of TNM stage for nasopharynge...Objective: In our previous work, we incorporated complete blood count (CBC) into TNM stage to develop a new prognostic score model, which was validated to improve prediction efficiency of TNM stage for nasopharyngeal carcinoma (NPC). The purpose of this study was to revalidate the accuracy of the model, and its superiority to TNM stage, through data from a prospective study.Methods: CBC of 249 eligible patients from the 863 Program No. 2006AA02Z4B4 was evaluated. Prognostic index (PI) of each patient was calculated according to the score model. Then they were divided by the PI into three categories: the low-, intermediate-and high-risk patients. The 5-year disease-specific survival (DSS) of the three categories was compared by a log-rank test. The model and TNM stage (Tth edition) were compared on efficiency for predicting the 5-year DSS, through comparison of the area under curve (AUC) of their receiver-operating characteristic curves.Results: The 5-year DSS of the low-, intermediate- and high-risk patients were 96.0%, 79.1% and 62.2%, respectively. The low- and intermediate-risk patients had better DSS than the high-risk patients (P〈0.001 and P〈0.005, respectively). And there was a trend of better DSS in the low-risk patients, compared with the intermediate-risk patients (P=0.049). The AUC of the model was larger than that of TNM stage (0.726 vs. 0.661, P:0.023). Conclusions: A CBC-based prognostic score model was revalidated to be accurate and superior to TNM stage on predicting 5-year DSS of NPC.展开更多
Several economists agree to say that the need for adjustment was essential for African countries over the decade of the 80’s. The econometric analysis of a sample of 28 sub-Saharan African countries, from variables r...Several economists agree to say that the need for adjustment was essential for African countries over the decade of the 80’s. The econometric analysis of a sample of 28 sub-Saharan African countries, from variables regarded as “representatives” for the adjustment objectives, proves that this assertion cannot be completely rejected.展开更多
Often the lifecycle data occur as count of the vital events and are recorded as integers.The purpose of this article is to model the fertility behavior based on religious,educational,economic,and occupational characte...Often the lifecycle data occur as count of the vital events and are recorded as integers.The purpose of this article is to model the fertility behavior based on religious,educational,economic,and occupational characteristics.The responses of classified groups according to these determinants are examined for significant influence on fertility using Poisson regression model(PRM) based on the National Family Health Survey-3 dataset.The observed and predicted probabilities under PRM indicate modal value of two children for the Poisson distribution modeled data.Presence of dominance of two child in the data motivates the authors to adopt multinomial regression model(MRM) in order to link fertility with various socioeconomic indicators responsible for fertility variation.Choice of the explanatory factors is limited to the availability of data.Trends and patterns of preference for birth counts suggest that religion,caste,wealth,female education,and occupation are the dominant factors shaping the observed birth process.Empirical analysis suggests that both the models used in the study perform similarly on the sample data.However,fitting of MRM by taking birth count of two as comparison category shows improved Akaike information criterion and consistent Akaike information criterion values.Current work contributes to the existing literature as it attempts to provide more insight into the determinants of Indian fertility using Poisson and MRM.展开更多
Different from the usual full counting statistics theoretical work that focuses on the higher order cumulants computation by using cumulant generating function in electrical structures, Monte Carlo simulation of singl...Different from the usual full counting statistics theoretical work that focuses on the higher order cumulants computation by using cumulant generating function in electrical structures, Monte Carlo simulation of single-barrier structure is performed to obtain time series for two types of widely applicable exclusion models, counter-flows model, and tunnel model. With high-order spectrum analysis of Matlab, the validation of Monte Carlo methods is shown through the extracted first four cumulants from the time series, which are in agreement with those from cumulant generating function. After the comparison between the counter-flows model and the tunnel model in a single barrier structure, it is found that the essential difference between them consists in the strictly holding of Pauli principle in the former and in the statistical consideration of Pauli principle in the latter.展开更多
Survival of HIV/AIDS patients is crucially dependent on comprehensive and targeted medical interventions such as supply of antiretroviral therapy and monitoring disease progression with CD4 T-cell counts. Statistical ...Survival of HIV/AIDS patients is crucially dependent on comprehensive and targeted medical interventions such as supply of antiretroviral therapy and monitoring disease progression with CD4 T-cell counts. Statistical modelling approaches are helpful towards this goal. This study aims at developing Bayesian joint models with assumed generalized error distribution (GED) for the longitudinal CD4 data and two accelerated failure time distributions, Lognormal and loglogistic, for the survival time of HIV/AIDS patients. Data are obtained from patients under antiretroviral therapy follow-up at Shashemene referral hospital during January 2006-January 2012 and at Bale Robe general hospital during January 2008-March 2015. The Bayesian joint models are defined through latent variables and association parameters and with specified non-informative prior distributions for the model parameters. Simulations are conducted using Gibbs sampler algorithm implemented in the WinBUGS software. The results of the analyses of the two different data sets show that distributions of measurement errors of the longitudinal CD4 variable follow the generalized error distribution with fatter tails than the normal distribution. The Bayesian joint GED loglogistic models fit better to the data sets compared to the lognormal cases. Findings reveal that patients’ health can be improved over time. Compared to the males, female patients gain more CD4 counts. Survival time of a patient is negatively affected by TB infection. Moreover, increase in number of opportunistic infection implies decline of CD4 counts. Patients’ age negatively affects the disease marker with no effects on survival time. Improving weight may improve survival time of patients. Bayesian joint models with GED and AFT distributions are found to be useful in modelling the longitudinal and survival processes. Thus we recommend the generalized error distributions for measurement errors of the longitudinal data under the Bayesian joint modelling. Further studies may investigate the models with various types of shared random effects and more covariates with predictions.展开更多
This paper proposes some additional moment conditions for the linear feedback model with explanatory variables being predetermined, which is proposed by [1] for the purpose of dealing with count panel data. The newly ...This paper proposes some additional moment conditions for the linear feedback model with explanatory variables being predetermined, which is proposed by [1] for the purpose of dealing with count panel data. The newly proposed moment conditions include those associated with the equidispersion, the Negbin I-type model and the stationarity. The GMM estimators are constructed incorporating the additional moment conditions. Some Monte Carlo experiments indicate that the GMM estimators incorporating the additional moment conditions perform well, compared to that using only the conventional moment conditions proposed by [2,3].展开更多
Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson...Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial and the Poisson inverse Gaussian have variance larger than the mean and therefore are more appropriate to model over-dispersed count data. As an alternative to these two models, we shall use the generalized Poisson distribution for group comparisons in the presence of multiple covariates. This problem is known as the ANCOVA and is solved for continuous data. Our objectives were to develop ANCOVA using the generalized Poisson distribution, and compare its goodness of fit to that of the nonparametric Generalized Additive Models. We used real life data to show that the model performs quite satisfactorily when compared to the nonparametric Generalized Additive Models.展开更多
Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends...Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p展开更多
Clinicians need to predict the number of involved nodes in breast cancer patients in order to ascertain severity, prognosis, and design subsequent treatment. The distribution of involved nodes often displays over-disp...Clinicians need to predict the number of involved nodes in breast cancer patients in order to ascertain severity, prognosis, and design subsequent treatment. The distribution of involved nodes often displays over-dispersion—a larger variability than expected. Until now, the negative binomial model has been used to describe this distribution assuming that over-dispersion is only due to unobserved heterogeneity. The distribution of involved nodes contains a large proportion of excess zeros (negative nodes), which can lead to over-dispersion. In this situation, alternative models may better account for over-dispersion due to excess zeros. This study examines data from 1152 patients who underwent axillary dissections in a tertiary hospital in India during January 1993-January 2005. We fit and compare various count models to test model abilities to predict the number of involved nodes. We also argue for using zero inflated models in such populations where all the excess zeros come from those who have at some risk of the outcome of interest. The negative binomial regression model fits the data better than the Poisson, zero hurdle/inflated Poisson regression models. However, zero hurdle/inflated negative binomial regression models predicted the number of involved nodes much more accurately than the negative binomial model. This suggests that the number of involved nodes displays excess variability not only due to unobserved heterogeneity but also due to excess negative nodes in the data set. In this analysis, only skin changes and primary site were associated with negative nodes whereas parity, skin changes, primary site and size of tumor were associated with a greater number of involved nodes. In case of near equal performances, the zero inflated negative binomial model should be preferred over the hurdle model in describing the nodal frequency because it provides an estimate of negative nodes that are at “high-risk” of nodal involvement.展开更多
Consider a multidimensional renewal risk model, in which the claim sizes {Xk, k ≥1} form a sequence of independent and identically distributed random vectors with nonnegative components that are allowed to be depende...Consider a multidimensional renewal risk model, in which the claim sizes {Xk, k ≥1} form a sequence of independent and identically distributed random vectors with nonnegative components that are allowed to be dependent on each other. The univariate marginal distributions of these vectors have consistently varying tails and finite means. Suppose that the claim sizes and inter-arrival times correspondingly form a sequence of independent and identically distributed random pairs, with each pair obeying a dependence structure. A precise large deviation for the multidimensional renewal risk model is obtained.展开更多
The issue of document management has been raised for a long time, especially with the appearance of office automation in the 1980s, which led to dematerialization and Electronic Document Management (EDM). In the same ...The issue of document management has been raised for a long time, especially with the appearance of office automation in the 1980s, which led to dematerialization and Electronic Document Management (EDM). In the same period, workflow management has experienced significant development, but has become more focused on the industry. However, it seems to us that document workflows have not had the same interest for the scientific community. But nowadays, the emergence and supremacy of the Internet in electronic exchanges are leading to a massive dematerialization of documents;which requires a conceptual reconsideration of the organizational framework for the processing of said documents in both public and private administrations. This problem seems open to us and deserves the interest of the scientific community. Indeed, EDM has mainly focused on the storage (referencing) and circulation of documents (traceability). It paid little attention to the overall behavior of the system in processing documents. The purpose of our researches is to model document processing systems. In the previous works, we proposed a general model and its specialization in the case of small documents (any document processed by a single person at a time during its processing life cycle), which represent 70% of documents processed by administrations, according to our study. In this contribution, we extend the model for processing small documents to the case where they are managed in a system comprising document classes organized in subclasses;which is the case for most administrations. We have thus observed that this model is a Markovian <i>M<sup>L×K</sup>/M<sup>L×K</sup>/</i>1 queues network. We have analyzed the constraints of this model and deduced certain characteristics and metrics. <span style="white-space:normal;"><i></i></span><i>In fine<span style="white-space:normal;"></span></i>, the ultimate objective of our work is to design a document workflow management system, integrating a component of global behavior prediction.展开更多
In order to explore how to extract more transport information from current fluctuation, a theoretical extraction scheme is presented in a single barrier structure based on exclusion models, which include counter-flows...In order to explore how to extract more transport information from current fluctuation, a theoretical extraction scheme is presented in a single barrier structure based on exclusion models, which include counter-flows model and tunnel model. The first four cumulants of these two exclusion models are computed in a single barrier structure, and their characteristics are obtained. A scheme with the help of the first three cumulants is devised to check a transport process to follow the counter-flows model, the tunnel model or neither of them. Time series generated by Monte Carlo techniques is adopted to validate the abstraction procedure, and the result is reasonable.展开更多
In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regressio...In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regression tree approach [1]?[2]. This approach has made it possible to highlight the existence of several segments of the population of interest described by the interactions between the predictive covariates of the response to the treatment regimen.展开更多
It is difficult to measure the sizes of illegal drug user populations directly by using the survey method because of many “hidden drug addicts” and the difficulty of receiving a true response. Systematic and routine...It is difficult to measure the sizes of illegal drug user populations directly by using the survey method because of many “hidden drug addicts” and the difficulty of receiving a true response. Systematic and routine information on treatment episodes of drug users is adopted to estimate the population size in this study. Mixture models of zero-truncated Poisson distributions using the nonparametric maximum likelihood estimators (NPMLE) by means of capture-recapture repeated count data were used to project the number of drug users. The method was applied to surveillance data of drug users identified by treatment episodes in over 1140 health treatment centers in Thailand from the Bureau of Health Service System Development, Ministry of Public Health. We presented how this mixture model could be utilized to construct the unobserved frequency of drug users with no treatment episode and further estimated the total population size of drug users in the country from 2005 to 2007. The result of simulation was confirmed that mixture model is suitable when population is large. By means of mixture models, the estimations for the number of drug users were fitted with excellent goodness-of-fit values and we were also compared to the conventional Chao estimates. The NPMLE for the total number of drug users in Thailand 2005, 2006, and 2007 were 184,045 (95% CI: 181,297-86,793), 230,665 (95% CI: 226,611-234,719), 299,670 (95% CI: 294,217-305,123), respectively, also 125,265 (95% CI: 123,092-127,142), 166,287 (95% CI: 163,222-169,352), 228,898 (95% CI: 224,766 - 233,030) for the number of methamphetamine (Yaba) users, and 11,559 (95% CI: 10,234-12,884), 11,333 (95% CI: 9276-13,390), 8953 (95% CI: 7878-10,028) for the number of heroin users, respectively. The numbers of marijuana, kratom-plant, opium, and inhalant users were underestimated because their symptoms were mild and not severe enough to remedy in health treatment centers which led to the smaller size of the total number of drug users. The well-estimated sizes of heroin and methamphetamine addicts are high reliable because they are based on clearly evident count with a severe addiction problem to health treatment centers. The estimation by means of mixture models can be recommended to monitor drug demand trend and drug health service routinely;it is easy to calculate via the available programs MIXTP based on request.展开更多
基金supported by Hi-Tech Research and Development Program of China (863 Program) (No.2006AA02Z4B4)
文摘Objective: In our previous work, we incorporated complete blood count (CBC) into TNM stage to develop a new prognostic score model, which was validated to improve prediction efficiency of TNM stage for nasopharyngeal carcinoma (NPC). The purpose of this study was to revalidate the accuracy of the model, and its superiority to TNM stage, through data from a prospective study.Methods: CBC of 249 eligible patients from the 863 Program No. 2006AA02Z4B4 was evaluated. Prognostic index (PI) of each patient was calculated according to the score model. Then they were divided by the PI into three categories: the low-, intermediate-and high-risk patients. The 5-year disease-specific survival (DSS) of the three categories was compared by a log-rank test. The model and TNM stage (Tth edition) were compared on efficiency for predicting the 5-year DSS, through comparison of the area under curve (AUC) of their receiver-operating characteristic curves.Results: The 5-year DSS of the low-, intermediate- and high-risk patients were 96.0%, 79.1% and 62.2%, respectively. The low- and intermediate-risk patients had better DSS than the high-risk patients (P〈0.001 and P〈0.005, respectively). And there was a trend of better DSS in the low-risk patients, compared with the intermediate-risk patients (P=0.049). The AUC of the model was larger than that of TNM stage (0.726 vs. 0.661, P:0.023). Conclusions: A CBC-based prognostic score model was revalidated to be accurate and superior to TNM stage on predicting 5-year DSS of NPC.
文摘Several economists agree to say that the need for adjustment was essential for African countries over the decade of the 80’s. The econometric analysis of a sample of 28 sub-Saharan African countries, from variables regarded as “representatives” for the adjustment objectives, proves that this assertion cannot be completely rejected.
基金supported by R&D Grant from University of DelhiDU-DST PURSE GrantICMR Grant No.3/1/3/JRF-2010/HRD-122(35831)
文摘Often the lifecycle data occur as count of the vital events and are recorded as integers.The purpose of this article is to model the fertility behavior based on religious,educational,economic,and occupational characteristics.The responses of classified groups according to these determinants are examined for significant influence on fertility using Poisson regression model(PRM) based on the National Family Health Survey-3 dataset.The observed and predicted probabilities under PRM indicate modal value of two children for the Poisson distribution modeled data.Presence of dominance of two child in the data motivates the authors to adopt multinomial regression model(MRM) in order to link fertility with various socioeconomic indicators responsible for fertility variation.Choice of the explanatory factors is limited to the availability of data.Trends and patterns of preference for birth counts suggest that religion,caste,wealth,female education,and occupation are the dominant factors shaping the observed birth process.Empirical analysis suggests that both the models used in the study perform similarly on the sample data.However,fitting of MRM by taking birth count of two as comparison category shows improved Akaike information criterion and consistent Akaike information criterion values.Current work contributes to the existing literature as it attempts to provide more insight into the determinants of Indian fertility using Poisson and MRM.
基金Project supported by the National Natural Science Foundation of China(Grant No.60676053)Applied Material in Xi'an Innovation Funds(Grant No.XA-AM-200603)
文摘Different from the usual full counting statistics theoretical work that focuses on the higher order cumulants computation by using cumulant generating function in electrical structures, Monte Carlo simulation of single-barrier structure is performed to obtain time series for two types of widely applicable exclusion models, counter-flows model, and tunnel model. With high-order spectrum analysis of Matlab, the validation of Monte Carlo methods is shown through the extracted first four cumulants from the time series, which are in agreement with those from cumulant generating function. After the comparison between the counter-flows model and the tunnel model in a single barrier structure, it is found that the essential difference between them consists in the strictly holding of Pauli principle in the former and in the statistical consideration of Pauli principle in the latter.
文摘Survival of HIV/AIDS patients is crucially dependent on comprehensive and targeted medical interventions such as supply of antiretroviral therapy and monitoring disease progression with CD4 T-cell counts. Statistical modelling approaches are helpful towards this goal. This study aims at developing Bayesian joint models with assumed generalized error distribution (GED) for the longitudinal CD4 data and two accelerated failure time distributions, Lognormal and loglogistic, for the survival time of HIV/AIDS patients. Data are obtained from patients under antiretroviral therapy follow-up at Shashemene referral hospital during January 2006-January 2012 and at Bale Robe general hospital during January 2008-March 2015. The Bayesian joint models are defined through latent variables and association parameters and with specified non-informative prior distributions for the model parameters. Simulations are conducted using Gibbs sampler algorithm implemented in the WinBUGS software. The results of the analyses of the two different data sets show that distributions of measurement errors of the longitudinal CD4 variable follow the generalized error distribution with fatter tails than the normal distribution. The Bayesian joint GED loglogistic models fit better to the data sets compared to the lognormal cases. Findings reveal that patients’ health can be improved over time. Compared to the males, female patients gain more CD4 counts. Survival time of a patient is negatively affected by TB infection. Moreover, increase in number of opportunistic infection implies decline of CD4 counts. Patients’ age negatively affects the disease marker with no effects on survival time. Improving weight may improve survival time of patients. Bayesian joint models with GED and AFT distributions are found to be useful in modelling the longitudinal and survival processes. Thus we recommend the generalized error distributions for measurement errors of the longitudinal data under the Bayesian joint modelling. Further studies may investigate the models with various types of shared random effects and more covariates with predictions.
文摘This paper proposes some additional moment conditions for the linear feedback model with explanatory variables being predetermined, which is proposed by [1] for the purpose of dealing with count panel data. The newly proposed moment conditions include those associated with the equidispersion, the Negbin I-type model and the stationarity. The GMM estimators are constructed incorporating the additional moment conditions. Some Monte Carlo experiments indicate that the GMM estimators incorporating the additional moment conditions perform well, compared to that using only the conventional moment conditions proposed by [2,3].
文摘Count data that exhibit over dispersion (variance of counts is larger than its mean) are commonly analyzed using discrete distributions such as negative binomial, Poisson inverse Gaussian and other models. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial and the Poisson inverse Gaussian have variance larger than the mean and therefore are more appropriate to model over-dispersed count data. As an alternative to these two models, we shall use the generalized Poisson distribution for group comparisons in the presence of multiple covariates. This problem is known as the ANCOVA and is solved for continuous data. Our objectives were to develop ANCOVA using the generalized Poisson distribution, and compare its goodness of fit to that of the nonparametric Generalized Additive Models. We used real life data to show that the model performs quite satisfactorily when compared to the nonparametric Generalized Additive Models.
文摘Background: Daily paediatric asthma readmissions within 28 days are a good example of a low count time series and not easily amenable to common time series methods used in studies of asthma seasonality and time trends. We sought to model and predict daily trends of childhood asthma readmissions over time inVictoria,Australia. Methods: We used a database of 75,000 childhood asthma admissions from the Department ofHealth,Victoria,Australiain 1997-2009. Daily admissions over time were modeled using a semi parametric Generalized Additive Model (GAM) and by sex and age group. Predictions were also estimated by using these models. Results: N = 2401 asthma readmissions within 28 days occurred during study period. Of these, n = 1358 (57%) were boys. Overall, seasonal peaks occurred in winter (30.5%) followed by autumn (28.6%) and then spring (24.6%) (p
文摘Clinicians need to predict the number of involved nodes in breast cancer patients in order to ascertain severity, prognosis, and design subsequent treatment. The distribution of involved nodes often displays over-dispersion—a larger variability than expected. Until now, the negative binomial model has been used to describe this distribution assuming that over-dispersion is only due to unobserved heterogeneity. The distribution of involved nodes contains a large proportion of excess zeros (negative nodes), which can lead to over-dispersion. In this situation, alternative models may better account for over-dispersion due to excess zeros. This study examines data from 1152 patients who underwent axillary dissections in a tertiary hospital in India during January 1993-January 2005. We fit and compare various count models to test model abilities to predict the number of involved nodes. We also argue for using zero inflated models in such populations where all the excess zeros come from those who have at some risk of the outcome of interest. The negative binomial regression model fits the data better than the Poisson, zero hurdle/inflated Poisson regression models. However, zero hurdle/inflated negative binomial regression models predicted the number of involved nodes much more accurately than the negative binomial model. This suggests that the number of involved nodes displays excess variability not only due to unobserved heterogeneity but also due to excess negative nodes in the data set. In this analysis, only skin changes and primary site were associated with negative nodes whereas parity, skin changes, primary site and size of tumor were associated with a greater number of involved nodes. In case of near equal performances, the zero inflated negative binomial model should be preferred over the hurdle model in describing the nodal frequency because it provides an estimate of negative nodes that are at “high-risk” of nodal involvement.
基金Supported by the National Natural Science Foundation of China(Nos.11571058&11301481)Humanities and Social Science Foundation of the Ministry of Education of China(No.17YJC910007)+1 种基金Zhejiang Provincial Natural Science Foundation of China(No.LY17A010004)Fundamental Research Funds for the Central Universities(No.DUT17LK31)
文摘Consider a multidimensional renewal risk model, in which the claim sizes {Xk, k ≥1} form a sequence of independent and identically distributed random vectors with nonnegative components that are allowed to be dependent on each other. The univariate marginal distributions of these vectors have consistently varying tails and finite means. Suppose that the claim sizes and inter-arrival times correspondingly form a sequence of independent and identically distributed random pairs, with each pair obeying a dependence structure. A precise large deviation for the multidimensional renewal risk model is obtained.
文摘The issue of document management has been raised for a long time, especially with the appearance of office automation in the 1980s, which led to dematerialization and Electronic Document Management (EDM). In the same period, workflow management has experienced significant development, but has become more focused on the industry. However, it seems to us that document workflows have not had the same interest for the scientific community. But nowadays, the emergence and supremacy of the Internet in electronic exchanges are leading to a massive dematerialization of documents;which requires a conceptual reconsideration of the organizational framework for the processing of said documents in both public and private administrations. This problem seems open to us and deserves the interest of the scientific community. Indeed, EDM has mainly focused on the storage (referencing) and circulation of documents (traceability). It paid little attention to the overall behavior of the system in processing documents. The purpose of our researches is to model document processing systems. In the previous works, we proposed a general model and its specialization in the case of small documents (any document processed by a single person at a time during its processing life cycle), which represent 70% of documents processed by administrations, according to our study. In this contribution, we extend the model for processing small documents to the case where they are managed in a system comprising document classes organized in subclasses;which is the case for most administrations. We have thus observed that this model is a Markovian <i>M<sup>L×K</sup>/M<sup>L×K</sup>/</i>1 queues network. We have analyzed the constraints of this model and deduced certain characteristics and metrics. <span style="white-space:normal;"><i></i></span><i>In fine<span style="white-space:normal;"></span></i>, the ultimate objective of our work is to design a document workflow management system, integrating a component of global behavior prediction.
基金Project supported by the National Natural Science Foundation of China (Grant No. 60676053)Applied Material in Xi’an Innovation Funds,China (Grant No. XA-AM-200603)
文摘In order to explore how to extract more transport information from current fluctuation, a theoretical extraction scheme is presented in a single barrier structure based on exclusion models, which include counter-flows model and tunnel model. The first four cumulants of these two exclusion models are computed in a single barrier structure, and their characteristics are obtained. A scheme with the help of the first three cumulants is devised to check a transport process to follow the counter-flows model, the tunnel model or neither of them. Time series generated by Monte Carlo techniques is adopted to validate the abstraction procedure, and the result is reasonable.
文摘In this paper we aim to analyse temporal variation of CD4 cell counts for HIV-infected individuals under antiretroviral therapy by using statistical methods. This is achieved by resorting to recursive binary regression tree approach [1]?[2]. This approach has made it possible to highlight the existence of several segments of the population of interest described by the interactions between the predictive covariates of the response to the treatment regimen.
文摘It is difficult to measure the sizes of illegal drug user populations directly by using the survey method because of many “hidden drug addicts” and the difficulty of receiving a true response. Systematic and routine information on treatment episodes of drug users is adopted to estimate the population size in this study. Mixture models of zero-truncated Poisson distributions using the nonparametric maximum likelihood estimators (NPMLE) by means of capture-recapture repeated count data were used to project the number of drug users. The method was applied to surveillance data of drug users identified by treatment episodes in over 1140 health treatment centers in Thailand from the Bureau of Health Service System Development, Ministry of Public Health. We presented how this mixture model could be utilized to construct the unobserved frequency of drug users with no treatment episode and further estimated the total population size of drug users in the country from 2005 to 2007. The result of simulation was confirmed that mixture model is suitable when population is large. By means of mixture models, the estimations for the number of drug users were fitted with excellent goodness-of-fit values and we were also compared to the conventional Chao estimates. The NPMLE for the total number of drug users in Thailand 2005, 2006, and 2007 were 184,045 (95% CI: 181,297-86,793), 230,665 (95% CI: 226,611-234,719), 299,670 (95% CI: 294,217-305,123), respectively, also 125,265 (95% CI: 123,092-127,142), 166,287 (95% CI: 163,222-169,352), 228,898 (95% CI: 224,766 - 233,030) for the number of methamphetamine (Yaba) users, and 11,559 (95% CI: 10,234-12,884), 11,333 (95% CI: 9276-13,390), 8953 (95% CI: 7878-10,028) for the number of heroin users, respectively. The numbers of marijuana, kratom-plant, opium, and inhalant users were underestimated because their symptoms were mild and not severe enough to remedy in health treatment centers which led to the smaller size of the total number of drug users. The well-estimated sizes of heroin and methamphetamine addicts are high reliable because they are based on clearly evident count with a severe addiction problem to health treatment centers. The estimation by means of mixture models can be recommended to monitor drug demand trend and drug health service routinely;it is easy to calculate via the available programs MIXTP based on request.