Mortality rate of gastric cancer is about 20.93/100000 which is the highest malignancy in China. The scientist of our country are at present interested in studying the postoperative survival model by multivariate anal...Mortality rate of gastric cancer is about 20.93/100000 which is the highest malignancy in China. The scientist of our country are at present interested in studying the postoperative survival model by multivariate analysis method just as stepwise regression model. The proportional hazard model initiated by Cox (1972) is more advanced than other regression method which is unneccessary to suppose the distribution of survival time and easy to analyse censoring data (the latter is difficult). This paper presented the first time application of Cox model in survival analysis of gastric cancer in China. The survival analysis system (SAS-Ⅰ) software complied by the author includes multivariate anlysis by Cox model, PV analysis and estimation of survival function which could provide useful information to surgeon for treatment of cancer patients.展开更多
In the applications of COX regression models, we always encounter data sets t<span>hat contain too many variables that only a few of them contribute to the</span> model. Therefore, it will waste much more ...In the applications of COX regression models, we always encounter data sets t<span>hat contain too many variables that only a few of them contribute to the</span> model. Therefore, it will waste much more samples to estimate the “noneffective” variables in the inference. In this paper, we use a sequential procedure for constructing<span><span><span style="font-family:;" "=""> </span></span></span><span><span><span style="font-family:;" "="">the fixed size confidence set for the “effective” parameters to the model based on an adaptive shrinkage estimate such that the “effective” coefficients can be efficiently identified with the minimum sample size. Fixed design is considered for numerical simulation. The strong consistency, asymptotic distributions and convergence rates of estimates under the fixed design are obtained. In addition, the sequential procedure is shown to be asymptotically optimal in the sense of Chow and Robbins (1965).</span></span></span>展开更多
AIM:To investigate the efficiency of Cox proportional hazard model in detecting prognostic factors for gastric cancer.METHODS:We used the log-normal regression model to evaluate prognostic factors in gastric cancer an...AIM:To investigate the efficiency of Cox proportional hazard model in detecting prognostic factors for gastric cancer.METHODS:We used the log-normal regression model to evaluate prognostic factors in gastric cancer and compared it with the Cox model.Three thousand and eighteen gastric cancer patients who received a gastrectomy between 1980 and 2004 were retrospectively evaluated.Clinic-pathological factors were included in a log-normal model as well as Cox model.The akaike information criterion (AIC) was employed to compare the efficiency of both models.Univariate analysis indicated that age at diagnosis,past history,cancer location,distant metastasis status,surgical curative degree,combined other organ resection,Borrmann type,Lauren's classification,pT stage,total dissected nodes and pN stage were prognostic factors in both log-normal and Cox models.RESULTS:In the final multivariate model,age at diagnosis,past history,surgical curative degree,Borrmann type,Lauren's classification,pT stage,and pN stage were significant prognostic factors in both log-normal and Cox models.However,cancer location,distant metastasis status,and histology types were found to be significant prognostic factors in log-normal results alone.According to AIC,the log-normal model performed better than the Cox proportional hazard model (AIC value:2534.72 vs 1693.56).CONCLUSION:It is suggested that the log-normal regression model can be a useful statistical model to evaluate prognostic factors instead of the Cox proportional hazard model.展开更多
Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two est...Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.展开更多
Bangladesh is a subtropical monsoon climate characterized by wide seasonal variations in rainfall, moderately warm temperatures, and high humidity. Rainfall is the main source of irrigation water everywhere in the Ban...Bangladesh is a subtropical monsoon climate characterized by wide seasonal variations in rainfall, moderately warm temperatures, and high humidity. Rainfall is the main source of irrigation water everywhere in the Bangladesh where the inhabitants derive their income primarily from farming. Stochastic rainfall models were concerned with the occurrence of wet day and depth of rainfall for different regions to model the daily occurrence of rainfall and achieved satisfactory results around the world. In connection to the Markov chain of different order, logistic regression is conducted to visualize the dependence of current rainfall upon the rainfall of previous two-time period. It had been shown that wet day of the previous two time period compared to the dry day of previous two time period influences positively the wet day of current time period, that is the dependency of dry-wet spell for the occurrence of rain in the rainy season from April to September in the study area. Daily data are collected from meteorological department of about 26 years on rainfall of Dhaka station during the period January 1985-August 2011 to conduct the study. The test result shows that the occurrence of rainfall follows a second order Markov chain and logistic regression also tells that dry followed by dry and wet followed by wet is more likely for the rainfall of Dhaka station and also the model could perform adequately for many applications of rainfall data satisfactorily.展开更多
To stay competitive, the mobile telecommunication companies spend millions of Ghana cedi each year on building long-term relationships with their customers. Marketing managers are constantly challenged with the proble...To stay competitive, the mobile telecommunication companies spend millions of Ghana cedi each year on building long-term relationships with their customers. Marketing managers are constantly challenged with the problem of where to channel the limited resources in order to retain existing customers. This study approaches the customer retention problem in the mobile phone sector from a behavioural perspective, applying the Behavioural Perspective Model as the main analytical framework and further exploits some other factors that influence customer retention. The model includes a set of pre-behaviour and post-behaviour factors to study consumer choice, and explains its relevant drivers in a viable and comprehensive way, grounded in radical behaviourism. Data for the analysis were collected from tertiary students from Accra and Takoradi. Data collected were analysed using the multinomial regression technique. Analysis of the data revealed that the Behaviour setting factor is the only significant element in Behaviour Perspective Model. Further exploitation of behaviour situation revealed that the number of networks a customer uses, previous experience of a customer and customer’s intention are significant factors in determining customer retention in Ghana’s mobile telecommunication industry.展开更多
Background: The Poisson and the Negative Binomial distributions are commonly used to model count data. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial has a variance lar...Background: The Poisson and the Negative Binomial distributions are commonly used to model count data. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial has a variance larger than the mean and therefore both models are appropriate to model over-dispersed count data. Objectives: A new two-parameter probability distribution called the Quasi-Negative Binomial Distribution (QNBD) is being studied in this paper, generalizing the well-known negative binomial distribution. This model turns out to be quite flexible for analyzing count data. Our main objectives are to estimate the parameters of the proposed distribution and to discuss its applicability to genetics data. As an application, we demonstrate that the QNBD regression representation is utilized to model genomics data sets. Results: The new distribution is shown to provide a good fit with respect to the “Akaike Information Criterion”, AIC, considered a measure of model goodness of fit. The proposed distribution may serve as a viable alternative to other distributions available in the literature for modeling count data exhibiting overdispersion, arising in various fields of scientific investigation such as genomics and biomedicine.展开更多
App store provides rich information for software vendors and customers to understand the market of mobile applications. However, app store analysis don’t consider some vital factors such as version number, app descri...App store provides rich information for software vendors and customers to understand the market of mobile applications. However, app store analysis don’t consider some vital factors such as version number, app description and app name currently. In this paper we propose an approach that App Store Analysis can be used to predict app downloads. We use data mining to extract app name and description and app rank information etc. from the Wandoujia App Store and AppCha App Store. We use questionnaire and sentimentanalysis to quantify some app nonnumeric information. We revealed strong correlations app name score, app rank, app rating with app downloads by Spearman’s rank correlation analysis respectively. Finally, we establish a multiple nonlinear regression model which app downloads defined as dependent variable and three relevant attributes defined as independent variable. On average, 59.28 % of apps in Wandoujia App Store and 66.68 % of apps in AppCha App Store can be predicted accurately within threshold which error rate is 25 %. One can observe the more detailed classification of app store, the more accurate for regression modeling to predict app downloads. Our approach can help app developers to notice and optimize the vital factors which influence app downloads.展开更多
Background: The Center of Molecular Immunology (CIM) is a center in Cuba devoted to the research, development and manufacturing of biotechnological products. CIMAvax?EGF is a vaccine for the treatment of non-small cel...Background: The Center of Molecular Immunology (CIM) is a center in Cuba devoted to the research, development and manufacturing of biotechnological products. CIMAvax?EGF is a vaccine for the treatment of non-small cell lung cancer patients (NSCL). Purpose: The aim of this work is to evaluate the effects of some potential prognostic factors on the overall survival of patients treated with CIMAvax?EGF vaccine, based on data collected in a phase II and a phase III clinical trials. Methods: The stratified Cox regression model is used to evaluate the effects of these prognostic factors, based on separate analysis for each trial, and on the combined data from both trials. Results: Patients with Performance status 0 or 1, with IV stage of tumor and male under 60 years obtain more benefit in terms of overall survival if they receive CIMAvax?EGF. Conclusions: Vaccinated group has a better performance if patients have a performance status 0 or 1, stage IV and age under 60 years. These prognostic factors influence overall survival in a positive way for those patients that received CIMAvax?EGF.展开更多
Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives ...Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.展开更多
Penalized empirical likelihood inferential procedure is proposed for Cox's pro- portional hazards model with adaptive LASSO(ALASSO). Under reasonable conditions, we show that the proposed method has oracle property...Penalized empirical likelihood inferential procedure is proposed for Cox's pro- portional hazards model with adaptive LASSO(ALASSO). Under reasonable conditions, we show that the proposed method has oracle property and the limiting distribution of a penal- ized empirical likelihood ratio via ALASSO is a chi-square distributions. The advantage of penalized empirical likelihood is illustrated in testing hypothesis and constructing confidence sets by simulation studies and a real example.展开更多
Exclusive hypothesis testing is a new and special class of hypothesis testing.This kind of testing can be applied in survival analysis to understand the association between genomics information and clinical informatio...Exclusive hypothesis testing is a new and special class of hypothesis testing.This kind of testing can be applied in survival analysis to understand the association between genomics information and clinical information about the survival time.Besides,it is well known that Cox's proportional hazards model is the most commonly used model for regression analysis of failure time.In this paper,the authors consider doing the exclusive hypothesis testing for Cox's proportional hazards model with right-censored data.The authors propose the comprehensive test statistics to make decision,and show that the corresponding decision rule can control the asymptotic TypeⅠerrors and have good powers in theory.The numerical studies indicate that the proposed approach works well for practical situations and it is applied to a set of real data arising from Rotterdam Breast Cancer Data study that motivated this study.展开更多
文摘Mortality rate of gastric cancer is about 20.93/100000 which is the highest malignancy in China. The scientist of our country are at present interested in studying the postoperative survival model by multivariate analysis method just as stepwise regression model. The proportional hazard model initiated by Cox (1972) is more advanced than other regression method which is unneccessary to suppose the distribution of survival time and easy to analyse censoring data (the latter is difficult). This paper presented the first time application of Cox model in survival analysis of gastric cancer in China. The survival analysis system (SAS-Ⅰ) software complied by the author includes multivariate anlysis by Cox model, PV analysis and estimation of survival function which could provide useful information to surgeon for treatment of cancer patients.
文摘In the applications of COX regression models, we always encounter data sets t<span>hat contain too many variables that only a few of them contribute to the</span> model. Therefore, it will waste much more samples to estimate the “noneffective” variables in the inference. In this paper, we use a sequential procedure for constructing<span><span><span style="font-family:;" "=""> </span></span></span><span><span><span style="font-family:;" "="">the fixed size confidence set for the “effective” parameters to the model based on an adaptive shrinkage estimate such that the “effective” coefficients can be efficiently identified with the minimum sample size. Fixed design is considered for numerical simulation. The strong consistency, asymptotic distributions and convergence rates of estimates under the fixed design are obtained. In addition, the sequential procedure is shown to be asymptotically optimal in the sense of Chow and Robbins (1965).</span></span></span>
基金Supported by the Gastric Cancer Laboratory and Pathology Department of Chinese Medical University,Shenyang,Chinathe Science and Technology Program of Shenyang,No. 1081232-1-00
文摘AIM:To investigate the efficiency of Cox proportional hazard model in detecting prognostic factors for gastric cancer.METHODS:We used the log-normal regression model to evaluate prognostic factors in gastric cancer and compared it with the Cox model.Three thousand and eighteen gastric cancer patients who received a gastrectomy between 1980 and 2004 were retrospectively evaluated.Clinic-pathological factors were included in a log-normal model as well as Cox model.The akaike information criterion (AIC) was employed to compare the efficiency of both models.Univariate analysis indicated that age at diagnosis,past history,cancer location,distant metastasis status,surgical curative degree,combined other organ resection,Borrmann type,Lauren's classification,pT stage,total dissected nodes and pN stage were prognostic factors in both log-normal and Cox models.RESULTS:In the final multivariate model,age at diagnosis,past history,surgical curative degree,Borrmann type,Lauren's classification,pT stage,and pN stage were significant prognostic factors in both log-normal and Cox models.However,cancer location,distant metastasis status,and histology types were found to be significant prognostic factors in log-normal results alone.According to AIC,the log-normal model performed better than the Cox proportional hazard model (AIC value:2534.72 vs 1693.56).CONCLUSION:It is suggested that the log-normal regression model can be a useful statistical model to evaluate prognostic factors instead of the Cox proportional hazard model.
文摘Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.
文摘Bangladesh is a subtropical monsoon climate characterized by wide seasonal variations in rainfall, moderately warm temperatures, and high humidity. Rainfall is the main source of irrigation water everywhere in the Bangladesh where the inhabitants derive their income primarily from farming. Stochastic rainfall models were concerned with the occurrence of wet day and depth of rainfall for different regions to model the daily occurrence of rainfall and achieved satisfactory results around the world. In connection to the Markov chain of different order, logistic regression is conducted to visualize the dependence of current rainfall upon the rainfall of previous two-time period. It had been shown that wet day of the previous two time period compared to the dry day of previous two time period influences positively the wet day of current time period, that is the dependency of dry-wet spell for the occurrence of rain in the rainy season from April to September in the study area. Daily data are collected from meteorological department of about 26 years on rainfall of Dhaka station during the period January 1985-August 2011 to conduct the study. The test result shows that the occurrence of rainfall follows a second order Markov chain and logistic regression also tells that dry followed by dry and wet followed by wet is more likely for the rainfall of Dhaka station and also the model could perform adequately for many applications of rainfall data satisfactorily.
文摘To stay competitive, the mobile telecommunication companies spend millions of Ghana cedi each year on building long-term relationships with their customers. Marketing managers are constantly challenged with the problem of where to channel the limited resources in order to retain existing customers. This study approaches the customer retention problem in the mobile phone sector from a behavioural perspective, applying the Behavioural Perspective Model as the main analytical framework and further exploits some other factors that influence customer retention. The model includes a set of pre-behaviour and post-behaviour factors to study consumer choice, and explains its relevant drivers in a viable and comprehensive way, grounded in radical behaviourism. Data for the analysis were collected from tertiary students from Accra and Takoradi. Data collected were analysed using the multinomial regression technique. Analysis of the data revealed that the Behaviour setting factor is the only significant element in Behaviour Perspective Model. Further exploitation of behaviour situation revealed that the number of networks a customer uses, previous experience of a customer and customer’s intention are significant factors in determining customer retention in Ghana’s mobile telecommunication industry.
文摘Background: The Poisson and the Negative Binomial distributions are commonly used to model count data. The Poisson is characterized by the equality of mean and variance whereas the Negative Binomial has a variance larger than the mean and therefore both models are appropriate to model over-dispersed count data. Objectives: A new two-parameter probability distribution called the Quasi-Negative Binomial Distribution (QNBD) is being studied in this paper, generalizing the well-known negative binomial distribution. This model turns out to be quite flexible for analyzing count data. Our main objectives are to estimate the parameters of the proposed distribution and to discuss its applicability to genetics data. As an application, we demonstrate that the QNBD regression representation is utilized to model genomics data sets. Results: The new distribution is shown to provide a good fit with respect to the “Akaike Information Criterion”, AIC, considered a measure of model goodness of fit. The proposed distribution may serve as a viable alternative to other distributions available in the literature for modeling count data exhibiting overdispersion, arising in various fields of scientific investigation such as genomics and biomedicine.
文摘App store provides rich information for software vendors and customers to understand the market of mobile applications. However, app store analysis don’t consider some vital factors such as version number, app description and app name currently. In this paper we propose an approach that App Store Analysis can be used to predict app downloads. We use data mining to extract app name and description and app rank information etc. from the Wandoujia App Store and AppCha App Store. We use questionnaire and sentimentanalysis to quantify some app nonnumeric information. We revealed strong correlations app name score, app rank, app rating with app downloads by Spearman’s rank correlation analysis respectively. Finally, we establish a multiple nonlinear regression model which app downloads defined as dependent variable and three relevant attributes defined as independent variable. On average, 59.28 % of apps in Wandoujia App Store and 66.68 % of apps in AppCha App Store can be predicted accurately within threshold which error rate is 25 %. One can observe the more detailed classification of app store, the more accurate for regression modeling to predict app downloads. Our approach can help app developers to notice and optimize the vital factors which influence app downloads.
基金supported by a UICC International Cancer Technology Transfer Fellowship.
文摘Background: The Center of Molecular Immunology (CIM) is a center in Cuba devoted to the research, development and manufacturing of biotechnological products. CIMAvax?EGF is a vaccine for the treatment of non-small cell lung cancer patients (NSCL). Purpose: The aim of this work is to evaluate the effects of some potential prognostic factors on the overall survival of patients treated with CIMAvax?EGF vaccine, based on data collected in a phase II and a phase III clinical trials. Methods: The stratified Cox regression model is used to evaluate the effects of these prognostic factors, based on separate analysis for each trial, and on the combined data from both trials. Results: Patients with Performance status 0 or 1, with IV stage of tumor and male under 60 years obtain more benefit in terms of overall survival if they receive CIMAvax?EGF. Conclusions: Vaccinated group has a better performance if patients have a performance status 0 or 1, stage IV and age under 60 years. These prognostic factors influence overall survival in a positive way for those patients that received CIMAvax?EGF.
基金supported by the National Natural Science Foundation of China,Nos.82071426,81873784Clinical Cohort Construction Program of Peking University Third Hospital,No.BYSYDL2019002(all to DF)。
文摘Amyotrophic lateral sclerosis is a rare neurodegenerative disease characterized by the involvement of both upper and lower motor neurons.Early bilateral limb involvement significantly affects patients'daily lives and may lead them to be confined to bed.However,the effect of upper and lower motor neuron impairment and other risk factors on bilateral limb involvement is unclear.To address this issue,we retrospectively collected data from 586 amyotrophic lateral sclerosis patients with limb onset diagnosed at Peking University Third Hospital between January 2020 and May 2022.A univariate analysis revealed no significant differences in the time intervals of spread in different directions between individuals with upper motor neuron-dominant amyotrophic lateral sclerosis and those with classic amyotrophic lateral sclerosis.We used causal directed acyclic graphs for risk factor determination and Cox proportional hazards models to investigate the association between the duration of bilateral limb involvement and clinical baseline characteristics in amyotrophic lateral sclerosis patients.Multiple factor analyses revealed that higher upper motor neuron scores(hazard ratio[HR]=1.05,95%confidence interval[CI]=1.01–1.09,P=0.018),onset in the left limb(HR=0.72,95%CI=0.58–0.89,P=0.002),and a horizontal pattern of progression(HR=0.46,95%CI=0.37–0.58,P<0.001)were risk factors for a shorter interval until bilateral limb involvement.The results demonstrated that a greater degree of upper motor neuron involvement might cause contralateral limb involvement to progress more quickly in limb-onset amyotrophic lateral sclerosis patients.These findings may improve the management of amyotrophic lateral sclerosis patients with limb onset and the prediction of patient prognosis.
文摘Penalized empirical likelihood inferential procedure is proposed for Cox's pro- portional hazards model with adaptive LASSO(ALASSO). Under reasonable conditions, we show that the proposed method has oracle property and the limiting distribution of a penal- ized empirical likelihood ratio via ALASSO is a chi-square distributions. The advantage of penalized empirical likelihood is illustrated in testing hypothesis and constructing confidence sets by simulation studies and a real example.
基金supported by the National Natural Science Foundation of China under Grant Nos.11971064,12371262,and 12171374。
文摘Exclusive hypothesis testing is a new and special class of hypothesis testing.This kind of testing can be applied in survival analysis to understand the association between genomics information and clinical information about the survival time.Besides,it is well known that Cox's proportional hazards model is the most commonly used model for regression analysis of failure time.In this paper,the authors consider doing the exclusive hypothesis testing for Cox's proportional hazards model with right-censored data.The authors propose the comprehensive test statistics to make decision,and show that the corresponding decision rule can control the asymptotic TypeⅠerrors and have good powers in theory.The numerical studies indicate that the proposed approach works well for practical situations and it is applied to a set of real data arising from Rotterdam Breast Cancer Data study that motivated this study.