This paper discusses efficient estimation for the additive hazards regression model when only bivariate current status data are available. Current status data occur in many fields including demographical studies and t...This paper discusses efficient estimation for the additive hazards regression model when only bivariate current status data are available. Current status data occur in many fields including demographical studies and tumorigenicity experiments (Keiding, 1991; Sun, 2006) and several approaches have been proposed for the additive hazards model with univariate current status data (Linet M., 1998; Martinussen and Scheike, 2002). For bivariate data, in addition to facing the same problems as those with univariate data, one needs to deal with the association or correlation between two related failure time variables of interest. For this, we employ the copula model and an efficient estimation procedure is developed for inference. Simulation studies are performed to evaluate the proposed estimates and suggest that the approach works well in practical situations. An illustrative example is provided.展开更多
Consider the model Y=Xτβ+g(T)+ε. Here g is a smooth but unknown function, β is a k×1 parameter vector to be estimated and ε, is an random error with mean 0 and variance σ2. The asymptotically efficient esti...Consider the model Y=Xτβ+g(T)+ε. Here g is a smooth but unknown function, β is a k×1 parameter vector to be estimated and ε, is an random error with mean 0 and variance σ2. The asymptotically efficient estimator of β is constructed on the basis of the model Yi=Xτiβ+g(Ti)+εi, i=1,…,n, when the density functions of (X,T) and ε are known or unknown.Finally, an asymptotically normal estimator of σ2 is given.展开更多
This article considers a semiparametric varying-coefficient partially linear binary regression model. The semiparametric varying-coefficient partially linear regression binary model which is a generalization of binary...This article considers a semiparametric varying-coefficient partially linear binary regression model. The semiparametric varying-coefficient partially linear regression binary model which is a generalization of binary regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. One of our main objects is to estimate nonparametric component and the unknowen parameters simultaneously. It is easier to compute, and the required computation burden is much less than that of the existing two-stage estimation method. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained, and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are carried out to investigate the performance of the proposed method.展开更多
This article considers a semiparametric varying-coefficient partially linear regression model with current status data. The semiparametric varying-coefficient partially linear regression model which is a generalizatio...This article considers a semiparametric varying-coefficient partially linear regression model with current status data. The semiparametric varying-coefficient partially linear regression model which is a generalization of the partially linear regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are conducted to examine the small-sample properties of the proposed estimates and a real dataset is used to illustrate our approach.展开更多
In[Phys.Rev.A 107012427(2023)],Baldwin and Jones prove that Uhlmann–Jozsa’s fidelity between two quantum statesρandσ,i.e.,F(ρ,σ)=(Tr√√ρσ√ρ)^(2),can be written in a simplified form as F(ρ,σ)=(Tr√ρσ)^(2...In[Phys.Rev.A 107012427(2023)],Baldwin and Jones prove that Uhlmann–Jozsa’s fidelity between two quantum statesρandσ,i.e.,F(ρ,σ)=(Tr√√ρσ√ρ)^(2),can be written in a simplified form as F(ρ,σ)=(Tr√ρσ)^(2).In this article,we give an alternative proof of this result,using a function power series expansion and the properties of the trace function.Our approach not only reinforces the validity of the simplified expression but also facilitates the exploration of novel dissimilarity functions for quantum states and more complex trace functions of density operators.展开更多
A new relative efficiency of parameter estimation for generalized Gauss-Markov linear model was proposed. Its lower bound was also derived. Its properties were explored in comparison with three currently very popular ...A new relative efficiency of parameter estimation for generalized Gauss-Markov linear model was proposed. Its lower bound was also derived. Its properties were explored in comparison with three currently very popular relative efficiencies. The new relative efficiency not only reflects sensitively the error and loss caused by the substitution of the least square estimator for the best linear unbiased estimator, but also overcomes the disadvantage of weak dependence on the design matrix.展开更多
This paper studies the estimation and inference for a class of varying-coefficient regression models with error-prone covariates.The authors focus on the situation where the covariates are unobserved,there are no repe...This paper studies the estimation and inference for a class of varying-coefficient regression models with error-prone covariates.The authors focus on the situation where the covariates are unobserved,there are no repeated measurements,and the covariance matrix of the measurement errors is unknown,but some auxiliary information is available.The authors propose an instrumental variable type local polynomial estimator for the unknown varying-coefficient functions,and show that the estimator achieves the optimal nonparametric convergence rate,is asymptotically normal,and avoids using undersmoothing to allow the bandwidths to be selected using data-driven methods.A simulation is carried out to study the finite sample performance of the proposed estimator,and a real date set is analyzed to illustrate the usefulness of the developed methodology.展开更多
This article concerded with a semiparametric generalized partial linear model (GPLM) with the type Ⅱ censored data. A sieve maximum likelihood estimator (MLE) is proposed to estimate the parameter component, allo...This article concerded with a semiparametric generalized partial linear model (GPLM) with the type Ⅱ censored data. A sieve maximum likelihood estimator (MLE) is proposed to estimate the parameter component, allowing exploration of the nonlinear relationship between a certain covariate and the response function. Asymptotic properties of the proposed sieve MLEs are discussed. Under some mild conditions, the estimators are shown to be strongly consistent. Moreover, the estimators of the unknown parameters are asymptotically normal and efficient, and the estimator of the nonparametric function has an optimal convergence rate.展开更多
Kink model is developed to analyze the data where the regression function is two-stage piecewise linear with respect to the threshold covariate but continuous at an unknown kink point.In quantile regression for longit...Kink model is developed to analyze the data where the regression function is two-stage piecewise linear with respect to the threshold covariate but continuous at an unknown kink point.In quantile regression for longitudinal data,kink point where the kink effect happens is often assumed to be heterogeneous across different quantiles.However,the kink point tends to be the same across different quantiles,especially in a region of neighboring quantile levels.Incorporating such homogeneity information could increase the estimation efficiency of the common kink point.In this paper,we propose a composite quantile estimation approach for the common kink point by combining information from multiple neighboring quantiles.Asymptotic normality of the proposed estimator is studied.In addition,we also develop a sup-likelihood-ratio test to check the existence of the kink effect at a given quantile level.A test-inversion confidence interval for the common kink point is also developed based on the quantile rank score test.The simulation studies show that the proposed composite kink estimator is more efficient than the single quantile regression estimator.We also illustrate the proposed method via an application to a longitudinal data set on blood pressure and body mass index.展开更多
Case-cohort study designs are widely used to reduce the cost of large cohort studies. When several diseases are of interest, we can use the same subcohort. In this paper, we will study the casecohort design of margina...Case-cohort study designs are widely used to reduce the cost of large cohort studies. When several diseases are of interest, we can use the same subcohort. In this paper, we will study the casecohort design of marginal additive hazards model for multiple outcomes by a more efficient version. Instead of analyzing each disease separately, ignoring the additional exposure measurements collected on subjects with other diseases, we propose a new weighted estimating equation approach to improve the efficiency by utilizing as much information collected as possible. The consistency and asymptotic normality of the resulting estimator are established. Simulation studies are conducted to examine the finite sample performance of the proposed estimator, which confirm the efficiency gains.展开更多
To analyze the efficiency of area estimations(i.e.estimation accuracy and variation of estimation)impacted by crop mapping error,we simulated error at eight levels for thematic maps using a stratified sampling estimat...To analyze the efficiency of area estimations(i.e.estimation accuracy and variation of estimation)impacted by crop mapping error,we simulated error at eight levels for thematic maps using a stratified sampling estimation methodology.The results show that the estimation efficiency is influenced by the combination of the sample size and the error level.Evaluating the trade-offs between sample size and error level showed that reducing the crop mapping error level provides the most benefit(i.e.higher estimation efficiency).Further,sampling performance differed based on the heterogeneity of the crop area.The results demonstrated that the influence of increasing the error level on estimation efficiency is more detrimental in heterogeneous areas than in homogeneous ones.Therefore,to obtain higher estimation efficiency,a larger sample size and lower error level or both are needed,especially in heterogeneous areas.We suggest that existing land-cover maps should first be used to determine the heterogeneity of the area.The appropriate sample size for these areas then can be determined according to all three factors:heterogeneity,expected estimation efficiency,and sampling budget.Overall,extending our understanding of the impacts of crop mapping error is necessary for decision making to improve our ability to effectively estimate crop area.展开更多
This investigation was designed to approach a novel estimation method of glottal vocal efficiency (GVE) based on conversion function of voice source. The conversion function of voice source was defined the ratio of su...This investigation was designed to approach a novel estimation method of glottal vocal efficiency (GVE) based on conversion function of voice source. The conversion function of voice source was defined the ratio of supra-glottal acoustic voice source signal to the glottal air volume flow velocity waveform in frequency domain. A carefully designed in vivo canine larynx experiment and several human experiments including different vowels, pressed, falsetto, breath and typical laryngeal diseases were adopted to demonstrate this alternative GVE method. Compared with other vocal efficiency, it is shown that this method could eliminate the contribution from the super vocal tract transmission and resonance to GVE, and reflect the differences of phonation modes. The average magnitude of this conversion function in frequency domain represents GVE, and the variation of the magnitude in fundamental frequency is identical to AC/DC value.展开更多
The randomized response (RR) technique is an effective survey method when collecting sensitive information. This paper proposes a new non-randomized response model for survey sampling with polychotomous sensitive qu...The randomized response (RR) technique is an effective survey method when collecting sensitive information. This paper proposes a new non-randomized response model for survey sampling with polychotomous sensitive questions: Two-valued response technique. The proportion of the sensitive attribute and its estimator's variance are estimated under the simple random sampling without replacement and with replacement designs. The relation between the efficiency and the protection of privacy is discussed. The result indicates that the efficiency is in conflict with protection of privacy in the new proposed model, which is the same as that in the RR technique, but the new technique has better characteristics in sociological practice aspects than the RR technique, because it is of low cost, saving time and being operated easily.展开更多
This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and...This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and especially, Goetghebeur and Ryan considered the situation where both the failure time and the censoring time follow the proportional hazards models marginally and developed an estimating equation approach. One limitation of their approach is that the two baseline hazard functions were assumed to be proportional to each other. We consider the same problem and present an efficient estimation procedure for regression parameters that does not require the proportionality assumption. An EM algorithm is developed and the method is evaluated by a simulation study, which indicates that the proposed methodology performs well for practical situations. An illustrative example is provided.展开更多
This paper studies identification of systems in which the system output is quantized, transmitted through a digital communication channel, and observed afterwards. The concept of the CR Ratio is introduced to characte...This paper studies identification of systems in which the system output is quantized, transmitted through a digital communication channel, and observed afterwards. The concept of the CR Ratio is introduced to characterize impact of communication channels on identification. The relationship between the CR Ratio and Shannon channel capacity is discussed. Identification algorithms are further developed when the channel error probability is unknown.展开更多
In many settings,multiple data collections and analyses on the same topic are summarised separately through statistical estimators of parameters and variances,and yet there are scientificreasons for sharing some stati...In many settings,multiple data collections and analyses on the same topic are summarised separately through statistical estimators of parameters and variances,and yet there are scientificreasons for sharing some statistical parameters across these different studies.This paper summarises what is known from large-sample theory about when estimators of a common structuralparameter from several independent samples can be combined functionally,or more specificallylinearly,to obtain an asymptotically efficient estimator from the combined sample.The main ideais that such combination can be done when the separate-sample nuisance parameters,if anyexist,vary freely and independently of one another.The issues are illustrated using data from amulti-centre lung cancer clinical trial.Examples are presented to show that separate estimatorscannot always be combined in this way,and that the functionally combined separate estimators may have low or 0 efficiency compared to the unified analysis that could be performed bypooling the datasets.展开更多
基金partly supported by National Natural Science Foundation of China (Grant No. 10971015, 11131002)Key Project of Chinese Ministry of Education (Grant No. 309007)the Fundamental Research Funds for the Central Universities
文摘This paper discusses efficient estimation for the additive hazards regression model when only bivariate current status data are available. Current status data occur in many fields including demographical studies and tumorigenicity experiments (Keiding, 1991; Sun, 2006) and several approaches have been proposed for the additive hazards model with univariate current status data (Linet M., 1998; Martinussen and Scheike, 2002). For bivariate data, in addition to facing the same problems as those with univariate data, one needs to deal with the association or correlation between two related failure time variables of interest. For this, we employ the copula model and an efficient estimation procedure is developed for inference. Simulation studies are performed to evaluate the proposed estimates and suggest that the approach works well in practical situations. An illustrative example is provided.
文摘Consider the model Y=Xτβ+g(T)+ε. Here g is a smooth but unknown function, β is a k×1 parameter vector to be estimated and ε, is an random error with mean 0 and variance σ2. The asymptotically efficient estimator of β is constructed on the basis of the model Yi=Xτiβ+g(Ti)+εi, i=1,…,n, when the density functions of (X,T) and ε are known or unknown.Finally, an asymptotically normal estimator of σ2 is given.
基金Supported by National Natural Science Foundation of China (Grant Nos.10771017,10971015,10901020)Key Project of MOE,PRC (Grant No.309007)
文摘This article considers a semiparametric varying-coefficient partially linear binary regression model. The semiparametric varying-coefficient partially linear regression binary model which is a generalization of binary regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. One of our main objects is to estimate nonparametric component and the unknowen parameters simultaneously. It is easier to compute, and the required computation burden is much less than that of the existing two-stage estimation method. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained, and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are carried out to investigate the performance of the proposed method.
基金Supported by the National Natural Science Foundation of China(No.10771017,No.10231030)Key Project of Ministry of Education,PRC(No.309007)
文摘This article considers a semiparametric varying-coefficient partially linear regression model with current status data. The semiparametric varying-coefficient partially linear regression model which is a generalization of the partially linear regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are conducted to examine the small-sample properties of the proposed estimates and a real dataset is used to illustrate our approach.
文摘In[Phys.Rev.A 107012427(2023)],Baldwin and Jones prove that Uhlmann–Jozsa’s fidelity between two quantum statesρandσ,i.e.,F(ρ,σ)=(Tr√√ρσ√ρ)^(2),can be written in a simplified form as F(ρ,σ)=(Tr√ρσ)^(2).In this article,we give an alternative proof of this result,using a function power series expansion and the properties of the trace function.Our approach not only reinforces the validity of the simplified expression but also facilitates the exploration of novel dissimilarity functions for quantum states and more complex trace functions of density operators.
文摘A new relative efficiency of parameter estimation for generalized Gauss-Markov linear model was proposed. Its lower bound was also derived. Its properties were explored in comparison with three currently very popular relative efficiencies. The new relative efficiency not only reflects sensitively the error and loss caused by the substitution of the least square estimator for the best linear unbiased estimator, but also overcomes the disadvantage of weak dependence on the design matrix.
基金supported by the Graduate Student Innovation Foundation of SHUFE(#CXJJ-2011-351)supported by the Natural Sciences and Engineering Research Council of Canada
文摘This paper studies the estimation and inference for a class of varying-coefficient regression models with error-prone covariates.The authors focus on the situation where the covariates are unobserved,there are no repeated measurements,and the covariance matrix of the measurement errors is unknown,but some auxiliary information is available.The authors propose an instrumental variable type local polynomial estimator for the unknown varying-coefficient functions,and show that the estimator achieves the optimal nonparametric convergence rate,is asymptotically normal,and avoids using undersmoothing to allow the bandwidths to be selected using data-driven methods.A simulation is carried out to study the finite sample performance of the proposed estimator,and a real date set is analyzed to illustrate the usefulness of the developed methodology.
基金The talent research fund launched (3004-893325) of Dalian University of Technologythe NNSF (10271049) of China.
文摘This article concerded with a semiparametric generalized partial linear model (GPLM) with the type Ⅱ censored data. A sieve maximum likelihood estimator (MLE) is proposed to estimate the parameter component, allowing exploration of the nonlinear relationship between a certain covariate and the response function. Asymptotic properties of the proposed sieve MLEs are discussed. Under some mild conditions, the estimators are shown to be strongly consistent. Moreover, the estimators of the unknown parameters are asymptotically normal and efficient, and the estimator of the nonparametric function has an optimal convergence rate.
基金Supported by the National Natural Science Foundation of China(Grant Nos.11922117,11771361)Fujian Provincial Science Fund for Distinguished Young Scholars(Grant No.2019J06004)。
文摘Kink model is developed to analyze the data where the regression function is two-stage piecewise linear with respect to the threshold covariate but continuous at an unknown kink point.In quantile regression for longitudinal data,kink point where the kink effect happens is often assumed to be heterogeneous across different quantiles.However,the kink point tends to be the same across different quantiles,especially in a region of neighboring quantile levels.Incorporating such homogeneity information could increase the estimation efficiency of the common kink point.In this paper,we propose a composite quantile estimation approach for the common kink point by combining information from multiple neighboring quantiles.Asymptotic normality of the proposed estimator is studied.In addition,we also develop a sup-likelihood-ratio test to check the existence of the kink effect at a given quantile level.A test-inversion confidence interval for the common kink point is also developed based on the quantile rank score test.The simulation studies show that the proposed composite kink estimator is more efficient than the single quantile regression estimator.We also illustrate the proposed method via an application to a longitudinal data set on blood pressure and body mass index.
基金supported by Graduate Innovation Foundation of Shanghai University of Finance and Economics,China(Grant No.CXJJ2014-453)the second author is supported by National Natural Science Foundation of China(Grant No.11301355)+1 种基金the Technology Foundation for Selected Overseas Chinese Scholar,Ministry of Personnel of BeijingChina
文摘Case-cohort study designs are widely used to reduce the cost of large cohort studies. When several diseases are of interest, we can use the same subcohort. In this paper, we will study the casecohort design of marginal additive hazards model for multiple outcomes by a more efficient version. Instead of analyzing each disease separately, ignoring the additional exposure measurements collected on subjects with other diseases, we propose a new weighted estimating equation approach to improve the efficiency by utilizing as much information collected as possible. The consistency and asymptotic normality of the resulting estimator are established. Simulation studies are conducted to examine the finite sample performance of the proposed estimator, which confirm the efficiency gains.
基金the Major Project of High-Resolution Earth Observation System,China[grant number 09-20A05-9001-17/18]the New Hampshire Agricultural Experiment Station.This is Scientific Contribution Number 2728the USDA National Institute of Food and Agriculture McIntire Stennis Project#NH00077-M(Accession#1002519)。
文摘To analyze the efficiency of area estimations(i.e.estimation accuracy and variation of estimation)impacted by crop mapping error,we simulated error at eight levels for thematic maps using a stratified sampling estimation methodology.The results show that the estimation efficiency is influenced by the combination of the sample size and the error level.Evaluating the trade-offs between sample size and error level showed that reducing the crop mapping error level provides the most benefit(i.e.higher estimation efficiency).Further,sampling performance differed based on the heterogeneity of the crop area.The results demonstrated that the influence of increasing the error level on estimation efficiency is more detrimental in heterogeneous areas than in homogeneous ones.Therefore,to obtain higher estimation efficiency,a larger sample size and lower error level or both are needed,especially in heterogeneous areas.We suggest that existing land-cover maps should first be used to determine the heterogeneity of the area.The appropriate sample size for these areas then can be determined according to all three factors:heterogeneity,expected estimation efficiency,and sampling budget.Overall,extending our understanding of the impacts of crop mapping error is necessary for decision making to improve our ability to effectively estimate crop area.
基金This Project was supported bythe National Natural Science Foundation ofChina and under grantsNo.69925101 and No.69871023.
文摘This investigation was designed to approach a novel estimation method of glottal vocal efficiency (GVE) based on conversion function of voice source. The conversion function of voice source was defined the ratio of supra-glottal acoustic voice source signal to the glottal air volume flow velocity waveform in frequency domain. A carefully designed in vivo canine larynx experiment and several human experiments including different vowels, pressed, falsetto, breath and typical laryngeal diseases were adopted to demonstrate this alternative GVE method. Compared with other vocal efficiency, it is shown that this method could eliminate the contribution from the super vocal tract transmission and resonance to GVE, and reflect the differences of phonation modes. The average magnitude of this conversion function in frequency domain represents GVE, and the variation of the magnitude in fundamental frequency is identical to AC/DC value.
基金This research is supported by the National Natural Science Foundation of China under Grant No. 10761004, the Natural Science Foundation of Inner Mongolia under Grant Nos. 2009MS0107 and 2010MS01116, the Talent Development Foundation of Inner Mongolia (2007), and the Natural Science Foundation of Inner Mongolia University of Technology under Grant No. X200934.
文摘The randomized response (RR) technique is an effective survey method when collecting sensitive information. This paper proposes a new non-randomized response model for survey sampling with polychotomous sensitive questions: Two-valued response technique. The proportion of the sensitive attribute and its estimator's variance are estimated under the simple random sampling without replacement and with replacement designs. The relation between the efficiency and the protection of privacy is discussed. The result indicates that the efficiency is in conflict with protection of privacy in the new proposed model, which is the same as that in the RR technique, but the new technique has better characteristics in sociological practice aspects than the RR technique, because it is of low cost, saving time and being operated easily.
文摘This paper discusses regression analysis of right-censored failure time data when censoring indicators are missing for some subjects. Several methods have been developed for the analysis under different situations and especially, Goetghebeur and Ryan considered the situation where both the failure time and the censoring time follow the proportional hazards models marginally and developed an estimating equation approach. One limitation of their approach is that the two baseline hazard functions were assumed to be proportional to each other. We consider the same problem and present an efficient estimation procedure for regression parameters that does not require the proportionality assumption. An EM algorithm is developed and the method is evaluated by a simulation study, which indicates that the proposed methodology performs well for practical situations. An illustrative example is provided.
基金This research is supported in part by supported in part by the National Science Foundation under ECS-0329597, DMS-0603287, and DMS-0624849.
文摘This paper studies identification of systems in which the system output is quantized, transmitted through a digital communication channel, and observed afterwards. The concept of the CR Ratio is introduced to characterize impact of communication channels on identification. The relationship between the CR Ratio and Shannon channel capacity is discussed. Identification algorithms are further developed when the channel error probability is unknown.
基金The authors gratefully acknowledge the Eastern Cooperative Oncology Group as the source for the ECOG EST 1582dataset,and the suggestion of a referee to expand our treatment of(V)to estimating equations.
文摘In many settings,multiple data collections and analyses on the same topic are summarised separately through statistical estimators of parameters and variances,and yet there are scientificreasons for sharing some statistical parameters across these different studies.This paper summarises what is known from large-sample theory about when estimators of a common structuralparameter from several independent samples can be combined functionally,or more specificallylinearly,to obtain an asymptotically efficient estimator from the combined sample.The main ideais that such combination can be done when the separate-sample nuisance parameters,if anyexist,vary freely and independently of one another.The issues are illustrated using data from amulti-centre lung cancer clinical trial.Examples are presented to show that separate estimatorscannot always be combined in this way,and that the functionally combined separate estimators may have low or 0 efficiency compared to the unified analysis that could be performed bypooling the datasets.