In this paper,we analyze the relationship between the equilibrium reinsurance strategy and the tail of the distribution of the risk.Since Mean Residual Life(MRL)has a close relationship with the tail of the distributi...In this paper,we analyze the relationship between the equilibrium reinsurance strategy and the tail of the distribution of the risk.Since Mean Residual Life(MRL)has a close relationship with the tail of the distribution,we consider two classes of risk distributions,Decreasing Mean Residual Life(DMRL)and Increasing Mean Residual Life(IMRL)distributions,which can be used to classify light-tailed and heavy-tailed distributions,respectively.We assume that the underlying risk process is modelled by the classical CramérLundberg model process.Under the mean-variance criterion,by solving the extended Hamilton-Jacobi-Bellman equation,we derive the equilibrium reinsurance strategy for the insurer and the reinsurer under DMRL and IMRL,respectively.Furthermore,we analyze how to choose the reinsurance premium to make the insurer and the reinsurer agree with the same reinsurance strategy.We find that under the case of DMRL,if the distribution and the risk aversions satisfy certain conditions,the insurer and the reinsurer can adopt a reinsurance premium to agree on a reinsurance strategy,and under the case of IMRL,the insurer and the reinsurer can only agree with each other that the insurer do not purchase the reinsurance.展开更多
Because of advances in data collection and storage,statistical analysis in modern scientific research and practice now has opportunities to utilize external information such as summary statistics from similar studies....Because of advances in data collection and storage,statistical analysis in modern scientific research and practice now has opportunities to utilize external information such as summary statistics from similar studies.A likelihood approach based on a parametric model assumption has been developed in the literature to utilize external summary information when the populations for external and main internal data are assumed to be the same.In this article,we instead consider the generalized estimation equation(GEE)approach for statistical inference,which is semiparametric or nonparametric,and show how to utilize external summary information even when internal and external data populations are not the same.Our approach is coupling the internal data and external summary information to form additional estimation equations and then applying the generalized method of moments(GMM).We show that the proposed GMM estimator is asymptotically normal and,under some conditions,is more efficient than the GEE estimator without using external summary information.Estimators of the asymptotic covariance matrix of the GMM estimators are also proposed.Simulation results are obtained to confirm our theory and quantify the improvements by utilizing external data.An example is also included for illustration.展开更多
Space-filling designs are widely used in computer experiments.They are frequently evaluated by the orthogonality and distance-related criteria.Rotating orthogonal arrays is an appealing approach to constructing orthog...Space-filling designs are widely used in computer experiments.They are frequently evaluated by the orthogonality and distance-related criteria.Rotating orthogonal arrays is an appealing approach to constructing orthogonal space-filling designs.An important issue that has been rarely addressed in the literature is the design selection for the initial orthogonal arrays.This paper studies the maximin L_(2)-distance properties of orthogonal designs generated by rotating two-level orthogonal arrays under three criteria.We provide theoretical justifications for the rotation method from a maximin distance perspective and further propose to select initial orthogonal arrays by the minimum G_(2)-aberration criterion.New infinite families of orthogonal or 3-orthogonal U-type designs,which also perform well under the maximin distance criterion,are obtained and tabulated.Examples are presented to show the effectiveness of the constructed designs for building statistical surrogate models.展开更多
Motivated by a medical study that attempts to analyze the relationship between growth curves and other variables and to measure the association among multiple growth curves,the authors develop a functional multiple-ou...Motivated by a medical study that attempts to analyze the relationship between growth curves and other variables and to measure the association among multiple growth curves,the authors develop a functional multiple-outcome model to decompose the total variation of multiple functional outcomes into variation explained by independent variables with time-varying coefficient functions,by latent factors and by noise.The latent factors are the hidden common factors that influence the multiple outcomes and are found through the combined functional principal component analysis approach.Through the coefficients of the latent factors one may further explore the association of the multiple outcomes.This method is applied to the multivariate growth data of infants in a real medical study in Shanghai and produces interpretable results.Convergence rates for the proposed estimates of the varying coefficient and covariance functions of the model are derived under mild conditions.展开更多
In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linea...In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linear wavelet method is estimating the unsoothed functions,however,the classical kernel estimation cannot do this work.At the same time,we study the larger sample properties of the ISE for hazard rate estimator.展开更多
The paper first analyzes price change due to stock splits in Chinese stock markets,which shows stock prices typically go up for stock splits.Then theoretical analyses based on risk theory are presented to explain the ...The paper first analyzes price change due to stock splits in Chinese stock markets,which shows stock prices typically go up for stock splits.Then theoretical analyses based on risk theory are presented to explain the reason,where the method comes from a new perspective and obtained theoretical conclusions show that stock splits typically make stock price go up if risk-compensation function is convex,and go down if risk-compensation function is concave.Stock prices typically go up for stock splits because risk-compensation functions are mainly convex.The obtained conclusions are consistent with the known results in the last three decades.展开更多
Different psychiatric disorders share genetic relationships and pleiotropic loci to certain extent.We integrated and analyzed datasets related to major depressive disorder(MDD),bipolar disorder(BIP),and schizophrenia(...Different psychiatric disorders share genetic relationships and pleiotropic loci to certain extent.We integrated and analyzed datasets related to major depressive disorder(MDD),bipolar disorder(BIP),and schizophrenia(SCZ)from the Psychiatric Genomics Consortium using multitrait analysis of genome-wide association analysis(MTAG).MTAG significantly increased the effective sample size from 99,773 to 119,754 for MDD,from 909,061 to 1,450,972 for BIP,and from 856,677 to 940,613 for SCZ.We discovered 7,32,and 43 novel lead single nucleotide polymorphisms(SNPs)and 1,6,and 3 novel causal SNPs for MDD,BIP,and SCZ,respectively,after fine-mapping.We identified rs8039305 in the FURIN gene as a novel pleiotropic locus across the three disorders.We performed marker analysis of genomic annotation(MAGMA)and Hi-C-coupled MAGMA(H-MAGMA)based gene-set analysis and identified 101 genes associated with the three disorders,which were enriched in the regulation of postsynaptic membranes,postsynaptic membrane dopaminergic synapses,and Notch signaling pathway.Next,we performed Mendelian randomization analysis using different tools and detected a causal effect of BIP on SCZ.Overall,we demonstrated the usage of combined genome-wide association studies summary statistics for exploring potential novel mechanisms of the three psychiatric disorders,providing an alternative approach to integrate publicly available summary data.展开更多
The unified weighing scheme for the local-linear smoother in analysing functional data can deal with data that are dense,sparse or of neither type.In this paper,we focus on the convergence rate of functional principal...The unified weighing scheme for the local-linear smoother in analysing functional data can deal with data that are dense,sparse or of neither type.In this paper,we focus on the convergence rate of functional principal component analysis using this method.Almost sure asymptotic consistency and rates of convergence for the estimators of eigenvalues and eigenfunctions have been established.We also provide the convergence rate of the variance estimation of the measurement error.Based on the results,the number of observations within each curve can be of any rate relative to the sample size,which is consistent with the earlier conclusions about the asymptotic properties of the mean and covariance estimators.展开更多
We propose two simple regression models of Pearson correlation coefficient of two normal responses or binary responses to assess the effect of covariates of interest.Likelihood-based inference is established to estima...We propose two simple regression models of Pearson correlation coefficient of two normal responses or binary responses to assess the effect of covariates of interest.Likelihood-based inference is established to estimate the regression coefficients,upon which bootstrap-based method is used to test the significance of covariates of interest.Simulation studies show the effectiveness of the method in terms of type-I error control,power performance in moderate sample size and robustness with respect to model mis-specification.We illustrate the application of the proposed method to some real data concerning health measurements.展开更多
Length-biased data are encountered in many fields,including economics,engineering and epidemiological cohort studies.There are two main challenges in the analysis of such data:the assumption of independent censoring i...Length-biased data are encountered in many fields,including economics,engineering and epidemiological cohort studies.There are two main challenges in the analysis of such data:the assumption of independent censoring is violated and the assumed model for the underlying population is no longer satisfied for the observed data.In this paper,a proportional mean residual life varyingcoefficient model for length-biased data is considered and a local pseudo likelihood method is proposed for estimating the coefficient functions in the model.Asymptotic properties are investigated for the proposed estimators.The finite sample performance of the proposed methodology is demonstrated by simulation studies.Finally,the method is applied to a real data set concerning the Academy Awards.展开更多
The partially linear single-index model(PLSIM) is a flexible and powerful model for analyzing the relationship between the response and the multivariate covariates. This paper considers the PLSIM with measurement erro...The partially linear single-index model(PLSIM) is a flexible and powerful model for analyzing the relationship between the response and the multivariate covariates. This paper considers the PLSIM with measurement error possibly in all the variables. The authors propose a new efficient estimation procedure based on the local linear smoothing and the simulation-extrapolation method,and further establish the asymptotic normality of the proposed estimators for both the index parameter and nonparametric link function. The authors also carry out extensive Monte Carlo simulation studies to evaluate the finite sample performance of the new method, and apply it to analyze the osteoporosis prevention data.展开更多
The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performan...The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performance for nonparametric kernel estimation.But there are no results available for ISE of hazard rate estimation under right-censored model with censoring indicators missing at random(MAR)so far.This paper constructs an imputation estimator of the hazard rate function and establish asymptotic normality of the ISE for the kernel hazard rate estimator with censoring indicators MAR.At the same time,an asymptotic representation of the mean integrated square error(MISE)is also presented.The finite sample behavior of the estimator is investigated via one simple simulation.展开更多
The authors propose a two-step test for the two-sample problem of processes of OrnsteinUhlenbeck type. In the first step, the authors test the equality of correlation structures, based on the least square estimators o...The authors propose a two-step test for the two-sample problem of processes of OrnsteinUhlenbeck type. In the first step, the authors test the equality of correlation structures, based on the least square estimators of the correlation parameters, and the test statistic follows the standard normal distribution. If the null hypothesis is not rejected in the first step, the authors consider a second step to test the equality of marginal distributions, based on the weighted deviation of the empirical characteristic functions;the test statistic has a complicated asymptotic distribution, so that sequential bootstrap method is applied to reach a temporary decision. Simulation studies and real data analysis suggest that the proposed approach performs well in finite samples.展开更多
The rapid emergence of massive datasets in various fields poses a serious challenge to tra-ditional statistical methods.Meanwhile,it provides opportunities for researchers to develop novel algorithms.Inspired by the i...The rapid emergence of massive datasets in various fields poses a serious challenge to tra-ditional statistical methods.Meanwhile,it provides opportunities for researchers to develop novel algorithms.Inspired by the idea of divide-and-conquer,various distributed frameworks for statistical estimation and inference have been proposed.They were developed to deal with large-scale statistical optimization problems.This paper aims to provide a comprehensive review for related literature.It includes parametric models,nonparametric models,and other frequently used models.Their key ideas and theoretical properties are summarized.The trade-off between communication cost and estimate precision together with other concerns is discussed.展开更多
Firstly,this paper proposes a generalized log-determinant optimization model with the purpose of estimating the high-dimensional sparse inverse covariance matrices.Under the normality assumption,the zero components in...Firstly,this paper proposes a generalized log-determinant optimization model with the purpose of estimating the high-dimensional sparse inverse covariance matrices.Under the normality assumption,the zero components in the inverse covariance matrices represent the conditional independence between pairs of variables given all the other variables.The generalized model considered in this study,because of the setting of the eigenvalue bounded constraints,covers a large number of existing estimators as special cases.Secondly,rather than directly tracking the challenging optimization problem,this paper uses a couple of alternating direction methods of multipliers(ADMM)to solve its dual model where 5 separable structures are contained.The first implemented algorithm is based on a single Gauss–Seidel iteration,but it does not necessarily converge theoretically.In contrast,the second algorithm employs the symmetric Gauss–Seidel(sGS)based ADMM which is equivalent to the 2-block iterative scheme from the latest sGS decomposition theorem.Finally,we do numerical simulations using the synthetic data and the real data set which show that both algorithms are very effective in estimating high-dimensional sparse inverse covariance matrix.展开更多
This paper is devoted to study the proportional reinsurance/new business and investment problem under the mean-variance criterion in a continuous-time setting.The strategies are constrained in the non-negative cone an...This paper is devoted to study the proportional reinsurance/new business and investment problem under the mean-variance criterion in a continuous-time setting.The strategies are constrained in the non-negative cone and all coefficients in the model except the interest rate are stochastic processes adapted the filtration generated by a Markov chain.With the help of a backward stochastic differential equation driven by the Markov chain,we obtain the optimal strategy and optimal cost explicitly under this non-Markovian regime-switching model.The cases with one risky asset and Markov regime-switching model are considered as special cases.展开更多
We thank the editor,Professor Jun Shao,for orga-nizing this stimulating discussion.We are grateful to all discussants for their insightful comments on our review article on the distributed statistical inference.Due to...We thank the editor,Professor Jun Shao,for orga-nizing this stimulating discussion.We are grateful to all discussants for their insightful comments on our review article on the distributed statistical inference.Due to the urgent need to process the datasets with massive sizes,various distributed computing methods have been proposed for the large-scale statistical prob-lems.Meanwhile,some important theoretical results were established.While we want to give a relatively comprehensive overview on this hot topic,there are still some important works that have been missed in our review written over a year ago.However,we are glad to see the discussants provide reviews of some new works and references.We hope that these discussions and our review would serve as a stimulus for further studies in this rapidly developing area.展开更多
Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiase...Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiased sampling, and propose a new semiparametric-model-based approach to estimate quantile differences of failure time. We establish the asymptotic properties of our new estimators theoretically under mild technical conditions, and propose a resampling method for estimating their asymptotic variance. We then conduct simulations to evaluate the empirical performance and efficiency of the proposed estimators, and demonstrate their application by a real data analysis.展开更多
The identification of within-subject dependence is important for constructing efficient estimation in longitudinal data models.In this paper,we proposed a flexible way to study this dependence by using nonparametric r...The identification of within-subject dependence is important for constructing efficient estimation in longitudinal data models.In this paper,we proposed a flexible way to study this dependence by using nonparametric regression models.Specifically,we considered the estimation of varying coefficient longitudinal data model with non-stationary varying coefficient autoregressive error process over observational time quantum.Based on spline approximation and local polynomial techniques,we proposed a two-stage nonparametric estimation for unknown functional coefficients and didn’t not drop any observations in a hybrid least square loss framework.Moreover,we showed that the estimated coefficient functions are asymptotically normal and derived the asymptotic biases and variances accordingly.Monte Carlo studies and two real applications were conducted for illustrating the performance of our proposed methods.展开更多
Published auxiliary information can be helpful in conducting statistical inference in a new study.In this paper,we synthesize the auxiliary information with semiparametric likelihood-based inference for censoring data...Published auxiliary information can be helpful in conducting statistical inference in a new study.In this paper,we synthesize the auxiliary information with semiparametric likelihood-based inference for censoring data with the total sample size is available.We express the auxiliary information as constraints on the regression coefficients and the covariate distribution,then use empirical likelihood method for general estimating equations to improve the efficiency of the interested parameters in the specified model.The consistency and asymptotic normality of the resulting regression parameter estimators established.Also numerical simulation and application with different supposed conditions show that the proposed method yields a substantial gain in efficiency of the interested parameters.展开更多
基金supported by the National Key R&D Program of China(2022YFA1007900)the National Natural Science Foundation of China(Nos.12271171,12171158,12071147,12001200)+3 种基金the Shanghai Philosophy Social Science Planning Office Project(Grant No.2022ZJB005)the Fundamental Research Funds for the Central Universities(2022QKT001)the State Key Program of National Natural Science Foundation of China(71931004)the Humanity and Social Science Foundation of Ningbo University(XPYB19002)。
文摘In this paper,we analyze the relationship between the equilibrium reinsurance strategy and the tail of the distribution of the risk.Since Mean Residual Life(MRL)has a close relationship with the tail of the distribution,we consider two classes of risk distributions,Decreasing Mean Residual Life(DMRL)and Increasing Mean Residual Life(IMRL)distributions,which can be used to classify light-tailed and heavy-tailed distributions,respectively.We assume that the underlying risk process is modelled by the classical CramérLundberg model process.Under the mean-variance criterion,by solving the extended Hamilton-Jacobi-Bellman equation,we derive the equilibrium reinsurance strategy for the insurer and the reinsurer under DMRL and IMRL,respectively.Furthermore,we analyze how to choose the reinsurance premium to make the insurer and the reinsurer agree with the same reinsurance strategy.We find that under the case of DMRL,if the distribution and the risk aversions satisfy certain conditions,the insurer and the reinsurer can adopt a reinsurance premium to agree on a reinsurance strategy,and under the case of IMRL,the insurer and the reinsurer can only agree with each other that the insurer do not purchase the reinsurance.
基金supported by National Natural Science Foundation of China(Grant No.11831008)National Natural Science Foundation of China(Grant No.12271272)+1 种基金National Science Foundation of USA(Grant No.DMS-1914411)supported by the Fundamental Research Funds for the Central Universities。
文摘Because of advances in data collection and storage,statistical analysis in modern scientific research and practice now has opportunities to utilize external information such as summary statistics from similar studies.A likelihood approach based on a parametric model assumption has been developed in the literature to utilize external summary information when the populations for external and main internal data are assumed to be the same.In this article,we instead consider the generalized estimation equation(GEE)approach for statistical inference,which is semiparametric or nonparametric,and show how to utilize external summary information even when internal and external data populations are not the same.Our approach is coupling the internal data and external summary information to form additional estimation equations and then applying the generalized method of moments(GMM).We show that the proposed GMM estimator is asymptotically normal and,under some conditions,is more efficient than the GEE estimator without using external summary information.Estimators of the asymptotic covariance matrix of the GMM estimators are also proposed.Simulation results are obtained to confirm our theory and quantify the improvements by utilizing external data.An example is also included for illustration.
基金supported by National Natural Science Foundation of China(Grant Nos.11901199 and 71931004),supported by National Natural Science Foundation of China(Grant Nos.11971098 and 11471069)the Open Research Fund of Key Laboratory for Applied Statistics of Ministry of Education,Northeast Normal University(Grant No.130028906)+1 种基金Shanghai Chenguang Program(Grant No.19CG26)National Key R&D Program of China(Grant No.2020YFA0714102)。
文摘Space-filling designs are widely used in computer experiments.They are frequently evaluated by the orthogonality and distance-related criteria.Rotating orthogonal arrays is an appealing approach to constructing orthogonal space-filling designs.An important issue that has been rarely addressed in the literature is the design selection for the initial orthogonal arrays.This paper studies the maximin L_(2)-distance properties of orthogonal designs generated by rotating two-level orthogonal arrays under three criteria.We provide theoretical justifications for the rotation method from a maximin distance perspective and further propose to select initial orthogonal arrays by the minimum G_(2)-aberration criterion.New infinite families of orthogonal or 3-orthogonal U-type designs,which also perform well under the maximin distance criterion,are obtained and tabulated.Examples are presented to show the effectiveness of the constructed designs for building statistical surrogate models.
基金supported by the National Natural Science Foundation of China under Grant Nos.11771146,11831008,81530086,11771145,11871252the 111 Project(B14019)Program of Shanghai Subject Chief Scientist under Grant No.14XD1401600。
文摘Motivated by a medical study that attempts to analyze the relationship between growth curves and other variables and to measure the association among multiple growth curves,the authors develop a functional multiple-outcome model to decompose the total variation of multiple functional outcomes into variation explained by independent variables with time-varying coefficient functions,by latent factors and by noise.The latent factors are the hidden common factors that influence the multiple outcomes and are found through the combined functional principal component analysis approach.Through the coefficients of the latent factors one may further explore the association of the multiple outcomes.This method is applied to the multivariate growth data of infants in a real medical study in Shanghai and produces interpretable results.Convergence rates for the proposed estimates of the varying coefficient and covariance functions of the model are derived under mild conditions.
文摘In this thesis,we establish non-linear wavelet density estimators and studying the asymptotic properties of the estimators with data missing at random when covariates are present.The outstanding advantage of non-linear wavelet method is estimating the unsoothed functions,however,the classical kernel estimation cannot do this work.At the same time,we study the larger sample properties of the ISE for hazard rate estimator.
基金Supported by the National Natural Science Foundation of China(11471120)the Science and Technology Commission of Shanghai Municipality(19JC1420100)。
文摘The paper first analyzes price change due to stock splits in Chinese stock markets,which shows stock prices typically go up for stock splits.Then theoretical analyses based on risk theory are presented to explain the reason,where the method comes from a new perspective and obtained theoretical conclusions show that stock splits typically make stock price go up if risk-compensation function is convex,and go down if risk-compensation function is concave.Stock prices typically go up for stock splits because risk-compensation functions are mainly convex.The obtained conclusions are consistent with the known results in the last three decades.
基金supported by the National Key Research and Development Program of China(2015AA020108)the National Natural Science Foundation of China(31671377,81671326)+3 种基金Shanghai Municipal Science and Technology Major Project(2017SHZDZX01)Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science(East China Normal University)of Ministry of Educationthe Fundamental Research Funds for the Central Universities,Beihang University&Capital Medical University Advanced Innovation Center for Big Data-Based Precision Medicine Plan(BHME-201804,BHME-201904)The Special Fund of the Pediatric Medical Coordinated Development Center of Beijing Hospitals。
文摘Different psychiatric disorders share genetic relationships and pleiotropic loci to certain extent.We integrated and analyzed datasets related to major depressive disorder(MDD),bipolar disorder(BIP),and schizophrenia(SCZ)from the Psychiatric Genomics Consortium using multitrait analysis of genome-wide association analysis(MTAG).MTAG significantly increased the effective sample size from 99,773 to 119,754 for MDD,from 909,061 to 1,450,972 for BIP,and from 856,677 to 940,613 for SCZ.We discovered 7,32,and 43 novel lead single nucleotide polymorphisms(SNPs)and 1,6,and 3 novel causal SNPs for MDD,BIP,and SCZ,respectively,after fine-mapping.We identified rs8039305 in the FURIN gene as a novel pleiotropic locus across the three disorders.We performed marker analysis of genomic annotation(MAGMA)and Hi-C-coupled MAGMA(H-MAGMA)based gene-set analysis and identified 101 genes associated with the three disorders,which were enriched in the regulation of postsynaptic membranes,postsynaptic membrane dopaminergic synapses,and Notch signaling pathway.Next,we performed Mendelian randomization analysis using different tools and detected a causal effect of BIP on SCZ.Overall,we demonstrated the usage of combined genome-wide association studies summary statistics for exploring potential novel mechanisms of the three psychiatric disorders,providing an alternative approach to integrate publicly available summary data.
基金supported by National Natural Science Foundation of China(project number:11771146,11831008,81530086,11771145)the National Social Science Foundation Key Program(17ZDA091)+2 种基金the 111 Project(B14019)Programof Shanghai Subject Chief Scientist(14XD1401600)supported by the China Postdoctoral Science Foundation(2018M630393).
文摘The unified weighing scheme for the local-linear smoother in analysing functional data can deal with data that are dense,sparse or of neither type.In this paper,we focus on the convergence rate of functional principal component analysis using this method.Almost sure asymptotic consistency and rates of convergence for the estimators of eigenvalues and eigenfunctions have been established.We also provide the convergence rate of the variance estimation of the measurement error.Based on the results,the number of observations within each curve can be of any rate relative to the sample size,which is consistent with the earlier conclusions about the asymptotic properties of the mean and covariance estimators.
文摘We propose two simple regression models of Pearson correlation coefficient of two normal responses or binary responses to assess the effect of covariates of interest.Likelihood-based inference is established to estimate the regression coefficients,upon which bootstrap-based method is used to test the significance of covariates of interest.Simulation studies show the effectiveness of the method in terms of type-I error control,power performance in moderate sample size and robustness with respect to model mis-specification.We illustrate the application of the proposed method to some real data concerning health measurements.
基金Supported by the State Key Program of National Natural Science Foundation of China(Grant No.71931004)the State Key Program in the Ma jor Research Plan of National Natural Science Foundation of China(Grant No.91546202)。
文摘Length-biased data are encountered in many fields,including economics,engineering and epidemiological cohort studies.There are two main challenges in the analysis of such data:the assumption of independent censoring is violated and the assumed model for the underlying population is no longer satisfied for the observed data.In this paper,a proportional mean residual life varyingcoefficient model for length-biased data is considered and a local pseudo likelihood method is proposed for estimating the coefficient functions in the model.Asymptotic properties are investigated for the proposed estimators.The finite sample performance of the proposed methodology is demonstrated by simulation studies.Finally,the method is applied to a real data set concerning the Academy Awards.
基金the National Natural Science Foundation of China under Grant Nos. 11971171,11971300, 11901286, 12071267 and 12171310the Shanghai Natural Science Foundation under Grant No.20ZR1421800+2 种基金the Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science (East China Normal University)the General Research Fund (HKBU12303421, HKBU12303918)the Initiation Grant for Faculty Niche Research Areas (RC-FNRA-IG/20-21/SCI/03) of Hong Kong Baptist University。
文摘The partially linear single-index model(PLSIM) is a flexible and powerful model for analyzing the relationship between the response and the multivariate covariates. This paper considers the PLSIM with measurement error possibly in all the variables. The authors propose a new efficient estimation procedure based on the local linear smoothing and the simulation-extrapolation method,and further establish the asymptotic normality of the proposed estimators for both the index parameter and nonparametric link function. The authors also carry out extensive Monte Carlo simulation studies to evaluate the finite sample performance of the new method, and apply it to analyze the osteoporosis prevention data.
基金the China Postdoctoral Science Foundation under Grant No.2019M651422the National Natural Science Foundation of China under Grant Nos.71701127,11831008 and 11971171+3 种基金the National Social Science Foundation Key Program under Grant No.17ZDA091the 111 Project of China under Grant No.B14019the Natural Science Foundation of Shanghai under Grant Nos.17ZR1409000 and 20ZR1423000the Project of Humanities and Social Science Foundation of Ministry of Education under Grant No.20YJC910003。
文摘The problem of hazard rate estimation under right-censored assumption has been investigated extensively.Integrated square error(ISE)of estimation is one of the most widely accepted measurements of the global performance for nonparametric kernel estimation.But there are no results available for ISE of hazard rate estimation under right-censored model with censoring indicators missing at random(MAR)so far.This paper constructs an imputation estimator of the hazard rate function and establish asymptotic normality of the ISE for the kernel hazard rate estimator with censoring indicators MAR.At the same time,an asymptotic representation of the mean integrated square error(MISE)is also presented.The finite sample behavior of the estimator is investigated via one simple simulation.
基金the National Natural Science Foundation of China Grant Nos. 1180135511871376 and 11971116Shanghai Pujiang Program 18PJ1409800。
文摘The authors propose a two-step test for the two-sample problem of processes of OrnsteinUhlenbeck type. In the first step, the authors test the equality of correlation structures, based on the least square estimators of the correlation parameters, and the test statistic follows the standard normal distribution. If the null hypothesis is not rejected in the first step, the authors consider a second step to test the equality of marginal distributions, based on the weighted deviation of the empirical characteristic functions;the test statistic has a complicated asymptotic distribution, so that sequential bootstrap method is applied to reach a temporary decision. Simulation studies and real data analysis suggest that the proposed approach performs well in finite samples.
基金This work is supported by National Natural Science Foun-dation of China(No.11971171)the 111 Project(B14019)and Project of National Social Science Fund of China(15BTJ027)+3 种基金Weidong Liu’s research is supported by National Program on Key Basic Research Project(973 Program,2018AAA0100704)National Natural Science Foundation of China(No.11825104,11690013)Youth Talent Sup-port Program,and a grant from Australian Research Council.Hansheng Wang’s research is partially supported by National Natural Science Foundation of China(No.11831008,11525101,71532001)It is also supported in part by China’s National Key Research Special Program(No.2016YFC0207704).
文摘The rapid emergence of massive datasets in various fields poses a serious challenge to tra-ditional statistical methods.Meanwhile,it provides opportunities for researchers to develop novel algorithms.Inspired by the idea of divide-and-conquer,various distributed frameworks for statistical estimation and inference have been proposed.They were developed to deal with large-scale statistical optimization problems.This paper aims to provide a comprehensive review for related literature.It includes parametric models,nonparametric models,and other frequently used models.Their key ideas and theoretical properties are summarized.The trade-off between communication cost and estimate precision together with other concerns is discussed.
基金the National Natural Science Foundation of China(No.11971149).
文摘Firstly,this paper proposes a generalized log-determinant optimization model with the purpose of estimating the high-dimensional sparse inverse covariance matrices.Under the normality assumption,the zero components in the inverse covariance matrices represent the conditional independence between pairs of variables given all the other variables.The generalized model considered in this study,because of the setting of the eigenvalue bounded constraints,covers a large number of existing estimators as special cases.Secondly,rather than directly tracking the challenging optimization problem,this paper uses a couple of alternating direction methods of multipliers(ADMM)to solve its dual model where 5 separable structures are contained.The first implemented algorithm is based on a single Gauss–Seidel iteration,but it does not necessarily converge theoretically.In contrast,the second algorithm employs the symmetric Gauss–Seidel(sGS)based ADMM which is equivalent to the 2-block iterative scheme from the latest sGS decomposition theorem.Finally,we do numerical simulations using the synthetic data and the real data set which show that both algorithms are very effective in estimating high-dimensional sparse inverse covariance matrix.
基金supported by the 111 Project[grant number B14019]the National Natural Science Foundation of China[grant numbers 11571113,11601157,11601320].
文摘This paper is devoted to study the proportional reinsurance/new business and investment problem under the mean-variance criterion in a continuous-time setting.The strategies are constrained in the non-negative cone and all coefficients in the model except the interest rate are stochastic processes adapted the filtration generated by a Markov chain.With the help of a backward stochastic differential equation driven by the Markov chain,we obtain the optimal strategy and optimal cost explicitly under this non-Markovian regime-switching model.The cases with one risky asset and Markov regime-switching model are considered as special cases.
文摘We thank the editor,Professor Jun Shao,for orga-nizing this stimulating discussion.We are grateful to all discussants for their insightful comments on our review article on the distributed statistical inference.Due to the urgent need to process the datasets with massive sizes,various distributed computing methods have been proposed for the large-scale statistical prob-lems.Meanwhile,some important theoretical results were established.While we want to give a relatively comprehensive overview on this hot topic,there are still some important works that have been missed in our review written over a year ago.However,we are glad to see the discussants provide reviews of some new works and references.We hope that these discussions and our review would serve as a stimulus for further studies in this rapidly developing area.
基金supported by National Natural Science Foundation of China(Grant No.11401603)the Fundamental Research Funds for the Central Universities(Grant No.QL 18009)+2 种基金Discipline Foundation of Central University of Finance and Economics(Grant No.CUFESAM201811)supported by the State Key Program of National Natural Science Foundation of China(Grant No.71331006)the State Key Program in the Major Research Plan of National Natural Science Foundation of China(Grant No.91546202)
文摘Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiased sampling, and propose a new semiparametric-model-based approach to estimate quantile differences of failure time. We establish the asymptotic properties of our new estimators theoretically under mild technical conditions, and propose a resampling method for estimating their asymptotic variance. We then conduct simulations to evaluate the empirical performance and efficiency of the proposed estimators, and demonstrate their application by a real data analysis.
基金supported by MOE(Ministry of Education in China),Project of Humanities and Social Sciences(No.15YJA910004)Sponsored by K.C.Wong Magna Fund in Ningbo University+1 种基金supported by the National Social Science Foundation of China(No.17BTJ025)the Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science(East China Normal University),Ministry of Education(No.KLATASDS1802)
文摘The identification of within-subject dependence is important for constructing efficient estimation in longitudinal data models.In this paper,we proposed a flexible way to study this dependence by using nonparametric regression models.Specifically,we considered the estimation of varying coefficient longitudinal data model with non-stationary varying coefficient autoregressive error process over observational time quantum.Based on spline approximation and local polynomial techniques,we proposed a two-stage nonparametric estimation for unknown functional coefficients and didn’t not drop any observations in a hybrid least square loss framework.Moreover,we showed that the estimated coefficient functions are asymptotically normal and derived the asymptotic biases and variances accordingly.Monte Carlo studies and two real applications were conducted for illustrating the performance of our proposed methods.
基金supported by the State Key Program of National Natural Science Foundation of China(No.71331006)by the Graduate Innovation Foundation of Shanghai University of Finance and Economics of China(No.CXJJ-2018-408)。
文摘Published auxiliary information can be helpful in conducting statistical inference in a new study.In this paper,we synthesize the auxiliary information with semiparametric likelihood-based inference for censoring data with the total sample size is available.We express the auxiliary information as constraints on the regression coefficients and the covariate distribution,then use empirical likelihood method for general estimating equations to improve the efficiency of the interested parameters in the specified model.The consistency and asymptotic normality of the resulting regression parameter estimators established.Also numerical simulation and application with different supposed conditions show that the proposed method yields a substantial gain in efficiency of the interested parameters.