In practice, predictors possess grouping structures spontaneously. Incorporation of such useful information can improve statistical modeling and inference. In addition, the high-dimensionality often leads to the colli...In practice, predictors possess grouping structures spontaneously. Incorporation of such useful information can improve statistical modeling and inference. In addition, the high-dimensionality often leads to the collinearity problem. The elastic net is an ideal method which is inclined to reflect a grouping effect. In this paper, we consider the problem of group selection and estimation in the sparse linear regression model in which predictors can be grouped. We investigate a group adaptive elastic-net and derive oracle inequalities and model consistency for the cases where group number is larger than the sample size. Oracle property is addressed for the case of the fixed group number. We revise the locally approximated coordinate descent algorithm to make our computation. Simulation and real data studies indicate that the group adaptive elastic-net is an alternative and competitive method for model selection of high-dimensional problems for the cases of group number being larger than the sample size.展开更多
This paper employs the SCAD-penalized least squares method to simultaneously select variables and estimate the coefficients for high-dimensional covariate adjusted linear regression models.The distorted variables are ...This paper employs the SCAD-penalized least squares method to simultaneously select variables and estimate the coefficients for high-dimensional covariate adjusted linear regression models.The distorted variables are assumed to be contaminated with a multiplicative factor that is determined by the value of an unknown function of an observable covariate.The authors show that under some appropriate conditions,the SCAD-penalized least squares estimator has the so called "oracle property".In addition,the authors also suggest a BIC criterion to select the tuning parameter,and show that BIC criterion is able to identify the true model consistently for the covariate adjusted linear regression models.Simulation studies and a real data are used to illustrate the efficiency of the proposed estimation algorithm.展开更多
A robust and efficient shrinkage-type variable selection procedure for varying coefficient models is proposed,selection consistency and oracle properties are established.Furthermore,a BIC-type criterion is suggested f...A robust and efficient shrinkage-type variable selection procedure for varying coefficient models is proposed,selection consistency and oracle properties are established.Furthermore,a BIC-type criterion is suggested for shrinkage parameter selection and theoretical property is discussed.Numerical studies and real data analysis also are included to illustrate the finite sample performance of our method.展开更多
We consider the problem of variable selection for single-index varying-coefficient model, and present a regularized variable selection procedure by combining basis function approximations with SCAD penalty. The propos...We consider the problem of variable selection for single-index varying-coefficient model, and present a regularized variable selection procedure by combining basis function approximations with SCAD penalty. The proposed procedure simultaneously selects significant covariates with functional coefficients and local significant variables with parametric coefficients. With appropriate selection of the tuning parameters, the consistency of the variable selection procedure and the oracle property of the estimators are established. The proposed method can naturally be applied to deal with pure single-index model and varying-coefficient model. Finite sample performances of the proposed method are illustrated by a simulation study and the real data analysis.展开更多
In this puper, we consider the problem of variabie selection and model detection in varying coefficient models with longitudinM data. We propose a combined penalization procedure to select the significant variables, d...In this puper, we consider the problem of variabie selection and model detection in varying coefficient models with longitudinM data. We propose a combined penalization procedure to select the significant variables, detect the true structure of the model and estimate the unknown regression coefficients simultaneously. With appropriate selection of the tuning parameters, we show that the proposed procedure is consistent in both variable selection and the separation of varying and constant coefficients, and the penalized estimators have the oracle property. Finite sample performances of the proposed method are illustrated by some simulation studies and the real data analysis.展开更多
We propose a robust estimation procedure based on local Walsh-average regression(LWR) for single-index models. Our novel method provides a root-n consistent estimate of the single-index parameter under some mild regul...We propose a robust estimation procedure based on local Walsh-average regression(LWR) for single-index models. Our novel method provides a root-n consistent estimate of the single-index parameter under some mild regularity conditions;the estimate of the unknown link function converges at the usual rate for the nonparametric estimation of a univariate covariate. We theoretically demonstrate that the new estimators show significant efficiency gain across a wide spectrum of non-normal error distributions and have almost no loss of efficiency for the normal error. Even in the worst case, the asymptotic relative efficiency(ARE) has a lower bound compared with the least squares(LS) estimates;the lower bounds of the AREs are 0.864 and 0.8896 for the single-index parameter and nonparametric function, respectively. Moreover, the ARE of the proposed LWR-based approach versus the ARE of the LS-based method has an expression that is closely related to the ARE of the signed-rank Wilcoxon test as compared with the t-test. In addition, to obtain a sparse estimate of the single-index parameter, we develop a variable selection procedure by combining the estimation method with smoothly clipped absolute deviation penalty;this procedure is shown to possess the oracle property. We also propose a Bayes information criterion(BIC)-type criterion for selecting the tuning parameter and further prove its ability to consistently identify the true model. We conduct some Monte Carlo simulations and a real data analysis to illustrate the finite sample performance of the proposed methods.展开更多
Censored regression("Tobit") model is a special case of limited dependent variable regression model, and plays an important role in econometrics. Based on this model, all kinds of methods for variable or gro...Censored regression("Tobit") model is a special case of limited dependent variable regression model, and plays an important role in econometrics. Based on this model, all kinds of methods for variable or group variable selection have been developed and the corresponding shrinkage parameter estimates are widely studied. However, asymptotic distributions of the shrinkage estimates involve unknown nuisance parameters,such as density function of error term. To avoid estimating nuisance parameters, this paper presents a randomly weighting method to approximate to the asymptotic distribution of the shrinkage estimate. A computation procedure of random approximation is provided and asymptotic properties of the randomly weighting estimates are also obtained. The proposed methods are evaluated with extensively numerical studies and a women labor supply example.展开更多
In this paper, the comparison of orthogonal descriptors and Leaps and Bounds regression analysis is performed. The results obtained by using orthogonal descriptors are better than that obtained by using Leaps and Boun...In this paper, the comparison of orthogonal descriptors and Leaps and Bounds regression analysis is performed. The results obtained by using orthogonal descriptors are better than that obtained by using Leaps and Bounds regression for the data set of nitrobenzenes used in this study. Leaps and Bounds regression can be used effectively for selection of variables in quantitative structure activity/property relationship(QSAR/QSPR) studies. Consequently, orthogonalisation of descriptors is also a good method for variable selection for studies on QSAR/QSPR.展开更多
对部分线性模型的aglasso(adaptive group lasso)参数估计及变量选择问题进行研究.通过构造aglasso的估计函数,将分组部分线性模型变量的选择问题转化为分组因子的选择问题.理论研究表明:该方法能相合地识别真实模型,并且估计具有oracl...对部分线性模型的aglasso(adaptive group lasso)参数估计及变量选择问题进行研究.通过构造aglasso的估计函数,将分组部分线性模型变量的选择问题转化为分组因子的选择问题.理论研究表明:该方法能相合地识别真实模型,并且估计具有oracle性质.最后通过模拟研究了所提方法的有限样本性质.展开更多
基金supported by National Natural Science Foundation of China(Grant No.11571219)the Open Research Fund Program of Key Laboratory of Mathematical Economics(SUFE)(Grant No.201309KF02)Ministry of Education,and Changjiang Scholars and Innovative Research Team in University(Grant No.IRT13077)
文摘In practice, predictors possess grouping structures spontaneously. Incorporation of such useful information can improve statistical modeling and inference. In addition, the high-dimensionality often leads to the collinearity problem. The elastic net is an ideal method which is inclined to reflect a grouping effect. In this paper, we consider the problem of group selection and estimation in the sparse linear regression model in which predictors can be grouped. We investigate a group adaptive elastic-net and derive oracle inequalities and model consistency for the cases where group number is larger than the sample size. Oracle property is addressed for the case of the fixed group number. We revise the locally approximated coordinate descent algorithm to make our computation. Simulation and real data studies indicate that the group adaptive elastic-net is an alternative and competitive method for model selection of high-dimensional problems for the cases of group number being larger than the sample size.
基金supported by the National Natural Science Foundation of China under Grant Nos.11471029,11101014,61273221 and 11171010the Beijing Natural Science Foundation under Grant Nos.1142002 and 1112001+1 种基金the Science and Technology Project of Beijing Municipal Education Commission under Grant No.KM201410005010the Research Fund for the Doctoral Program of Beijing University of Technology under Grant No.006000543114550
文摘This paper employs the SCAD-penalized least squares method to simultaneously select variables and estimate the coefficients for high-dimensional covariate adjusted linear regression models.The distorted variables are assumed to be contaminated with a multiplicative factor that is determined by the value of an unknown function of an observable covariate.The authors show that under some appropriate conditions,the SCAD-penalized least squares estimator has the so called "oracle property".In addition,the authors also suggest a BIC criterion to select the tuning parameter,and show that BIC criterion is able to identify the true model consistently for the covariate adjusted linear regression models.Simulation studies and a real data are used to illustrate the efficiency of the proposed estimation algorithm.
基金supported by the National Natural Science Foundation of China(Nos.11901356,11971265)wealth management project(2019ZBKY047)of Shandong Technology and Business University.
文摘A robust and efficient shrinkage-type variable selection procedure for varying coefficient models is proposed,selection consistency and oracle properties are established.Furthermore,a BIC-type criterion is suggested for shrinkage parameter selection and theoretical property is discussed.Numerical studies and real data analysis also are included to illustrate the finite sample performance of our method.
文摘We consider the problem of variable selection for single-index varying-coefficient model, and present a regularized variable selection procedure by combining basis function approximations with SCAD penalty. The proposed procedure simultaneously selects significant covariates with functional coefficients and local significant variables with parametric coefficients. With appropriate selection of the tuning parameters, the consistency of the variable selection procedure and the oracle property of the estimators are established. The proposed method can naturally be applied to deal with pure single-index model and varying-coefficient model. Finite sample performances of the proposed method are illustrated by a simulation study and the real data analysis.
基金Supported by National Natural Science Foundation of China(Grant Nos.11501522,11101014,11001118 and11171012)National Statistical Research Projects(Grant No.2014LZ45)+2 种基金the Doctoral Fund of Innovation of Beijing University of Technologythe Science and Technology Project of the Faculty Adviser of Excellent PhD Degree Thesis of Beijing(Grant No.20111000503)the Beijing Municipal Education Commission Foundation(Grant No.KM201110005029)
文摘In this puper, we consider the problem of variabie selection and model detection in varying coefficient models with longitudinM data. We propose a combined penalization procedure to select the significant variables, detect the true structure of the model and estimate the unknown regression coefficients simultaneously. With appropriate selection of the tuning parameters, we show that the proposed procedure is consistent in both variable selection and the separation of varying and constant coefficients, and the penalized estimators have the oracle property. Finite sample performances of the proposed method are illustrated by some simulation studies and the real data analysis.
基金partially supported by National Natural Science Foundation of China(Grant Nos.11801168,11801169,11571055 and 11671059)the Natural Science Foundation of Hunan Province(Grant No.2018JJ3322)
文摘We propose a robust estimation procedure based on local Walsh-average regression(LWR) for single-index models. Our novel method provides a root-n consistent estimate of the single-index parameter under some mild regularity conditions;the estimate of the unknown link function converges at the usual rate for the nonparametric estimation of a univariate covariate. We theoretically demonstrate that the new estimators show significant efficiency gain across a wide spectrum of non-normal error distributions and have almost no loss of efficiency for the normal error. Even in the worst case, the asymptotic relative efficiency(ARE) has a lower bound compared with the least squares(LS) estimates;the lower bounds of the AREs are 0.864 and 0.8896 for the single-index parameter and nonparametric function, respectively. Moreover, the ARE of the proposed LWR-based approach versus the ARE of the LS-based method has an expression that is closely related to the ARE of the signed-rank Wilcoxon test as compared with the t-test. In addition, to obtain a sparse estimate of the single-index parameter, we develop a variable selection procedure by combining the estimation method with smoothly clipped absolute deviation penalty;this procedure is shown to possess the oracle property. We also propose a Bayes information criterion(BIC)-type criterion for selecting the tuning parameter and further prove its ability to consistently identify the true model. We conduct some Monte Carlo simulations and a real data analysis to illustrate the finite sample performance of the proposed methods.
基金partially supported by National Natural Science Foundation of China(Grant No.11101396)Anhui Provincial Natural Science Foundation(Grant No.1908085MA06)
文摘Censored regression("Tobit") model is a special case of limited dependent variable regression model, and plays an important role in econometrics. Based on this model, all kinds of methods for variable or group variable selection have been developed and the corresponding shrinkage parameter estimates are widely studied. However, asymptotic distributions of the shrinkage estimates involve unknown nuisance parameters,such as density function of error term. To avoid estimating nuisance parameters, this paper presents a randomly weighting method to approximate to the asymptotic distribution of the shrinkage estimate. A computation procedure of random approximation is provided and asymptotic properties of the randomly weighting estimates are also obtained. The proposed methods are evaluated with extensively numerical studies and a women labor supply example.
文摘In this paper, the comparison of orthogonal descriptors and Leaps and Bounds regression analysis is performed. The results obtained by using orthogonal descriptors are better than that obtained by using Leaps and Bounds regression for the data set of nitrobenzenes used in this study. Leaps and Bounds regression can be used effectively for selection of variables in quantitative structure activity/property relationship(QSAR/QSPR) studies. Consequently, orthogonalisation of descriptors is also a good method for variable selection for studies on QSAR/QSPR.
文摘对部分线性模型的aglasso(adaptive group lasso)参数估计及变量选择问题进行研究.通过构造aglasso的估计函数,将分组部分线性模型变量的选择问题转化为分组因子的选择问题.理论研究表明:该方法能相合地识别真实模型,并且估计具有oracle性质.最后通过模拟研究了所提方法的有限样本性质.
基金The National Social Science Foundation of China(11CTJ004)the National Natural Science Foundation of China(11301569+2 种基金11101452)the Natural Science Foundation Project of CQCSTC(cstc2011jjA00014)the Research Foundation of Chongqing Municipal Education Commission(KJ120504)