A general single-index model with high-dimensional predictors is considered. Additive structure of the unknown link function and the error is not assumed in this model. The consistency of predictor selection and estim...A general single-index model with high-dimensional predictors is considered. Additive structure of the unknown link function and the error is not assumed in this model. The consistency of predictor selection and estimation is investigated in this model. The index is formulated in the sufficient dimension reduction framework. A distribution-based LASSO estimation is then suggested. When the dimension of predictors can diverge at a polynomial rate of the sample size, the consistency holds under an irrepresentable condition and mild conditions on the predictors. The new method has no requirement, other than independence from the predictors, for the distrlLbution of the error. This property results in robustness of the new method against outliers in the response variable. The conventional consistency of index estimation is provided after the dimension is brought down to a value smaller than the sample size. The importance of the irrepresentable condition for the consistency, and the robustness are examined by a simulation study and two real-data examples.展开更多
We study the properties of the Lasso in the high-dimensional partially linear model where the number of variables in the linear part can be greater than the sample size.We use truncated series expansion based on polyn...We study the properties of the Lasso in the high-dimensional partially linear model where the number of variables in the linear part can be greater than the sample size.We use truncated series expansion based on polynomial splines to approximate the nonparametric component in this model.Under a sparsity assumption on the regression coefficients of the linear component and some regularity conditions,we derive the oracle inequalities for the prediction risk and the estimation error.We also provide sufficient conditions under which the Lasso estimator is selection consistent for the variables in the linear part of the model.In addition,we derive the rate of convergence of the estimator of the nonparametric function.We conduct simulation studies to evaluate the finite sample performance of variable selection and nonparametric function estimation.展开更多
基金supported by Research Council of Hong Kong(Grant No.FRG/09-10/II057)Hong Kong Baptist University(Grant No.HKBU2034/09P)
文摘A general single-index model with high-dimensional predictors is considered. Additive structure of the unknown link function and the error is not assumed in this model. The consistency of predictor selection and estimation is investigated in this model. The index is formulated in the sufficient dimension reduction framework. A distribution-based LASSO estimation is then suggested. When the dimension of predictors can diverge at a polynomial rate of the sample size, the consistency holds under an irrepresentable condition and mild conditions on the predictors. The new method has no requirement, other than independence from the predictors, for the distrlLbution of the error. This property results in robustness of the new method against outliers in the response variable. The conventional consistency of index estimation is provided after the dimension is brought down to a value smaller than the sample size. The importance of the irrepresentable condition for the consistency, and the robustness are examined by a simulation study and two real-data examples.
文摘We study the properties of the Lasso in the high-dimensional partially linear model where the number of variables in the linear part can be greater than the sample size.We use truncated series expansion based on polynomial splines to approximate the nonparametric component in this model.Under a sparsity assumption on the regression coefficients of the linear component and some regularity conditions,we derive the oracle inequalities for the prediction risk and the estimation error.We also provide sufficient conditions under which the Lasso estimator is selection consistent for the variables in the linear part of the model.In addition,we derive the rate of convergence of the estimator of the nonparametric function.We conduct simulation studies to evaluate the finite sample performance of variable selection and nonparametric function estimation.