期刊文献+

Robust Sure Independence Screening for Ultrahigh Dimensional Non-normal Data 被引量:1

Robust Sure Independence Screening for Ultrahigh Dimensional Non-normal Data
原文传递
导出
摘要 Sure independence screening(SIS) has been proposed to reduce the ultrahigh dimensionality down to a moderate scale and proved to enjoy the sure screening property under Gaussian linear models.However,the observed response is often skewed or heavy-tailed with extreme values in practice,which may dramatically deteriorate the performance of SIS.To this end,we propose a new robust sure independence screening(RoSIS) via considering the correlation between each predictor and the distribution function of the response.The proposed approach contributes to the literature in the following three folds: First,it is able to reduce ultrahigh dimensionality effectively.Second,it is robust to heavy tails or extreme values in the response.Third,it possesses both sure screening property and ranking consistency property under milder conditions.Furthermore,we demonstrate its excellent finite sample performance through numerical simulations and a real data example. Sure independence screening(SIS) has been proposed to reduce the ultrahigh dimensionality down to a moderate scale and proved to enjoy the sure screening property under Gaussian linear models.However,the observed response is often skewed or heavy-tailed with extreme values in practice,which may dramatically deteriorate the performance of SIS.To this end,we propose a new robust sure independence screening(RoSIS) via considering the correlation between each predictor and the distribution function of the response.The proposed approach contributes to the literature in the following three folds: First,it is able to reduce ultrahigh dimensionality effectively.Second,it is robust to heavy tails or extreme values in the response.Third,it possesses both sure screening property and ranking consistency property under milder conditions.Furthermore,we demonstrate its excellent finite sample performance through numerical simulations and a real data example.
作者 Wei ZHONG
出处 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2014年第11期1885-1896,共12页 数学学报(英文版)
基金 Supported by National Natural Science Foundation of China(Grant Nos.11301435 and 71131008) the Fundamental Research Funds for the Central Universities
关键词 ROBUSTNESS sure independence screening sure screening property ultrahigh dimensionality variable selection Robustness sure independence screening sure screening property ultrahigh dimensionality variable selection
  • 相关文献

参考文献15

  • 1Anscombe, F.: Graphs in statistical analysis. Amer. Statist., 27, 17-21 (1973).
  • 2Fan, J., Li, R.: Variable selection via nonconcave penalized likelihood and it oracle properties. J. Amer.Statist. Assoc., 96, 1348-1360 (2001).
  • 3Fan, J., Lv, J.: Sure independence screening for ultrahigh dimensional feature space (with discussion). J.Royal Statist. Soc., Ser. B, 70, 849-911 (2008).
  • 4Fan, J., Samworth, R., Wu, Y.: Ultrahigh dimensional feature selection: beyond the linear model. J.Machine Learning Research, 10, 1829-1853 (2009).
  • 5Fan, J., Song, R.: Sure independence screening in generalized linear models with NP-dimensionality. Ann.Statist., 38, 3567-3604 (2010).
  • 6Hall, P., Li, K. C.: On almost linearity of low dimensional projection from high dimensional data. Ann.Statist., 21, 867-889 (1993).
  • 7Hall, P., Miller, H.: Using generalized correlation to effect variable selection in very high dimensional problems. J. Comput. Graphical Statist., 18, 533-550 (2009).
  • 8Li, G., Peng. H., Zhang, J., et al.: Robust rank correlation based screening. Ann. Statist., 40, 1846-1877(2012).
  • 9Li, R., Zhong, W., Zhu, L. P.: Feature screening via distance correlation learning. J. Amer. Statist. Assoc.,107, 1129-1139 (2012).
  • 10Segal, M. R., Dahlquist, K. D., Conklin, B. R.: Regression approach for microarray data analysis. J.Comput. Biology, 10, 961-980 (2003).

同被引文献2

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部