摘要
如何在高维数据空间中筛选有用变量,提取有用的信息,是大数据时代研究的热点之一。文章将变量选择的方法应用于高维数据,通过模拟仿真,引进敏感性与特异性,分析比较岭回归、Lasso、自适应Lasso以及Elastic Net回归等方法的适用领域,并指出变量选择方法的应用前景。
How to filter useful variables and extract useful information in high-dimensional data space is one of the hot topics in the era of big data.This paper applies the method of variable selection to high-dimensional data,analyzes and compares the applicable fields of ridge regression,Lasso,adaptive Lasso and Elastic Net regression by comparing sensitivity and specificity based on simulation,and points out the application prospect of variable selection method.
作者
宋瑞琪
朱永忠
王新军
Song Ruiqi;Zhu Yongzhong;Wang Xinjun(School of Science,Hohai University,Nanjing 211100,China)
出处
《统计与决策》
CSSCI
北大核心
2019年第2期13-16,共4页
Statistics & Decision
关键词
大数据
系数压缩法
变量选择
高维数据
big data
coefficient compression method
variable selection
high-dimensional data