摘要
在社会体育调查资料统计处理的过程中,经常面对缺失数据的问题.以2001年沈阳市500名40 ~ 70岁城市妇女关于参加体育锻炼活动和体力劳动的问卷调查数据为实例,对5种常用处理缺失数据的插补方法进行了比较分析.首先介绍5种缺失数据插补方法并阐述利与弊,然后结合调查实例进行描述性指标比较和回归分析比较,最后提出研究者在选用插补方法时应该综合考虑缺失数据所占比例、研究能力和时间限制等因素寻找最适宜的插补方法.
In the process of social sports statistical processing of the survey data, we are often faced with the problem of missing data. In this paper, taking 500 40-year-old to 70-year-old urban women in Shenyang City participatiing in the sur- vey data on physical exercise and physical activity in 2001 as an example, five kinds of commonly missing data imputation methods were compared and analyzed. At first, it introduced five kinds of missing data imputation methods and described the pros and cons, and then conducted descriptive examples and regression analysis and comparison of comparative indica- tors combined surveys. It concludes that in the selection of the interpolation method, the researchers should take into ac- count the percentage of missing data proportion, research capacity, time constraints and other factors to find the most suit- able interpolation method.
出处
《沈阳体育学院学报》
CSSCI
北大核心
2014年第4期46-49,共4页
Journal of Shenyang Sport University
基金
2013年辽宁省教育厅科学研究一般项目
编号:W2013231
关键词
缺失数据
插补方法
统计推断
数据分析
missing data
interpolation method
statistical inference
data analysis