期刊文献+

缺失数据插补方法的比较研究 被引量:18

Comparative Research on Interpolation Method of Missing Data
下载PDF
导出
摘要 文章通过对样本数据缺失值问题的分析,在随机缺失模式下选择科学有效的插补法对数据缺失问题进行研究,然后以市场中顾客对商品的喜爱度作为调查对象,构建模型对其进行实证分析,并在与多重插补法结合运用的基础上研究不同缺失率下的插补效果。结果显示,随着缺失率的提高,调查中所获得的数据可用性减少,同时插补法的效果也随之降低。在四种插补法中,EM插补和多重插补的插补效果优于其余两种,同时运用模型与多重插补相结合的插补效果也不错。因此根据不同的缺失率,需要选择合适的插补法对数据进行插补。 Through the analysis of the missing data of sample data,this paper chooses scientific and effective interpolation method to study the problem of missing data under the random missing mode,and then uses the customer's preference for the products in the market as a survey target to construct a model for conducting empirical analysis.Finally,on the basis of combining with multiple interpolation method,the paper makes a study on the interpolation effect under different missing rate.The results show that,with the increase of the missing rate,the availability of data from the survey decreases,and the effectiveness of the interpolation method is also reduced accordingly.Among the four interpolation methods,the interpolation effect of EM interpolation and multiple interpolation is better than the other two,and at the same time,the interpolation effect of the combination of the model and multiple interpolation is also good.Therefore,it is necessary to select the proper interpolation method to interpolate the data according to different missing rates.
作者 宋亮 万建洲 Song Liang;Wan Jianzhou(School of Mathematics and Statistics,Nanyang Institute of Technology,Nanyang Henan 473000,China)
出处 《统计与决策》 CSSCI 北大核心 2020年第18期10-14,共5页 Statistics & Decision
基金 国家自然科学基金资助项目(11901320) 河南省教育厅高等学校重点科研项目(19A110028) 河南省科技厅基础与前沿项目(162300410076)。
关键词 抽样调查 缺失值 插补法 逻辑回归分析 sample survey missing value interpolation logistic regression analysis
  • 相关文献

参考文献5

二级参考文献21

  • 1田兵.缺失数据的单一插补方法[J].阴山学刊(自然科学版),2011,25(3):17-19. 被引量:3
  • 2殷杰,石锐.SAS中处理数据集缺失值方法的对比研究[J].计算机应用,2007,27(B06):438-439. 被引量:8
  • 3Barnard J,Meng XL. Applications of muhiple imputation in medical studies: from AIDS to NHANES[ J ]. Stat Methods Med Res,1999,8(1):17.
  • 4[ 美]Roderick J. A. Little, Donald B. Rubin Statistical Analysis with Missing Data [M].New York :John Wiley & Sons INC,2002.
  • 5[美]James0.Berger著,贾乃光译.统计决策论及贝叶斯分析[M].北京:中国统计出版社.1997.
  • 6[美]Donald.B.Rubin Multiple Imputation For Nonresponse in Surveys [M].New Yrok :Jghn Wiley & Sons INC,1987.
  • 7Aitchison J. The Statistical Analysis of Compositional Data [M]. New York: Chapman and Hall, 1986.
  • 8Dempster A P, Laird N M, Rubin D B. Maximum likelihood from incomplete data via the EM algorithm [J]. J. Roy. Statist. Soc. Ser. B, 1977, 39(1): 1-38.
  • 9Egozcue J J, Pawlowsky-Glahn V, Mateu-Figueras G, et al. Isometric logratio transformations for compositional data analysis [J]. Math. Geol., 2003, 35(3): 279-300.
  • 10王星.非参数统计[M].北京:清华大学出版社,2013.

共引文献45

同被引文献172

引证文献18

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部