期刊文献+

SAS中处理数据集缺失值方法的对比研究 被引量:8

下载PDF
导出
摘要 采用SAS软件中的多重填补法(MI),期望最大化算法(EM)和Ad Hoc法分别对医疗费用集的缺失值进行处理,比较三种方法的优劣并探讨其在医疗费用缺失值处理中的适用性。运用SAS9.10,采用数据模拟技术,分别模拟真实医疗数据集的各种缺失率的随机缺失数据集,分别用MI、EM和Ad Hoc对各缺失数据集进行处理,对结果进行比较和分析。结果:数据缺失率≤10%时,Ad Hoc更优;数据缺失率在15%~30%时,经MI处理后的分析结果更接近“真实”;数据缺失率≥35%时,三种方法均无效。结论:对不同缺失率的费用科目缺失数据集,MI和Ad Hoc对缺失值的处理各有优劣,EM效果略差于MI,没有明显优势。
作者 殷杰 石锐
出处 《计算机应用》 CSCD 北大核心 2007年第B06期438-439,共2页 journal of Computer Applications
  • 相关文献

参考文献4

  • 1茅群霞,李晓松.多重填补法Markov Chain Monte Carlo模型在有缺失值的妇幼卫生纵向数据中的应用[J].四川大学学报(医学版),2005,36(3):422-425. 被引量:7
  • 2GIARDINA M,HUO Y,AZUAJE F,et al.A Missing Data Estimation Analysis in Type Ⅱ Diabetes Databases[A].Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems[C].2005.
  • 3BARZI F,WOODWARD M.Imputation of Missing Values in Practice:Results from Imputations of Serum Cholesterol in 28 Cohort Studies[J].American Journal of Epidemiology,2004,160 (1):34-351.
  • 4ARNOLD AM,KRONMAL RA.Multiple Imputation of Baseline Data in the Cardiovascular Health Study[J].American Journal of Epidemiology,2003,157 (1):74-841.

二级参考文献10

  • 1Arnold Alice M, Kronmal Richard A. Multiple Imputation of Baseline Data in the Cardiovascular Health Study. American Journal of Epidemiology,2003;157(1):74.
  • 2Abraham WT, Russell DW. Missing data:a review of current methods and applications in epidemiological research. Current Opinion in Psychiatry,2004;17(4):315.
  • 3Mary Beth Landrum,Mark P Becker. A multiple imputation strategy for incomplete longitudinal data. Statistics in Medicine,2001;20:2741.
  • 4Yang C Yuan. Multiple imputation for missing data:Concepts and new development. SAS Institute Inc,1999:267-25.
  • 5Rubin DB. Multiple imputation for nonresponse in surveys. New York: John Wiley & Sons,1987:15-22.
  • 6Van Buuren S, Boshuizen HC, Knook DL. Multiple imputation of missing blood pressure covariates in survival analysis. Statistics in Medicine,1999;18:681.
  • 7Schafer JL,Maren K Olsen. Multiple imputation for multivariate missing-data problems:a data analysis's perspective. Multivariate Behavioural Research,1998;33:545.
  • 8MCMC Method for Arbitrary Missing Data. SAS/STAT 9 User's guide. North Carolina:SAS Institute Inc,2002:159-169.
  • 9Combining Inferences from Multiple Imputed Data Sets. SAS/STAT 9 User's Guide. North Carolina:SAS Institute Inc,2002:211-213.
  • 10Patricia A Patrician. Focus on research methods multiple imputation for missing data. Research in Nursing & Health,2002;25:76.

共引文献6

同被引文献106

引证文献8

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部