期刊文献+

LGM模型中缺失数据处理方法的比较:ML方法与Diggle-Kenward选择模型 被引量:3

LGM-based analyses with missing data: Comparison between ML method and Diggle-Kenward selection model
下载PDF
导出
摘要 追踪研究中缺失数据十分常见。本文通过Monte Carlo模拟研究,考察基于不同前提假设的Diggle-Kenward选择模型和ML方法对增长参数估计精度的差异,并考虑样本量、缺失比例、目标变量分布形态以及不同缺失机制的影响。结果表明:(1)缺失机制对基于MAR的ML方法有较大的影响,在MNAR缺失机制下,基于MAR的ML方法对LGM模型中截距均值和斜率均值的估计不具有稳健性。(2)DiggleKenward选择模型更容易受到目标变量分布偏态程度的影响,样本量与偏态程度存在交互作用,样本量较大时,偏态程度的影响会减弱。而ML方法仅在MNAR机制下轻微受到偏态程度的影响。 In longitudinal studies, missing data are common. The missing not at random (MNAR) data may lead to biasd parameter estimates and even distort the results of analyses. In this article we compared two techniques based on different mechanisms [i.e., the maximum likelihood approach based on the Missing at Random (MAR) mechanism and the Diggle-Kenward selection model based on the MNAR mechanism] for handling different types of missing data using the Monte Carlo simulation method. Estimates of parameters and standard errors using each of these methods were contrasted under different model assumptions. Four possible influential factors were considered: the dropout missingness proportions, the sample size, the distribution shape (i.e., skewness and kurtosis), and the missing mechanisms. The results indicated that (1) The Diggle-Kenward selection model were affected less by the missingness mechanism than the ML approach. At the MAR condition, the Diggle-Kenward selection model based on the MNAR mechanism kept stable and would provide similar estimation results with the ML approach based on the MAR assumption. At the MNAR condition, the ML approach was not much different from the Diggle-Kenward selection model in their variance of latent variances (σi2 and σs2 ) but had greater discrepancy in their means of the latent variables (μi and μs). (2) The distribution shape had more impact on the Diggle-Kenward selection model. For the mean and variance of the intercept and the variance of the slope, the sample size and the degrees of skewness and kurtosis had significant interactions. With large sample sizes, the influence of distribution shape on the estimation precision would decrease. The ML approach was not easily affected by the distribution shape. (3) When fitting a growth curve model, compared to the means of the latent variables (μi and μs), the variances (σi2 and σs2) were influenced much more by the distribution shape (i.e., the degree of skewness and kurtosis). (4) The level of dropout missingness proportion was the major factor affecting the parameter estimation precision. Greater sample size would improve the estimation precision in most cases.
出处 《心理学报》 CSSCI CSCD 北大核心 2017年第5期699-710,共12页 Acta Psychologica Sinica
基金 国家自然科学基金项目(31571152) 北京市与中央在京高校共建项目(019-105812) 未来教育高精尖创新中心 中央高校基本科研业务费专项资金资助
关键词 潜变量增长模型 非随机缺失机制 Diggle-Kenward选择模型 极大似然方法 latent growth model missing not at random Diggle-Kenward selection model maximum likelihood approach
  • 相关文献

参考文献1

二级参考文献59

  • 1茅群霞,李晓松.多重填补法Markov Chain Monte Carlo模型在有缺失值的妇幼卫生纵向数据中的应用[J].四川大学学报(医学版),2005,36(3):422-425. 被引量:7
  • 2风笑天.追踪研究:方法论意义及其实施[J].华中师范大学学报(人文社会科学版),2006,45(6):43-47. 被引量:27
  • 3张佩(2002).心理学论文写作规范,北京:科学出版社.
  • 4Barzi, F., & Woodward, M. (2004). Imputations of missing values in practice: Results from imputations of serum cholesterol in 28 cohort studies. American Journal of Epidemiology, 160(1), 3445.
  • 5Barzi, F., Woodward, M., Marfisi, R. M., Tognoni, G., & Marchioli, R. (2006). Analysis of the benefits of a Mediterranean diet in the GISSI-Prevenzione study: A case study in imputation of missing values from repeated measurements. European Journal of Epidemiology, 21(1), 15-24.
  • 6Burton, A., &Altman, D. G. (2004). Missing covariate data within cancer prognostic studies: A review of current reporting and proposed guidelines. British Journal of Cance, 91(1),4-8.
  • 7Clarke, P., & Hardy, R. (2007). Methods for handling missing data. In A. Pickles, B. Maughan, & M. Wadsworth (Eds.), Epidemiological methods in life course research (Vol. 1, pp. 157-197).
  • 8New York: Oxford University Press. Daniels, M. J., & Hogan, J. W. (2008). Missing data in longitudinal studies: Strategies for bayesian modeling and sensitivity analysis. Boca Raton, Florida: CRC Press.
  • 9Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 1-38.
  • 10Diggle, P. J. (1989). Testing for random dropouts in repeated measurement data. Biometrics, 45(4), 1255-1258.

共引文献18

同被引文献273

引证文献3

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部