LGM模型中缺失数据处理方法的比较:ML方法与Diggle-Kenward选择模型被引量：3

LGM-based analyses with missing data: Comparison between ML method and Diggle-Kenward selection model

下载PDF

导出

摘要追踪研究中缺失数据十分常见。本文通过Monte Carlo模拟研究,考察基于不同前提假设的Diggle-Kenward选择模型和ML方法对增长参数估计精度的差异,并考虑样本量、缺失比例、目标变量分布形态以及不同缺失机制的影响。结果表明:(1)缺失机制对基于MAR的ML方法有较大的影响,在MNAR缺失机制下,基于MAR的ML方法对LGM模型中截距均值和斜率均值的估计不具有稳健性。(2)DiggleKenward选择模型更容易受到目标变量分布偏态程度的影响,样本量与偏态程度存在交互作用,样本量较大时,偏态程度的影响会减弱。而ML方法仅在MNAR机制下轻微受到偏态程度的影响。 In longitudinal studies, missing data are common. The missing not at random （MNAR） data may lead to biasd parameter estimates and even distort the results of analyses. In this article we compared two techniques based on different mechanisms [i.e., the maximum likelihood approach based on the Missing at Random （MAR） mechanism and the Diggle-Kenward selection model based on the MNAR mechanism] for handling different types of missing data using the Monte Carlo simulation method. Estimates of parameters and standard errors using each of these methods were contrasted under different model assumptions. Four possible influential factors were considered： the dropout missingness proportions, the sample size, the distribution shape （i.e., skewness and kurtosis）, and the missing mechanisms. The results indicated that （1） The Diggle-Kenward selection model were affected less by the missingness mechanism than the ML approach. At the MAR condition, the Diggle-Kenward selection model based on the MNAR mechanism kept stable and would provide similar estimation results with the ML approach based on the MAR assumption. At the MNAR condition, the ML approach was not much different from the Diggle-Kenward selection model in their variance of latent variances （σi2 and σs2 ） but had greater discrepancy in their means of the latent variables （μi and μs）. （2） The distribution shape had more impact on the Diggle-Kenward selection model. For the mean and variance of the intercept and the variance of the slope, the sample size and the degrees of skewness and kurtosis had significant interactions. With large sample sizes, the influence of distribution shape on the estimation precision would decrease. The ML approach was not easily affected by the distribution shape. （3） When fitting a growth curve model, compared to the means of the latent variables （μi and μs）, the variances （σi2 and σs2） were influenced much more by the distribution shape （i.e., the degree of skewness and kurtosis）. （4） The level of dropout missingness proportion was the major factor affecting the parameter estimation precision. Greater sample size would improve the estimation precision in most cases.

作者张杉杉陈楠刘红云

机构地区首都经济贸易大学劳动经济学院北京师范大学心理学院应用实验心理北京市重点实验室艾美仕市场调研咨询(上海)有限公司

出处《心理学报》 CSSCI CSCD 北大核心 2017年第5期699-710,共12页 Acta Psychologica Sinica

基金国家自然科学基金项目(31571152) 北京市与中央在京高校共建项目(019-105812) 未来教育高精尖创新中心中央高校基本科研业务费专项资金资助

关键词潜变量增长模型非随机缺失机制 Diggle-Kenward选择模型极大似然方法 latent growth model missing not at random Diggle-Kenward selection model maximum likelihood approach

分类号 B841 [哲学宗教—基础心理学]

引文网络
相关文献

参考文献1

1叶素静,唐文清,张敏强,曹魏聪.追踪研究中缺失数据处理方法及应用现状分析[J].心理科学进展,2014,22(12):1985-1994. 被引量：19

二级参考文献59

1茅群霞,李晓松.多重填补法Markov Chain Monte Carlo模型在有缺失值的妇幼卫生纵向数据中的应用[J].四川大学学报（医学版）,2005,36(3):422-425. 被引量：7
2风笑天.追踪研究:方法论意义及其实施[J].华中师范大学学报（人文社会科学版）,2006,45(6):43-47. 被引量：27
3张佩(2002).心理学论文写作规范,北京:科学出版社.
4Barzi, F., & Woodward, M. (2004). Imputations of missing values in practice: Results from imputations of serum cholesterol in 28 cohort studies. American Journal of Epidemiology, 160(1), 3445.
5Barzi, F., Woodward, M., Marfisi, R. M., Tognoni, G., & Marchioli, R. (2006). Analysis of the benefits of a Mediterranean diet in the GISSI-Prevenzione study: A case study in imputation of missing values from repeated measurements. European Journal of Epidemiology, 21(1), 15-24.
6Burton, A., &Altman, D. G. (2004). Missing covariate data within cancer prognostic studies: A review of current reporting and proposed guidelines. British Journal of Cance, 91(1),4-8.
7Clarke, P., & Hardy, R. (2007). Methods for handling missing data. In A. Pickles, B. Maughan, & M. Wadsworth (Eds.), Epidemiological methods in life course research (Vol. 1, pp. 157-197).
8New York: Oxford University Press. Daniels, M. J., & Hogan, J. W. (2008). Missing data in longitudinal studies: Strategies for bayesian modeling and sensitivity analysis. Boca Raton, Florida: CRC Press.
9Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 1-38.
10Diggle, P. J. (1989). Testing for random dropouts in repeated measurement data. Biometrics, 45(4), 1255-1258.

共引文献18

1黄菲菲,张敏强.社会网络分析中缺失数据的处理方法[J].心理技术与应用,2016,4(8):456-464. 被引量：3
2周敏林,章海涛,陆梦洁,钟伟华,刘玉秀.临床纵向数据缺失的随机效应模式混合模型及SAS实现[J].中国临床药理学与治疗学,2016,21(9):1012-1017. 被引量：2
3谢翘楚,姚毅.电网历史数据缺失及补录研究[J].四川理工学院学报（自然科学版）,2017,30(2):21-25. 被引量：1
4董书阳,梁熙,张莹,王争艳.母亲积极养育行为对儿童顺从行为的早期预测与双向作用：从婴儿到学步儿[J].心理学报,2017,49(4):460-471. 被引量：7
5林睿,陈鲁雁,王嘉梅,范菁,袁长森.基于语言模型的缺失数据追踪方法与应用分析[J].计算机与数字工程,2018,46(10):2034-2038. 被引量：1
6于力超.纵向抽样调查中缺失值的预防和处理方法[J].统计与决策,2018,0(20):9-13.
7胥彦,李超平.追踪研究在组织行为学中的应用[J].心理科学进展,2019,27(4):600-610. 被引量：28
8高霞,李瑞俊.缺失数据处理方法的研究及其在软测量技术中的应用[J].江西电力职业技术学院学报,2019,32(1):4-5. 被引量：1
9张沥今,陆嘉琦,魏夏琰,潘俊豪.贝叶斯结构方程模型及其研究现状[J].心理科学进展,2019,27(11):1812-1825. 被引量：27
10温忠麟,方杰,沈嘉琦,谭倚天,李定欣,马益铭.新世纪20年国内心理统计方法研究回顾[J].心理科学进展,2021,29(8):1331-1344. 被引量：21

同被引文献273

1汤丹丹,温忠麟.共同方法偏差检验:问题与建议[J].心理科学,2020,43(1):215-223. 被引量：433
2单志艳,孟庆茂.心理学中定量研究的几个问题[J].心理科学,2002,25(4):466-467. 被引量：18
3胡中锋,莫雷.论因素分析方法的整合[J].心理科学,2002,25(4):474-475. 被引量：25
4崔丽霞,郑日昌.20年来我国心理学研究方法的回顾与反思[J].心理学报,2001,33(6):564-570. 被引量：34
5温忠麟,张雷,侯杰泰,刘红云.中介效应检验程序及其应用[J].心理学报,2004,36(5):614-620. 被引量：7413
6刘鹏,雷蕾,张雪凤.缺失数据处理方法的比较研究[J].计算机科学,2004,31(10):155-156. 被引量：24
7周浩,龙立荣.共同方法偏差的统计检验与控制方法[J].心理科学进展,2004,12(6):942-950. 被引量：3536
8关丹丹,张厚粲,李中权.差异分数的信度分析[J].心理科学,2005,28(1):161-163. 被引量：2
9田晓明,傅珏生.多元总体均值差异显著性检验的研究[J].心理科学,2005,28(1):164-165. 被引量：4
10刘军,吴维库.心理测量平衡性研究与实例[J].心理科学,2005,28(1):170-174. 被引量：6

引证文献3

1温忠麟,方杰,沈嘉琦,谭倚天,李定欣,马益铭.新世纪20年国内心理统计方法研究回顾[J].心理科学进展,2021,29(8):1331-1344. 被引量：21
2刘源,都弘彦,方杰,温忠麟.国内追踪数据分析方法研究与模型发展[J].心理科学进展,2022,30(8):1734-1746. 被引量：6
3高霞,李瑞俊.缺失数据处理方法的比较研究[J].佳木斯职业学院学报,2019,35(3):259-259.

二级引证文献25

1张凯欣,张瑞宏.基于心理授权的中介作用探讨护士组织内人际和谐对职业高原的影响[J].中国医疗管理科学,2022,12(3):85-91. 被引量：2
2崔洪波,樊晏辰,蒋玉露.家庭功能对青少年利他行为的影响:有调节的中介效应[J].贵州师范学院学报,2022,38(6):61-66.
3许岳培,陆春雷,王珺,宋琼雅,贾彬彬,胡传鹏.评估零效应的三种统计方法[J].应用心理学,2022,28(4):369-384. 被引量：4
4温忠麟,谢晋艳,方杰,王一帆.新世纪20年国内假设检验及其关联问题的方法学研究[J].心理科学进展,2022,30(8):1667-1681. 被引量：6
5温忠麟,陈虹熹,方杰,叶宝娟,蔡保贞.新世纪20年国内测验信度研究[J].心理科学进展,2022,30(8):1682-1691. 被引量：11
6温忠麟,方杰,谢晋艳,欧阳劲樱.国内中介效应的方法学研究[J].心理科学进展,2022,30(8):1692-1702. 被引量：177
7方杰,温忠麟,欧阳劲樱,蔡保贞.国内调节效应的方法学研究[J].心理科学进展,2022,30(8):1703-1714. 被引量：26
8王阳,温忠麟,李伟,方杰.新世纪20年国内结构方程模型方法研究与模型发展[J].心理科学进展,2022,30(8):1715-1733. 被引量：40
9刘源,都弘彦,方杰,温忠麟.国内追踪数据分析方法研究与模型发展[J].心理科学进展,2022,30(8):1734-1746. 被引量：6
10宋兵,孙咏冰,刘慧娟,孙小坚,郭佳佳,王婉雪,王新华.初入高原工作的医务人员不同应对方式与焦虑抑郁的关系以及心理弹性的中介作用[J].中华全科医学,2022,20(8):1367-1371. 被引量：4

1叶素静,唐文清,张敏强,曹魏聪.追踪研究中缺失数据处理方法及应用现状分析[J].心理科学进展,2014,22(12):1985-1994. 被引量：19
2赵春.人有多少好心情[J].人民文摘,2007(6):17-27.
3石雷山,姜冬梅,高峰强.初中留守儿童的学业自我效能与学校适应:潜变量增长模型分析[J].应用心理学,2017,23(2):119-127. 被引量：8
4涂冬波,蔡艳,戴海琦,丁树良.一种多策略认知诊断方法:MSCD方法的开发[J].心理学报,2012,44(11):1547-1553. 被引量：14
5刘玥,刘红云.贝叶斯题组随机效应模型的必要性及影响因素[J].心理学报,2012,44(2):263-275. 被引量：16
6姚琦,马华维,李强.对新员工入职期望变化的一项纵向研究[J].心理学报,2007,39(6):1122-1130. 被引量：15
7赵顶位,戴海琦.R-RUM的参数估计及性能评价研究[J].心理学探新,2017,37(3):231-236.
8沐守宽,周伟.缺失数据处理的期望-极大化算法与马尔可夫蒙特卡洛方法[J].心理科学进展,2011,19(7):1083-1090. 被引量：15
9艾娟,高峰强.当代人格研究范式的新进展[J].湖南师范大学教育科学学报,2006,5(5):88-92. 被引量：2
10杨林山,曹亦薇.贝叶斯理论框架下的2种纵向缺失数据处理方法的比较——以潜在变量增长曲线模型为例[J].江西师范大学学报（自然科学版）,2012,36(5):461-465. 被引量：3

心理学报

2017年第5期

浏览历史

内容加载中请稍等...

LGM模型中缺失数据处理方法的比较:ML方法与Diggle-Kenward选择模型被引量：3

参考文献1

二级参考文献59

共引文献18

同被引文献273

引证文献3

二级引证文献25

相关作者

相关机构

相关主题

浏览历史

LGM模型中缺失数据处理方法的比较:ML方法与Diggle-Kenward选择模型 被引量：3

参考文献1

二级参考文献59

共引文献18

同被引文献273

引证文献3

二级引证文献25

相关作者

相关机构

相关主题

浏览历史

LGM模型中缺失数据处理方法的比较:ML方法与Diggle-Kenward选择模型被引量：3