期刊文献+

缺失数据的结构方程建模:全息极大似然估计时辅助变量的作用 被引量:4

The mechanism of auxiliary variables in full information maximum likelihood–based structural equation models with missing data
下载PDF
导出
摘要 本研究通过蒙特卡洛模拟考查了采用全息极大似然估计进行缺失数据建模时辅助变量的作用。具体考查了辅助变量与研究变量的共缺机制、共缺率、相关程度、辅助变量数目与样本量等因素对参数估计结果精确性的影响。结果表明,当辅助与研究变量共缺时:(1)对于完全随机缺失的辅助变量,结果更容易出现偏差;(2)对于MAR-MAR组合机制,纳入单个辅助变量是有益的;对于MAR-MCAR或MAR-MNAR组合机制,纳入多于一个辅助变量的效果更好;(3)纳入与研究变量低相关的辅助变量对结果也是有益的。 In social and behavioral studies, missing data cannot be avoided in the process of data collection, especially in longitudinal studies. Because sample with missing data lose the balance characteristics of their complete counterparts, which may distort parameter estimates and degrade the performance of confidence intervals, special methods have to be developed for these analysis. Two modern missing data analysis techniques, maximum likelihood estimation and multiple imputation, have been widely studied in the methodological literature during the last decade. Since the maximum likelihood estimation and multiple imputation require the MAR(missing at random) assumption, including auxiliary variables can help fine-tune the missing data handling procedure, either by reducing bias or by increasing power. A useful auxiliary variable is a potential cause or a correlate of the incomplete variables in the analysis model. Notably, Graham(2003) proposed a "saturated correlates model", which allows us to include auxiliary variables in FIML-based structural equation models easily. However, some questions about the inclusion of auxiliary variables are needed to further study. The main research question was under what condition the auxiliary variables will be effective in the FIML-based structural equation modeling. The current study investigates the effect of including auxiliary variables during estimation of structural equation modeling parameters with FIML estimation through Monte Carlo simulation. It focused on the missing values of the auxiliary variables and variables of interests simultaneously. The simulation repeated 5,000 times for each of 576 combinations: common missing rates(5 percent, 10 percent, 15 percent, and 20 percent), missingmechanism combinations(MCAR-MCAR, MCAR-MAR, MCAR-MNAR, MAR-MCAR, MAR-MAR, and MAR-MNAR), correlations(low, moderate to high), number of auxiliary variables(1, 3, 5), and sample sizes(100, 200, 500, 1000). The evaluation criteria are bias and confidence intervals coverage of parameters. Data generates according to Enders(2008) model. All data generate and analyze by Mplus 7.0. Auxiliary variables without missing values outperformed auxiliary variables with missing values. Including auxiliary variables which had missing values in the analysis procedure was found to improve parameter estimation efficiently in most cases. Results showed that the bias was more serious when the missing mechanism of the auxiliary variables was MCAR than MNAR. In the FIML-based structural equation modeling, the inclusion of more than a single auxiliary variable for MAR-MCAR or MAR-MNAR combined mechanisms is beneficial, while for MAR-MAR combined mechanism, a single auxiliary variable would be better. In addition, it is beneficial to include auxiliary variables which had low correlation with variables of interests in this model. However, simulation results indicated that the common missing rates had little impact on bias. Overall, this study indicates that the inclusion of incomplete auxiliary variables is beneficial, even if the auxiliary variables and variables of interests have a relative proportion of missing data.
作者 王孟成 邓俏文 WANG Meng-Cheng DENG Qiaowen(Department of Psychology, Guangzhou Universit Center for Psychometric and Latent Variable Modeling, Guangzhou University, Guangzhou 510006, Chin)
出处 《心理学报》 CSSCI CSCD 北大核心 2016年第11期1489-1498,共10页 Acta Psychologica Sinica
基金 国家自然科学基金(31400904) 广州大学"创新强校工程"(2014WQNCX069)项目资助
关键词 缺失数据 缺失机制 结构方程 全息极大似然估计 辅助变量 蒙特卡洛模拟 missing data missing mechanism SEM full information maximum likelihood auxiliary variable Monte Carlo simulation
  • 相关文献

参考文献2

二级参考文献39

  • 1Rubin D B. Inference and missing data[J]. Biometrika, 1976, 63: 581-592.
  • 2Little R J A. A test of missing completely at random for multivariate data with missing values[J] JAmStatAssoc, 1988, 83: 1198-1202.
  • 3Ridout M S. Testing for random dropouts in repeated measurement data[J]. Biometrics, 1991, 47: 1617-1619.
  • 4Diggle P J. Testing for random dropouts in repeated measurements data[J]. Biometrics, 1989, 45: 1255-1258.
  • 5Listing J, Schlittgen R. Tests if dropouts are missed at random[J]. Biometrical Journal, 1998, 40: 929-935.
  • 6Listing,J.Schlittgen,R. Nonparametric test for random dropouts[J]. Biometrical Journal, 2003, 45: 113-127.
  • 7Denys P. Testing the mechanism of missing data[J/OL]. [2012:02] http://hal.archives-ouvertes.fr/hal- 00669339.
  • 8金勇进,邵军.《(缺失数据的统计处理》[M】.北京:中国统计出版社,2009.
  • 9王孟成.(2014),潜变量建模与Mplus应用:基础篇.重庆:重庆大学出版社.
  • 10Collins,L.M.,Schafer,J.L.,& Kam,C.-M.(2001),A comparison of inclusive and restrictive strategies in modem missing data procedures.Psychological Methods,6,330-351.

共引文献14

同被引文献358

引证文献4

二级引证文献71

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部