期刊文献+

校正的Bootstrap方法对概化理论方差分量及其变异量估计的改善 被引量:3

Using Adjusted Bootstrap to Improve the Estimation of Variance Components and Their Variability for Generalizability Theory
下载PDF
导出
摘要 Bootstrap方法是一种有放回的再抽样方法,可用于概化理论的方差分量及其变异量估计。用Monte Carlo技术模拟四种分布数据,分别是正态分布、二项分布、多项分布和偏态分布数据。基于p×i设计,探讨校正的Bootstrap方法相对于未校正的Bootstrap方法,是否改善了概化理论估计四种模拟分布数据的方差分量及其变异量。结果表明:跨越四种分布数据,从整体到局部,不论是"点估计"还是"变异量"估计,校正的Bootstrap方法都要优于未校正的Bootstrap方法,校正的Bootstrap方法改善了概化理论方差分量及其变异量估计。 Bootstrap is a returned re-sampling method used to estimate the variance component and their variability. Adjusted bootstrap method was used by Wiley in pxi design for normal data in 2001. However, Wiley did not compare the difference between adjusted method and unadjusted method when estimating the variability. To expand Wiley's 2001 study, our study applied Monte Carlo method to simulate four distribution data. The aim of simulation is to explore the effects of four different estimation methods when estimating the variability of estimated variance components for generalizability theory, The four distribution data are normal distribution data, dichotomous distribution data, polytomous distribution data and skewed distribution data. It is common that researchers focus on normal distribution data and neglect non-normal distribution data, yet non-normal distribution data could always be seen in tests such as TOEFL and GRE. There are several methods to estimate the variability of variance components, including traditional, bootstrap, jackknife and Markov Chain Monte Carlo (MCMC). Former research by Li and Zhang (2009) shows that bootstrap method is significantly better than traditional, jackknife, and MCMC methods in estimating the variability for four distribution data.Bootstrap method has superior cross-distribution quality when estimating the variability of estimated variance components. Li and Zhang (2009) also suggest that bootstrap method should be adopted with a "divide-and-conquer" strategy to obtain good estimated standard error and estimated confidence interval and the criteria of such strategy should be set to: boot-p for person, boot-pi for item, and boot-i for person and item. However, it is unclear that which of the bootstrap methods (adjusted and unadjusted) is better for boot-p, boot-pi, and boot-i. Therefore, our study intends to probe into this comparison as well. This aim of the study is to explore whether adjusted bootstrap method is superior to unadjusted method in improving the estimation of variance components and their variability relative for generalizability theory. The simulation is implemented in R statistical programming environment. To simulate skewed data, HyperbolicDist package is used. Some criteria are set to compare the four methods. The bias is considered when variance components and their standard errors are estimated. The smaller the absolute bias is, the more reliable the result is. The criterion of confidence intervals is "80% interval coverage". If the "80% interval coverage" is closer to 0.80, the confidence interval is more reliable. The results indicate that for four distribution data, adjusted bootstrap method is superior to unadjusted bootstrap method whether in point estimation of variance components or in variability estimation of variance components. For its improvement of the estimation of variance components and their variability for generalizability theory, adjusted bootstrap should be adopted as soon as possible.
出处 《心理学报》 CSSCI CSCD 北大核心 2013年第1期114-124,共11页 Acta Psychologica Sinica
基金 教育部人文社会科学研究青年基金项目(12YJC190016) 全国教育科学"十二五"规划教育部重点课题(GFA111009) 广东省教育科学"十二五"规划2011年度研究项目(2011TJK161)
关键词 概化理论 BOOTSTRAP方法 方差分量 方差分量变异量 蒙特卡洛模拟 Generalizability Theory Bootstrap method Variance component variability of estimated variance components Monte Carlo simulation
  • 相关文献

参考文献17

  • 1Brennan,R.L. Generalizability theory[M].New York:springer-verlag,2001.
  • 2Brennan,R.L. Unbiased estimates of variance components with bootstrap procedures[J].Educational and Psychological Measurement,2007,(05):784-803.doi:10.1177/0013164407301534.
  • 3Brennan,R.L,Harris,D.J,Hanson,B.A. The bootstrap and other procedures for examining the variability of estimated variance components in testing contexts(ACT Research Report Series87-7)[R].Iowa City,IA:American College Testing Program,1987.
  • 4Cui,Z.M,Kolen,M.J. Comparison of parametric and nonparametric bootstrap methods for estimating random error in equipercentile equating[J].Applied Psychological Measurement,2008,(04):334-347.doi:10.1177/0146621607300854.
  • 5Efron,B. Bootstrap method:Another look at the jackknife[J].Annals of Statistics,1979.1-26.
  • 6Efron,B. The jackknife,the bootstrap and other resampling plans[A].1982.
  • 7Efron,B,Tibshrani,R.J. Bootstrap methods for standard errors,confidence intervals,and other measures of statistical accuracy[J].Statistical Science,1986.54-57.
  • 8Efron,B,Tibshrani,R.J. An introduction to the Bootstrap[M].New York.Chapman and Hall,1993.
  • 9Fan,X.T. Using commonly available software for bootstrapping in both substantive and measurement analyses[J].Educational and Psychological Measurement,2003,(01):24-50.doi:10.1177/0013164402239315.
  • 10Gao,X.H. Generalizability of a state-wide science performance assessment[D].University of California,1992.

二级参考文献38

  • 1杨志明,张雷.用多元概化理论对普通话的测试[J].心理学报,2002,34(1):50-55. 被引量:21
  • 2American Educational Research Association, American Psychological Association, National Council on Measurementin Education (1985). Standards for educational and psychological testing. Washington, DC: Author.
  • 3American Educational Research Association, American Psychological Association, National Council on Measurement in Education (1999). Standards for educational and psychological testing (Rev. ed. ). Washington, DC: Author.
  • 4Brennan, R. L. (1992). Elements of generalizability theory (Rev. ed. ). Iowa City, IA: ACT.
  • 5Brennan, R. L. (2001). Generalizability theory. New York: Springer-Verlag.
  • 6Brennan, R. L. (2006). Unbiased estimates of variance components with bootstrap procedures: Detailed results (CASMA Research Report No 21). Iowa City, IA: Center for Advanced Studies in Measurement and Assessment, University of Iowa. Available from http://www.education. uiowa.edu/casma.
  • 7Brennan, R. L., Harris, D. J., & Hanson, B. A. (1987). The bootstrap and other procedures for examining the variability of estimated variance components in testing contexts (ACT Research Report Series87-7). Iowa City, IA: American College Testing Program.
  • 8Briggs, D. C., & Wilson, M. (2007). Generalizability in item response modeling. Journal of Educational Measurement, 44(2), 131-155.
  • 9Clauser, B. E., Harik, E, & Clyman, S. G. (2000). The generalizability of scores for a performance assessment scored with a computer-automated scoring system. Journal of Educational Measurement, 36(2), 245-262.
  • 10Clauser, B. E., Harik, P., & Margolis, M. J. (2006). A multivariate generalizability analysis of data from a performance assessment of physicaians' clinical skill. Journal of Educational Measurement, 43(3), 173-191.

共引文献15

同被引文献35

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部