
Methodology of Equating Based on Probability Distribution and Its Applications (概率分布等值法及其应用)
Cited by: 3
Abstract: Within the framework of item response theory (IRT), and building on the existing literature, this paper proposes a method for developing new test equating criteria: many criteria can be viewed as derived by applying a transformation to the probability distribution of responses to the anchor items. On this basis, the relationship between two well-known equating criteria, the Haebara method and the Stocking-Lord method, is revealed, and a new equating criterion, the cosine criterion, is derived. To examine the behavior of the cosine criterion, a series of Monte Carlo simulation studies was conducted. The simulation results show that under the polytomous generalized partial credit model (GPCM) the cosine criterion performs better than both the Haebara and Stocking-Lord methods, whereas under the graded response model (GRM) and the 2-parameter logistic model (2PLM) it performs worse than the Haebara method but is comparable to the Stocking-Lord method. This finding is a reminder that whether an equating criterion is appropriately chosen depends not only on the range in which the equating coefficients fall but, even more closely, on the item response function (IRF).
This paper, divided into two parts, discusses two issues: (1) the methodology of developing a new test equating criterion and (2) the behavior of a new test equating method, referred to as the cosine criterion. Under item response theory (IRT), and in light of the probability distribution of an examinee's response to an item, the first part of the paper proposes a methodology derived from the published literature on test equating criteria. Under this methodology, test equating criteria can be regarded as functions of probability distributions. Based on this, a series of test equating approaches, such as the Haebara item characteristic curve equating method (Hcrit), the Stocking-Lord test characteristic curve equating method (SLcrit), the logcontract equating method, the SQRT method, and the weighted Haebara method, can be clearly illustrated. Further, the relationship between Hcrit and SLcrit is identified: if the mutual compensation of the responses to the anchor items is evident, then SLcrit is suitable, and if not, then Hcrit is more appropriate. In the second part of the paper, a new test equating criterion, the cosine criterion (COScrit), is discussed as an example of applying this methodology. The results of the Monte Carlo study show that, when the data fit the generalized partial credit model (GPCM), the new criterion behaves better than Hcrit and SLcrit in terms of the root mean squared deviations (RMSDs) of the three criteria: the RMSD for COScrit is smaller, and the difference is statistically significant.
When the data fit the 2-parameter logistic model (2PLM) or the graded response model (GRM), COScrit is comparable to SLcrit; in fact, it is considerably better than SLcrit provided that the equating coefficient A is not smaller than 1.2. If, however, coefficient A is smaller than 1.2, the opposite result is observed. In both cases, COScrit is inferior to Hcrit. The findings suggest that the behavior of a test equating criterion is related not only to the range of coefficient A but, more closely, to the item response function (IRF).
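The contrast between Hcrit and SLcrit drawn in the abstract can be illustrated with a minimal sketch under the 2PLM. The code below is not taken from the paper: it uses the standard textbook forms of the two criteria (Haebara sums squared differences item by item, Stocking-Lord squares the difference of the summed test characteristic curves), and the anchor item parameters, the ability grid, and all function names are hypothetical. The cosine criterion is not shown because the abstract does not give its formula.

```python
import math

def irf_2pl(theta, a, b):
    """2PLM item response function: probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def rescale(items, A, B):
    """Move (a, b) item parameters onto the target scale theta* = A*theta + B."""
    return [(a / A, A * b + B) for a, b in items]

def haebara_loss(A, B, new_items, old_items, thetas):
    """Hcrit: squared IRF differences, accumulated item by item."""
    moved = rescale(new_items, A, B)
    return sum(
        (irf_2pl(t, ao, bo) - irf_2pl(t, an, bn)) ** 2
        for t in thetas
        for (ao, bo), (an, bn) in zip(old_items, moved)
    )

def stocking_lord_loss(A, B, new_items, old_items, thetas):
    """SLcrit: squared difference of test characteristic curves (summed IRFs)."""
    moved = rescale(new_items, A, B)
    total = 0.0
    for t in thetas:
        tcc_old = sum(irf_2pl(t, a, b) for a, b in old_items)
        tcc_new = sum(irf_2pl(t, a, b) for a, b in moved)
        total += (tcc_old - tcc_new) ** 2
    return total

# Hypothetical anchor items (a, b) on the base scale, and the same items
# expressed on a second scale linked by assumed true coefficients A, B.
TRUE_A, TRUE_B = 1.2, 0.5
old_items = [(1.0, -1.0), (1.5, 0.0), (0.8, 1.0)]
new_items = [(a * TRUE_A, (b - TRUE_B) / TRUE_A) for a, b in old_items]
thetas = [-3.0 + 0.5 * k for k in range(13)]  # assumed ability grid on [-3, 3]

if __name__ == "__main__":
    for A, B in [(TRUE_A, TRUE_B), (1.0, 0.0)]:
        h = haebara_loss(A, B, new_items, old_items, thetas)
        sl = stocking_lord_loss(A, B, new_items, old_items, thetas)
        print(f"A={A}, B={B}: Hcrit={h:.6f}, SLcrit={sl:.6f}")
```

Because SLcrit sums the item-level differences before squaring, positive and negative differences on different anchor items can cancel, which is one way to read the "mutual compensation" mentioned in the abstract; Hcrit squares each item's difference first, so no such cancellation occurs. In practice either loss would be minimized over A and B with a numerical optimizer rather than evaluated at fixed values.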
Source: Acta Psychologica Sinica (《心理学报》), 2008, No. 1, pp. 101-108 (8 pages). Indexed in CSSCI, CSCD, and the Peking University Core Journals list.
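Funding: National Natural Science Foundation of China (60263005); Natural Science Foundation of Jiangxi Province (0411021); a key research project of the provincial Department of Science and Technology; a science and technology project of the provincial Department of Education; a provincial university humanities and social sciences research project (JY06201); National Education Examinations research planning project (2006JKS3063); Ministry of Health projects (JM20060070, KY200704).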
Keywords: equating criterion, developing methodology, cosine criterion, item response function
