
A Review of the Applications and Recent Developments of Vertical Scaling (cited by: 10)

Vertical Scaling: The Development and Application
Abstract: Driven by practical needs, vertical scaling methods have developed rapidly in recent years. From a testing perspective, this review describes the full vertical scaling process in detail: deciding whether to use vertical scaling, drafting the test blueprint (two-way specification table), constructing the developmental score scale, choosing software, and reporting results. Many problems remain unresolved across this process. Meanwhile, as other measurement methods have advanced, vertical scaling has been combined with them and further refined. Overall, the development and refinement of vertical scaling depend on two things: improvements and innovations in models and parameter estimation methods, and researchers' deepening understanding of the nature of achievement development.
Source: Psychological Exploration (《心理学探新》), CSSCI, 2011, No. 5, pp. 472-476 (5 pages)
Keywords: vertical scaling; developmental score scale; achievement development