

An Empirical Study of the Effects of Individualized Rater Feedback
Abstract: Feedback to raters is a common means of controlling rating quality, but previous studies have produced mixed findings on its effectiveness. This study invited three CET4 accredited raters to rate 30 mock CET4 essays and, for each rating, to give three reasons ranked by importance. One week after rating, each rater received an individualized feedback report covering the results of a Multi-Facet Rasch Model (MFRM) analysis (severity, internal consistency, and bias) and a coding analysis of their rating reasons. After reading the feedback, the raters rated another 30 mock CET4 essays and again gave three ranked reasons. A comparison of rater performance before and after feedback showed that the feedback deepened raters' understanding of the rating scale, improved their self-consistency, and reduced construct-irrelevant variance in their rating reasons.
Author: 徐鹰 (Xu Ying)
Source: Journal of Tianjin Foreign Studies University (《天津外国语大学学报》), 2014, No. 1, pp. 62-69 (8 pages)
Keywords: individualized feedback; Multi-Facet Rasch Model; reasons for rating








