
Validating the Essay Rating Scale of a Large-Scale English Test (cited by: 8)

Validating TEM-8 Rating Scale via MFRM and TAPs Based Evidence
Abstract: This study collected raters' scores on 130 essays from the Test for English Majors, Band 8 (TEM-8) and validated the TEM-8 essay rating scale along multiple dimensions, drawing on evidence from multi-facets Rasch analysis and think-aloud protocols. The results show that the rating scale broadly reflects the theoretical construct of writing and that its score levels are reasonably divided; most raters were able to use the scale effectively, and their ratings were highly reliable.
Abstract (as published in English): This validation study of the rating scale used in the writing assessment of the Test for English Majors (TEM-8) was based on the scores of 13 raters, each rating 10 TEM-8 essays, and on their think-aloud protocols recorded during rating. Quantitative analysis with the Multi-Facets Rasch Model and qualitative analysis with NVivo showed the following: 1) the rating scale is appropriately categorized and operates well during the rating process; 2) although a few raters' ratings exhibit some misfit and severity, the majority of raters are able to rate with high consistency using the scale.
Author: 陈建林 (Chen Jianlin)
Source: 《中国考试》 (Journal of China Examinations), 2016, No. 1, pp. 29-38 (10 pages)
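Funding: One of the research outcomes of the Ministry of Education Humanities and Social Sciences Youth Fund project "A Corpus-Based Contrastive Study of the Written Language of Tibetan and Han Middle School Students in Gansu" (Project No. 15YJC740004); supported by the "Lanzhou University Fundamental Research Funds for the Central Universities" (Project No. 2022014skzy001)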
Keywords: Multi-Facets Rasch Model; Think-aloud Protocols; Rating Criteria; Validation

