蕴涵量表法在HSK阅读理解测验公平性研究中的应用被引量：2

Using Implicational Scaling Procedure to Detect the Differential Passage Difficulty Order on the Reading Comprehension Test of HSK

下载PDF

导出

摘要阅读理解能力测验中所选择的文章在内容方面对不同专业背景的考生亚团体是否具有公平性的问题,是测验效度高低的重要证据,也是测验效度验证(validation)的重要环节。本研究以中国语言与文学专业考生为目标组,分别将经济学专业和生物医学专业考生作为参照组,采用效标测量和蕴涵量表分析相结合的方法,对HSK(高等)阅读理解测验的文章难度对三个不同专业背景的考生组的公平性问题进行了检验。研究结果表明,两个参照组考生尽管具有各自的相对专业优势,但他们在六篇阅读材料上获得的难度排列顺序与目标组考生完全一致;虽然目标组考生不具备汉语知识以外的其他专业优势,但因为HSK考试所选择的阅读材料没有涉及语言知识本身以外的特殊专业要求,因而测验对三个不同专业背景的考生具有较高的公平性。 As a state-level standardized test, the Chinese Proficiency Test （HSK） is designed to measure the general language proficiency of those whose native language is not Chinese. But within these groups there may be subgroups that differ in ways other than the language ability of interest. These differences may affect their test performance, and hence the validity of inferences made on the basis of test scores. When this systematic difference in test performance occurs that appear to be associated with characteristics not logically related to the ability in question, we must fully investigate the possibility that the test is biased. This research, based on the evaluation of examinees＇ reading proficiency levels through their teachers,using implicational scaling procedure, specifically addresses itself to the investigation of fairness for three subgroups： Chinese and Chinese literature group, economics and business group, and biology and medicine group, on the reading comprehension test of HSK. The empirical research indicates that the difficulty order of six passages is completely same across the three subgroups, the implicational scaling procedure could be used to detect differential item or passage functioning.

作者柴省三

机构地区北京语言大学汉语水平考试中心

出处《考试研究》 2012年第5期54-62,共9页 Examinations Research

基金北京语言大学校级科研项目(项目编号:11YB01)研究成果之一

关键词蕴涵量表测验公平性阅读理解测验 HSK 构念效度测验偏差 Implicational Scaling, Test Fairness, Reading Comprehension Test, HSK, Construct Validity, Test Bias

分类号 G424.74 [文化科学—课程与教学论]

引文网络
相关文献

参考文献19

1Shealy, R. & Stout, W. , A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF [ J ], Psychometrika, 1993,58 (2), 159 - 194.
2Dorans, N. J. & Kulick, E. M. , Demonstrating the utility of the standardization approach to assessing unexpected differential item performance on the Scholastic Aptitude Test [ J ] , Journal of Educational Measurement, 1986,23 ( 2 ) , 355 - 368.
3Holland, P. W. & Thayer, D. T. , Differential item functioning and the Mantel-Haenszel procedure, In H. Wainer & H. I. Braun (Eds.),Test validity[C], (pp. 129 - 145), Hillsdale, NJ : Laurence Erlbaum, 1988.
4Dorans, N. J. & Kulick, E. M. , Assessing unexpected differential item performance of female candidates on SAT and TSWE forms administered in December 1977:An application of the standardization approach (ETS Research Report RR -88 - 9 ) [ R ] , Princeton, NJ : Educational Testing Service, 1983.
5Wainer, H. , Brandlow, E. T. & Wang, X. , Testlet response theory and its applications[ M ], New York : Cambridge University Press ,2007.
6Geranpayeh, A. & Kunnan, A. J, Differential item functioning in terms of age in the Certificate in Advanced English Examination [ J ] , Language Assessment Quarterly, 2007,4 (2) , 190 - 222.
7任杰,谢小庆.中国少数民族考生与外国考生HSK成绩的公平性分析[J].心理学探新,2002,22(2):51-56. 被引量：14
8郭树军(1989).HSK阅读理解试题的设计,见《汉语水平考试(HsK)研究》,北京:现代出版社.
9Alderson, J. C. , Assessing Reading [ M ] , Cambridge : Cambridge University Press,2000.
10Pae, T. I. , DIF for examinees with different academic backgrounds [ J ], Language Testing,2004,21 ( 1 ) ,53 - 73.

二级参考文献14

1曾秀芹,孟庆茂.项目功能差异及其检测方法[J].心理科学进展,1999,9(2):41-47. 被引量：27
2董圣鸿,马世晔.三种常用DIF检测方法的比较研究[J].心理学探新,2001,21(1):43-48. 被引量：21
3Chang H，J Educat Measarement，1996年，33卷，333--353页
4曾秀芹孟庆茂.项目功能差异的简介[J].心理学探新,1998,(1).
5Hua-Hua Chang,John Mazzeo,Louis Roussos.Detecting DIF for Polytomously Scored Item:An Adaptation of the SIBTEST Procedure[J].Journal of Educational Measurement, Fall,1996.
6Neil J.Dorans, Paul W.Holland.DIF detection and description: Mantel-Haenszel and Standardization[A].Differential Item Functioning[C].Lawrence Erlbaum Associates, Hillsdale, New Jersy,1993.
7Kathleen A,O'Neill, W Miles Mcpeek).Item and Test Charateristics That are Associated with Differential Item Functioning[A].Differential Item Functioning, Lawrence Erlbaum Associates, Hillsdale,New Jersy.Journal of Educational Measurement, Summer 1996.
8Roussos, Stout.Simulation Studies of the Effects of Small Sample Size and Studied Item Qrqmeters on SIBTEST and Mantel-Haenszel Type 1 Error Performance,1995.
9教育部考试中心资料室.教育考试图书资料摘编(合订本)[C].1997.
10[11]Hua-Hua Chang, John Mazzeo, Louis Roussos. Detecting DIF for PolytomouslyScored Items: An Adaptation of the SIBTESY Procedure[J] .Journal of EducationalMeasurement, 1996.

共引文献56

1汤楚,李运华,谭化芝,张猷玫.临床问卷中的项目功能差异分析[J].心理月刊,2023(9):228-230.
2朱正才,李俊敏.《中国英语能力等级量表》描述语偏差研究[J].现代外语,2021(1):113-122. 被引量：10
3李清华,孔文.TEM-4阅读测试的DIF研究[J].中国外语,2009(1):53-60. 被引量：24
4江西师大"现代教育和心理测量通用分析系统"研制组,漆书青,周骏,张青华.用信息函数法对标准参照测验作质量分析[J].心理与行为研究,2003,1(1):34-39. 被引量：20
5任杰,谢小庆.中国少数民族考生与外国考生HSK成绩的公平性分析[J].心理学探新,2002,22(2):51-56. 被引量：14
6严芳,张增修.用Logistic Regression侦察题目差异功能[J].应用心理学,2001,7(1):57-62. 被引量：1
7应心.你知道DIF吗?[J].外国中小学教育,2005(11):45-48. 被引量：1
8陆建明.社会考试试题内容文化公平性的审核机制[J].中国成人教育,2006(1):59-60. 被引量：1
9骆方,张厚粲.检验项目功能差异的两类方法—CFA和IRT的比较[J].心理学探新,2006,26(1):74-78. 被引量：12
10刘曦,张建新.项目功能差异在临床问卷分析中的应用[J].中国临床心理学杂志,2006,14(4):349-351. 被引量：3

同被引文献37

1刘镰力.中国汉语水平考试(HSK)的等级体制[J].世界汉语教学,1999,13(3):17-23. 被引量：3
2张凯.语言测验和乔姆斯基理论[J].世界汉语教学,1998,12(2):78-85. 被引量：6
3陈宏.在语言能力测验中如何建立结构效度[J].语言教学与研究,1997(2):78-93. 被引量：12
4刘英林.汉语水平考试（HSK）的理论基础探讨[J].汉语学习,1994(1):40-48. 被引量：4
5刘英林,郭树军,王志芳.汉语水平考试(HSK)的性质和特点[J].世界汉语教学,1988,2(2):110-120. 被引量：10
6陈宏.结构效度与汉语能力测验——概念和理论[J].世界汉语教学,1997,11(3):30-41. 被引量：7
7陈宏.第二语言能力结构研究回顾[J].世界汉语教学,1996,10(2):47-53. 被引量：16
8张凯.语言测验的测度和精度[J].语言文字应用,2004(4):60-68. 被引量：5
9北京语言大学汉语水平考试中心"HSK改进工作"项目组,王佶旻.汉语水平考试(HSK)改进方案[J].世界汉语教学,2007,21(2):126-135. 被引量：10
10郭树军.汉语水平考试(HSK)项目内部结构效度检验[c].汉语水平考试研究论文选.北京:现代出版社.1995.

引证文献2

1柴省三.阅读理解考试篇章数量与题目数量拟合度研究[J].中国考试,2014(5):3-11.
2赵琪凤.汉语水平考试的历史回顾及研究述评[J].中国考试,2016(9):47-53. 被引量：4

二级引证文献4

1张园,李亚男,杨琳静.HSK和MHK测验等值分析[J].考试研究,2019,0(1):86-97. 被引量：1
2赵琪凤.汉语国际教育考试体系发展研究[J].语言战略研究,2020,5(2):71-79. 被引量：2
3丁俊玲.基于判断题及选项的雅思和HSK真题对比分析[J].语文学刊,2020,40(3):77-84.
4郑英,郑玥,张家维.HSK试卷架构对1-3级考生成绩的影响--以英语母语者为例[J].国际中文教育（中英文）,2021,6(3):50-59. 被引量：1

1唐蕾.高校家庭经济困难学生生活事件、主观幸福感关系研究[J].知识经济,2010(17):57-58.
2杨玲.“核心自我评价”对中小学教师职业认同的影响[J].教师教育研究,2015,27(2):49-53. 被引量：12
3刘万伦.对测验的构念效度的理解[J].淮南师范学院学报,2006,8(6):108-110. 被引量：3
4郝玉英.小组教学的公平性问题探讨——兼论构建小组教学的新策略[J].教学月刊（中学版）,2006(7):37-39.
5张璐.论《论语》中“因材施教”教育思想及其在对外汉语教学中的体现[J].现代语文（上旬．文学研究）,2013(12):74-76. 被引量：2
6欧晓丽.从育人的视角看小学语文教师的任务[J].中国科教创新导刊,2012(24):222-222.
7吴瑗,刘仿,米娜,陈群.不同层次医学免疫学教学体会[J].卫生职业教育,2008,26(5):56-57. 被引量：1
8康春花,曾平飞,田伟.贯穿测验过程的公平分析思路[J].教育测量与评价（理论版）,2010(7):4-7. 被引量：3
9谢小庆.谈语言能力的考查——考试不一定可靠[J].内蒙古教育,2011(10):23-27. 被引量：1
10李丹.别丢了语文教学的根[J].新课程研究（下旬）,2009(7):185-186.

考试研究

2012年第5期

浏览历史

内容加载中请稍等...

蕴涵量表法在HSK阅读理解测验公平性研究中的应用被引量：2

参考文献19

二级参考文献14

共引文献56

同被引文献37

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

蕴涵量表法在HSK阅读理解测验公平性研究中的应用 被引量：2

参考文献19

二级参考文献14

共引文献56

同被引文献37

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

蕴涵量表法在HSK阅读理解测验公平性研究中的应用被引量：2