期刊文献+

多面拉希模型在医师资格考试第一阶段临床基本技能考试中的应用 被引量:2

The application of many facets Rasch model in the first stage clinical fundamental skills test of National Medical Licensing Examination phased examination
原文传递
导出
摘要 目的针对2018年医师资格考试临床类别分阶段考试第一阶段临床基本技能考试中评分者对评分标准的掌握程度进行评价,探讨标准化病人(standardized patients,SP)与考官评分的一致性,为相关研究提供参考。方法2018年,随机抽取参加医师资格考试临床类别分阶段考试第一阶段临床基本技能考试的某所学校,以其临床医学专业77名考生的沟通交流能力和人文关怀能力的分数作为研究对象。采用多面拉希模型(many facets Rasch model,MFRM),将评分者(包括2名考官和1名SP)的情景误差因素分离出来,对考生的沟通交流能力和人文关怀能力进行评估,并对评分者的内部一致性和评价的宽严度进行分析。结果77名考生能力估计值的平均数为2.75 logits(MFRM分析结果均采用洛基量尺logit作为基本单位),大部分考生的加权拟合检验量(Infit)小于1.5;评分者总体宽严度平均数为-0.55 logits;考官的宽严度平均数为-0.45 logits,SP的宽严度平均数为-0.70 logits,其差异无统计学意义(t=-0.129,P=0.903)。结论评分者对评分标准掌握较好,整体标准相对宽松,SP与考官评分的内部一致性较高。 Objective To evaluate the mastery of scoring standards by raters in 2018 clinical fundamental skills test of National Medical Licensing Examination phased the first stage,to explore the consistency between standardized patients(SP)and examiners'scores,and to provide more information for relevant research.Methods In 2018,based on the scores of communication capacity and humanistic care from 77 candidates in clinical fundamental skills test of the National Medical Licensing Examination phased the first stage in a randomly selected medical college,use the many facets Rasch model to calculate estimated ability of 77 candidates,analyze the internal consistency of raters and the leniency and strictness of evaluation.Results The results showed that the average estimated capacity of 77 candidates was 2.75 logits,and the most examinees'infit was less than 1.5.The average severity of the raters was-0.55,the severity of the examiners was-0.45,of the SP was-0.70,The difference was not statistically significant.Conclusions Raters have a good command of scoring standards,the overall standard is relatively loose.The scores of SP and examiners were consistent.
作者 卢燕 张颖 何惧 邹杰文 Lu Yan;Zhang Ying;He Ju;Zou Jiewen(Research and Evaluation Department,National Medical Examination Center,Beijing 100097,China;National Medical Examination Center,Beijing 100097,China)
出处 《中华医学教育杂志》 2020年第4期311-315,共5页 Chinese Journal of Medical Education
关键词 医师资格考试 评分者误差 拉希理论 多面拉希模型 标准化病人 National medical licensing examination Rater error Rasch theory Many facets Rasch model Standardized patients
  • 相关文献

参考文献3

二级参考文献61

  • 1孙晓敏,张厚粲.表现性评价中评分者信度估计方法的比较研究——从相关法、百分比法到概化理论[J].心理科学,2005,28(3):646-649. 被引量:45
  • 2孙晓敏,张厚粲.国家公务员结构化面试中评委偏差的IRT分析[J].心理学报,2006,38(4):614-625. 被引量:36
  • 3王小华.北京师范大学硕士学位论文,2003:26,41.
  • 4James,L.R.,Demaree,R.G.,& Wolf,G.Estimating within-group interrater reliability with and without response bias.Journal of Applied Psychology,1984,69(l):85-98.
  • 5Kozlowski,S.W.J.,& Hattrup,K.A disagreement about within-group agreement:Disentangling issues of consistency versus consensus.Journal of Applied Psychology,1992,77(2):161-167.
  • 6James,L.R.,Demaree,R.G.,& Wolf,G.Rwg:An assessment of within-group interrater agreement.Journal of Applied Psychology,1993,78(2):306-309.
  • 7Lindell,MichaeK.;Brandt,ChristinaJ.Assessing Interrater Agreement on the Job Relevance of a Test:A Comparison of the CVI,T,rWG(J),and r*WG(J) Indexes.Journal of Applied Psychology,1999,84(4):640-647.
  • 8Cohen,Ayala;Doveh,Etti; Eick,Uri.Statistical Properties of the rWG(J) Index of Agreement.Psychological Methods,2001,6(3):297-310.
  • 9Dunlap,William P.; Burke,Michael J.; Smith-Crowe,Kristin.Accurate Tests of Statistical Significance for rWG and Average Deviation Interrater Agreement Indexes.Journal of Applied Psychology,2003,88(2):356-362.
  • 10Fin,R.H.A note on estimating the reliability of categorical data.Educational and psychological measurement,1970,30:71-76.

共引文献53

同被引文献26

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部