期刊文献+

认知诊断评价中的被试拟合研究

Person-Fit in Cognitive Diagnostic Assessment
下载PDF
导出
摘要 通常情况下,认知诊断需要通过认知诊断模型对被试进行诊断评价。认知诊断模型所生成诊断结果的有效性依赖于被试作答反应是否与所选用的模型拟合。因此,在对诊断结果进行评估的时候,需要通过被试拟合分析来对被试个体的作答反应与模型的拟合情况进行检验,以避免错误或无效的补救措施。本研究基于加权的得分残差,提出认知诊断评价中新的被试拟合指标R。模拟研究结果表明,R指标的一类错误率有较好的稳定性,对随机作答、疲劳、睡眠和创造性作答四种异常被试类型均有较高的统计检验力。并将R指标应用于分数减法实证数据,展示指标在实际测验中的使用过程。 Cognitive Diagnostic Assessment(CDA)is a widely used educational assessment.It can provide guidance for further study and teaching by analyzing whether the test-takers have acquired knowledge points or skills.<br/>In psychometrics,statistical methods for assessing the fit of an examinee’s item responses to a postulated psychometric model are often called person-fit statistic.The person-fit analysis can help to verify diagnostic results,and is mainly used to distinguish the abnormal examinees from the normal ones.The abnormal response patterns include“sleeping”behavior,fatigue,cheating,creative responding,random guessing responses and cheating with randomness,all of which can affect the deviation of examinee’s ability estimation.The person-fit analysis can help researchers identify the abnormal response patterns more accurately,so as to delete the abnormal responding examinees and improve the validity of the test.In the past,most of the person fit researches were mainly carried out under the Item Response Theory(IRT)framework,while only few papers have been published dealing with person-fit under the CDM framework.This study attempts to fill a gap in the literature by introducing new methods.In this study,a new person fit index(R)was proposed.<br/>In order to verify the validity of the newly developed person fit index,this study explores the type I error and statistical test power of R index under different item length,item discrimination and different misfit types of respondent,and compares it with existing methods RCI and lz.Type I error rate was defined as the proportion of flagged abnormal response patterns by a person fit statistic out of 1,000 generated normal response patterns from the DINA model.The control variables of this study include:the number of subjects is controlled to 1000,the cognitive diagnosis model is chosen as DINA model,the attributes are 6,and the Q matrix is fixed.Finally,to reflect the value of person fit index in practical application,the R index is applied to the empirical data of fractional subtraction.<br/>The results show that the type I error of R index is reasonable and stable at.05.In the aspect of statistical test power,with the improvement of item differentiation,the statistical test power of each index in different abnormal examinees is improved.With the increase in the number of items,most of the statistical power show an upward trend.For different types of abnormal subjects,R index perform best in the cases of random guessing responses and cheating with randomness.In the case of fatigue,sleep,and creative responding,the lz index perform better.In the empirical data study,the detection rate of abnormal examinees is 4.29%.<br/>With the increase of the discrimination of items and the increase of the number of items,the power of R index has improved,and the performance of R index is the most robust when the discrimination of item is low.The R index has a high power for the types of abnormal behavior such as creative responding behavior,random guessing responses and cheating with randomness.
作者 喻晓锋 唐茜 秦春影 李喻骏 Yu Xiaofeng;Tang Qian;Qin Chunying;Li Yujun(School of Psychology,Jiangxi Normal University,Nanchang330022;School of Mathematics and Information Science,Nanchang Normal University,Nanchang330032)
出处 《心理科学》 CSCD 北大核心 2024年第3期744-751,共8页 Journal of Psychological Science
基金 教育部教育考试院‘十四五’规划支撑专项课题“高考实施过程中的科目跨年分数的转换研究(NEEA2021050)” 国家自然科学基金项目(32360208,62341207)的资助。
关键词 认知诊断 被试拟合DINA模型 异常反应作答 cognitive diagnosis person fit DINA model aberrant response
  • 相关文献

参考文献7

二级参考文献46

  • 1周欣,王滨.4~5岁儿童对书面数符号的表征和理解能力的发展[J].心理科学,2004,27(5):1132-1136. 被引量:13
  • 2汪存友,余嘉元.一种新的基于神经网络的IRT项目参数估计模型[J].计算机应用,2006,26(4):992-994. 被引量:9
  • 3Tao Jian, Shi Ningzhong, Chang Huahua. Item-weighted likelihood method for ability estimation in tests composed of both dichotomous and polytomous items [ J ]. Journal of Educational and Behavioral Statistics, 2012,37 ( 2 ) : 298- 315.
  • 4Ding Shuliang, Luo Fen, Cai Yan, et al. Complement to Tatsuoka' s Q matrix theory [ A ]//Shigemasu K, Okada A, Imaizumi T, et al. New Trends in Psychometrics [ C ]. Tokyo :Universal Academy Press ,2008:417-423.
  • 5Sun Jianan, Xin Tao, Zhang Shumei, et al. A polytomousex- tension of the generalized distance discriminating method [ J]. Applied Psychological Measurement, 2013 ( 7 ) : 503- 521.
  • 6Tatsuoka K K. Architecture of knowledge structures and cognitive diagnosis: a statistical pattern classification ap- proach, in cognitively diagnostic assessments [ D ]. Erl- baum : Hillsdale, 1995:327-359.
  • 7Tatsuoka K K. Cognitive assessment:an introduction to the rule space method [ M ]. New York: Taylor & Francis Group, 2009.
  • 8Leighton J P, Gierl M J, Hunka S M. The attribute hierar chy method for cognitive assessment : a variation on Tatsuo- ka's ride-space approach [ J ]. Journal of Educational Measurement, 2004,41:205-237.
  • 9Tatsuoka K K. Architecture of knowledge structures and cognitive diagnosis: a statistical pattern classication ap- proach [ C ]. Nichols P D, Chipman S F, Brennan R L. Cognitively diagnostic assessments. Erlbaum: Hillsdale, 1995:327-359.
  • 10Tatsuoka K K. Cognitive assessment : an introduction to the rule space method [ M ]. New York: Taylor & Francis Group, 2009.

共引文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部