期刊文献+

分类视角下认知诊断测验项目区分度指标及应用 被引量:4

An Item Discrimination Index and Its Application in Cognitive Diagnostic Assessment on a ClassificationOriented View
下载PDF
导出
摘要 在认知诊断中还没有指标能在无作答数据情况下直接评价项目的属性分类准确率或属性判准率。项目水平上的属性分类准确率,与项目属性向量、项目参数、先验分布和作答反应等有关。综合各个影响因素定义了项目水平上的属性期望分类准确率指标,并将其用于组卷。模拟研究显示:新指标可十分准确地评价项目的属性判准率,新指标对于项目筛选十分重要;以模式分类准确率为评价指标,基于新指标的组卷方法与经典的组卷方法表现相当。 The existing studies suggested that item quality is closely relevant to the number of attributes required by an item, item parameters, and the prior distribution of attribute patterns in cognitive diagnostic assessment. Several studies focused on the design of Q-matrix and showed that items required only one attribute are important for classification. There are some works that provided two basic sets of item discrimination index to measure discriminatory power of an item. The first one is based on descriptive measures from classical test theory, such as the global item discrimination index, and the second index is based on information measures from item response theory, including cognitive diagnosis index (CDI), attribute discrimination index (ADI), modified CDI and ADI. Results showed a strong relationship between these indices and the average correct classification rates of attributes. But their relationship to the indices may change as a function of the distribution of attributes. There lacks an item quality index as a measure of item's correct classification rates of attributes. The purpose of this study was to propose an item discrimination index as a measure of correct classification rate of attributes based on Q-matrix, item parameters, and the distribution of attributes. Firstly, an attribute-specific item discrimination index, called item expected attribute matched rate (EAMR), was introduced. Secondly, a heuristic method was presented using EAMR for test construction. The first simulation study was conducted to evaluate the performance of EAMR under the deterministic input noisy "and" gate (DINA) model. Several factors were manipulated for five independent attributes in this study. Four levels of correlation between latent attributes, p=.00, p=.50, p=.75, and p=.95, were considered. Items were categorized into five groups according to the number of attributes measured by each item. Item discrimination power was set at three levels, high, medium, and low. High level meant relatively smaller guessing and slip parameters, which were randomly generated from a uniform distribution U(.05,.25). Medium-level and low-level item parameters were randomly drawn from uniform distributions U(.05, .40) and U(.25, .45). Next, 1000 items were simulated with the q-vector randomly selected from all possible attribute patterns measuring at least one attribute. Results showed that the new index performed well in that their values matched closely with the simulated correct classification rates of attributes across different simulation conditions. The second simulation study was conducted to examine the effectiveness of the heuristic method for test construction. The test length was fixed to 50 and simulation conditions are similar to those used in the first study. Results showed that the heuristic method based on the sum of EAMRs yielded comparable performance to the famous CDI. These indices can provide test developers with a useful tool to evaluate the quality of the diagnostic items. The attribute-specific item discrimination index will provide researchers and practitioners a way to select the most appropriate item and test that they want to measure with greater accuracy. It will be valuable to explore the applications and advantages of using the EAMR for developing item selection algorithm or termination rule in cognitive diagnostic computerized adaptive testing.
作者 汪文义 宋丽红 丁树良 Wang Wenyi, Song Lihong2, Diog Shulian(1.School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, 530022) (2Elementary Educational College, Jiangxi Normal University, Nanchang, 330022)
出处 《心理科学》 CSSCI CSCD 北大核心 2018年第2期475-483,共9页 Journal of Psychological Science
基金 国家自然科学基金项目(31500909 31360237 31160203) 全国教育科学规划教育部重点课题(DHA150285) 江西省自然科学基金项目(20161BAB212044) 江西省教育科学2013年度一般课题(13YB032) 江西省社会科学规划项目(17JY10) 国家社会科学基金项目(16BYY096) 江西师范大学青年成长基金 江西师范大学博士启动基金的资助
关键词 分类准确率 项目属性期望分类准确率 组卷 确定性输入噪音与门模型 correct classification rate, item expected attribute matched rate, test construction, the DINA model
  • 相关文献

参考文献8

二级参考文献126

  • 1丁树良,罗芬.求偏序关系Hasse图的算法[J].江西师范大学学报(自然科学版),2005,29(2):150-152. 被引量:12
  • 2林海菁,丁树良.具有认知诊断功能的计算机化自适应测验的研究与实现[J].心理学报,2007,39(4):747-753. 被引量:21
  • 3丁树良,汪文义,杨淑群.认知诊断测验编制的原则.中国科技论文在线,http://www.paper.edu.cn.2009.
  • 4Leighton J P, Gierl M J, Hunka S M. The attribute hierarchy method for cognitive assessment: a variation on Tatsuoka' s rule-space approach [J]. Journal of Educational Measurement, 2004,41 (3) :205-237.
  • 5Ding Shu-liang, Luo Fen, Cai Yan, et al. Complement to Tatsuoka' s Q matrix theory [ C]. Shigemasu K, Okada A, Imaizumi T, et al. New Trends in Psychometrics,Tokyo:Universal Academy Press,2008:417-424.
  • 6Samejima F. A cognitive diagnosis method using latent trait models: competency space approach and its relationship with DiBello and Stout's unified cognitive-psychometric diagnosis model [ C]. Nichols P D, Chipman S F, Brcnnan R L. Cognitively Diagnostic Assessment, NJ: Erlbatun, 1995 : 391-410.
  • 7Tatsuoka K K. Cognitive assessment an introduction to the rule space method [ M]. New York: Routledge Taylor & Francis Group, 2009.
  • 8Henson R, Douglas J. Test construction for cognitive diagnosis [ J]. Applied Psychological Measurement, 2005,29:262-277.
  • 9Zeng Ling-yan, Ding Shu-hang, Gan Deng-wen. Test construction for cognitive diagnosis [ C]. Shenzhen: Asia-Pacific Conference on Wearable Computing Systems, 2010.
  • 10Kuang Zheng, Ding Shu-liang, Xu'Zhi-yong. Application of support vector machine to cognitive diagnosis [ C]. Shenzhen: Asia-Pacific Conference on Wearable Computing Systems,2010.

共引文献141

同被引文献24

引证文献4

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部