分类视角下认知诊断测验项目区分度指标及应用被引量：4

An Item Discrimination Index and Its Application in Cognitive Diagnostic Assessment on a ClassificationOriented View

下载PDF

导出

摘要在认知诊断中还没有指标能在无作答数据情况下直接评价项目的属性分类准确率或属性判准率。项目水平上的属性分类准确率,与项目属性向量、项目参数、先验分布和作答反应等有关。综合各个影响因素定义了项目水平上的属性期望分类准确率指标,并将其用于组卷。模拟研究显示:新指标可十分准确地评价项目的属性判准率,新指标对于项目筛选十分重要;以模式分类准确率为评价指标,基于新指标的组卷方法与经典的组卷方法表现相当。 The existing studies suggested that item quality is closely relevant to the number of attributes required by an item, item parameters, and the prior distribution of attribute patterns in cognitive diagnostic assessment. Several studies focused on the design of Q-matrix and showed that items required only one attribute are important for classification. There are some works that provided two basic sets of item discrimination index to measure discriminatory power of an item. The first one is based on descriptive measures from classical test theory, such as the global item discrimination index, and the second index is based on information measures from item response theory, including cognitive diagnosis index （CDI）, attribute discrimination index （ADI）, modified CDI and ADI. Results showed a strong relationship between these indices and the average correct classification rates of attributes. But their relationship to the indices may change as a function of the distribution of attributes. There lacks an item quality index as a measure of item＇s correct classification rates of attributes. The purpose of this study was to propose an item discrimination index as a measure of correct classification rate of attributes based on Q-matrix, item parameters, and the distribution of attributes. Firstly, an attribute-specific item discrimination index, called item expected attribute matched rate （EAMR）, was introduced. Secondly, a heuristic method was presented using EAMR for test construction. The first simulation study was conducted to evaluate the performance of EAMR under the deterministic input noisy ＂and＂ gate （DINA） model. Several factors were manipulated for five independent attributes in this study. Four levels of correlation between latent attributes, p=.00, p=.50, p=.75, and p=.95, were considered. Items were categorized into five groups according to the number of attributes measured by each item. Item discrimination power was set at three levels, high, medium, and low. High level meant relatively smaller guessing and slip parameters, which were randomly generated from a uniform distribution U（.05,.25）. Medium-level and low-level item parameters were randomly drawn from uniform distributions U（.05, .40） and U（.25, .45）. Next, 1000 items were simulated with the q-vector randomly selected from all possible attribute patterns measuring at least one attribute. Results showed that the new index performed well in that their values matched closely with the simulated correct classification rates of attributes across different simulation conditions. The second simulation study was conducted to examine the effectiveness of the heuristic method for test construction. The test length was fixed to 50 and simulation conditions are similar to those used in the first study. Results showed that the heuristic method based on the sum of EAMRs yielded comparable performance to the famous CDI. These indices can provide test developers with a useful tool to evaluate the quality of the diagnostic items. The attribute-specific item discrimination index will provide researchers and practitioners a way to select the most appropriate item and test that they want to measure with greater accuracy. It will be valuable to explore the applications and advantages of using the EAMR for developing item selection algorithm or termination rule in cognitive diagnostic computerized adaptive testing.

作者汪文义宋丽红丁树良 Wang Wenyi, Song Lihong2, Diog Shulian(1.School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, 530022）（2Elementary Educational College, Jiangxi Normal University, Nanchang, 330022)

机构地区江西师范大学计算机信息工程学院江西师范大学初等教育学院

出处《心理科学》 CSSCI CSCD 北大核心 2018年第2期475-483,共9页 Journal of Psychological Science

基金国家自然科学基金项目(31500909 31360237 31160203) 全国教育科学规划教育部重点课题(DHA150285) 江西省自然科学基金项目(20161BAB212044) 江西省教育科学2013年度一般课题(13YB032) 江西省社会科学规划项目(17JY10) 国家社会科学基金项目(16BYY096) 江西师范大学青年成长基金江西师范大学博士启动基金的资助

关键词分类准确率项目属性期望分类准确率组卷确定性输入噪音与门模型 correct classification rate, item expected attribute matched rate, test construction, the DINA model

分类号 B842.1 [哲学宗教—基础心理学]

引文网络
相关文献

参考文献8

1丁树良,毛萌萌,汪文义,罗芬,CUI Ying.教育认知诊断测验与认知模型一致性的评估[J].心理学报,2012,44(11):1535-1546. 被引量：35
2丁树良,汪文义,罗芬,熊建华.可达阵功能的不可替代性[J].江西师范大学学报（自然科学版）,2016,40(3):290-294. 被引量：7
3丁树良,汪文义,杨淑群.认知诊断测验蓝图的设计[J].心理科学,2011,34(2):258-265. 被引量：69
4丁树良,杨淑群,汪文义.可达矩阵在认知诊断测验编制中的重要作用[J].江西师范大学学报（自然科学版）,2010,34(5):490-494. 被引量：81
5郭磊,郑蝉金,边玉芳,宋乃庆,夏凌翔.认知诊断计算机化自适应测验中新的选题策略:结合项目区分度指标[J].心理学报,2016,48(7):903-914. 被引量：14
6罗照盛,喻晓锋,高椿雷,李喻骏,彭亚风,王睿,王钰彤.基于属性掌握概率的认知诊断计算机化自适应测验选题策略[J].心理学报,2015,47(5):679-688. 被引量：15
7汪文义,丁树良,宋丽红.兼顾测验效率和题库使用率的CD-CAT选题策略[J].心理科学,2014,37(1):212-216. 被引量：14
8张淑梅,辛涛,曾莉,孙佳楠.2PL模型的EM缺失数据处理方法研究[J].应用概率统计,2011,27(3):241-255. 被引量：6

二级参考文献126

1丁树良,罗芬.求偏序关系Hasse图的算法[J].江西师范大学学报（自然科学版）,2005,29(2):150-152. 被引量：12
2林海菁,丁树良.具有认知诊断功能的计算机化自适应测验的研究与实现[J].心理学报,2007,39(4):747-753. 被引量：21
3丁树良,汪文义,杨淑群.认知诊断测验编制的原则.中国科技论文在线,http://www.paper.edu.cn.2009.
4Leighton J P, Gierl M J, Hunka S M. The attribute hierarchy method for cognitive assessment: a variation on Tatsuoka' s rule-space approach [J]. Journal of Educational Measurement, 2004,41 (3) :205-237.
5Ding Shu-liang, Luo Fen, Cai Yan, et al. Complement to Tatsuoka' s Q matrix theory [ C]. Shigemasu K, Okada A, Imaizumi T, et al. New Trends in Psychometrics,Tokyo:Universal Academy Press,2008:417-424.
6Samejima F. A cognitive diagnosis method using latent trait models: competency space approach and its relationship with DiBello and Stout's unified cognitive-psychometric diagnosis model [ C]. Nichols P D, Chipman S F, Brcnnan R L. Cognitively Diagnostic Assessment, NJ: Erlbatun, 1995 : 391-410.
7Tatsuoka K K. Cognitive assessment an introduction to the rule space method [ M]. New York: Routledge Taylor & Francis Group, 2009.
8Henson R, Douglas J. Test construction for cognitive diagnosis [ J]. Applied Psychological Measurement, 2005,29:262-277.
9Zeng Ling-yan, Ding Shu-hang, Gan Deng-wen. Test construction for cognitive diagnosis [ C]. Shenzhen: Asia-Pacific Conference on Wearable Computing Systems, 2010.
10Kuang Zheng, Ding Shu-liang, Xu'Zhi-yong. Application of support vector machine to cognitive diagnosis [ C]. Shenzhen: Asia-Pacific Conference on Wearable Computing Systems,2010.

共引文献141

1秦春影,刘小伟,徐新爱,卢昕.考虑属性间关系的诊断测验分类:贝叶斯网模型与DINA模型的比较[J].统计与决策,2021(8):40-45. 被引量：1
2钟志强.基于纵向认知诊断模型的形成性评价研究——以中学物理欧姆定律教学为例[J].鞍山师范学院学报,2023,25(6):26-31.
3王应选,何承源.复数域上L-正交矩阵和R-正交矩阵[J].西南师范大学学报（自然科学版）,2012,37(12):13-17.
4杨淑群,丁树良.有效对象的判定理论与方法[J].江西师范大学学报（自然科学版）,2011,35(1):1-4. 被引量：9
5陈平,李珍,辛涛.认知诊断计算机化自适应测验的题库使用均匀性初探[J].心理与行为研究,2011,9(2):125-132. 被引量：18
6汪文义,丁树良,游晓锋.计算机化自适应诊断测验中原始题的属性标定[J].心理学报,2011,43(8):964-976. 被引量：32
7孙佳楠,张淑梅,辛涛,包钰.基于Q矩阵和广义距离的认知诊断方法[J].心理学报,2011,43(9):1095-1102. 被引量：32
8尚志勇,丁树良.认知诊断自适应测验选题策略探新[J].江西师范大学学报（自然科学版）,2011,35(4):418-421. 被引量：11
9吴智辉,甘登文,丁树良.可达阵在认知诊断选题策略中的运用研究[J].江西师范大学学报（自然科学版）,2011,35(4):422-426. 被引量：3
10许志勇,丁树良,杨庆红.S-P表法的改进和应用[J].江西师范大学学报（自然科学版）,2011,35(5):543-547. 被引量：2

同被引文献24

1涂冬波,蔡艳,戴海琦,丁树良.一种多级评分的认知诊断模型:P-DINA模型的开发[J].心理学报,2010,42(10):1011-1020. 被引量：55
2丁树良,杨淑群,汪文义.可达矩阵在认知诊断测验编制中的重要作用[J].江西师范大学学报（自然科学版）,2010,34(5):490-494. 被引量：81
3唐小娟,丁树良,毛萌萌,俞宗火.基于属性层级结构的认知诊断测验的组卷[J].心理学探新,2013,33(3):252-259. 被引量：6
4蔡艳,涂冬波.属性多级化的认知诊断模型拓展及其Q矩阵设计[J].心理学报,2015,47(10):1300-1308. 被引量：15
5林毓锜.试论学习路径与升华型学习及其启示[J].高等教育研究,2015,36(11):12-18. 被引量：8
6张民选,黄华.自信·自省·自觉——PISA2012数学测试与上海数学教育特点[J].教育研究,2016,37(1):35-46. 被引量：39
7宋丽红,汪文义,戴海琦,丁树良.认知诊断模型下整体和项目拟合指标[J].心理学探新,2016,36(1):79-83. 被引量：4
8詹沛达,边玉芳,王立君.重参数化的多分属性诊断分类模型及其判准率影响因素[J].心理学报,2016,48(3):318-330. 被引量：19
9丁树良,汪文义,罗芬,熊建华.可达阵功能的不可替代性[J].江西师范大学学报（自然科学版）,2016,40(3):290-294. 被引量：7
10郭磊,郑蝉金,边玉芳,宋乃庆,夏凌翔.认知诊断计算机化自适应测验中新的选题策略:结合项目区分度指标[J].心理学报,2016,48(7):903-914. 被引量：14

引证文献4

1张怡,武小鹏.两岸四地学生的数学核心素养研究[J].现代教育技术,2021,31(12):51-60. 被引量：1
2张怡.基于认知诊断的学习测验开发、应用与启示——以Tatsuoka的分数减法测验为例[J].教育测量与评价,2022(5):71-82.
3马大付,秦春影,喻晓锋,何催.项目区分度指标在属性多水平和混合计分项目下的组卷研究[J].心理与行为研究,2023,21(6):760-769.
4马大付,秦春影,杨建芹,徐新爱,喻晓锋.认知诊断测验的自动组卷方法[J].心理学探新,2023,43(6):550-557.

二级引证文献1

1李衍勋,丁锐.基于认知诊断的学习进阶模型构建——以义务教育阶段概率概念为例[J].济宁学院学报,2022,43(6):87-95.

1李雪影.打开那扇门[J].作文与考试（初中版）,2018,0(13):6-7.
2王继伟,贾联珍.项目前期工作与项目库建设[J].山区经济,1999,0(6):21-22.
3温占考,易秀双,刘勇,李婕,王兴伟.基于属性向量协同过滤推荐算法并行化[J].计算机工程与设计,2018,39(2):425-429. 被引量：1
4业巧林,闫贺.基于最小二乘的孪生有界支持向量机分类算法[J].华中科技大学学报（自然科学版）,2018,46(3):30-35. 被引量：8
5张昊,周颖帆,尤薇佳.基于二模网络分析的众筹投资者研究[J].管理现代化,2018,38(1):7-10.
6徐世龙.李权事件发微[J].陕西社会科学论丛,2011,2(4):82-83.
7宗利永,周雪卉.出版众筹出资者参与行为特征分析及其优化——基于众筹项目聚类的视角[J].科技与出版,2018(1):73-78. 被引量：2
8梁运球.腹膜外腹腔镜疝气修补术与传统疝修补术治疗腹股沟疝的疗效比较[J].深圳中西医结合杂志,2018,28(1):121-122. 被引量：7
9刘晶.企业拓展水务投资项目的决策分析[J].现代营销（下）,2018(1):97-97.
10余松,吴延琳,王主丁,张漫.中压配电网规划项目优化排序混合方法[J].智慧电力,2018,46(1):93-99. 被引量：11

心理科学

2018年第2期

浏览历史

内容加载中请稍等...

分类视角下认知诊断测验项目区分度指标及应用被引量：4

参考文献8

二级参考文献126

共引文献141

同被引文献24

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

分类视角下认知诊断测验项目区分度指标及应用 被引量：4

参考文献8

二级参考文献126

共引文献141

同被引文献24

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

分类视角下认知诊断测验项目区分度指标及应用被引量：4