Item Selection Methods for Balancing Test Efficiency with Item Bank Usage Efficiency in CD-CAT

Original title: 兼顾测验效率和题库使用率的CD-CAT选题策略

Cited by: 14

Abstract: Existing item selection strategies in CD-CAT emphasize test efficiency and pay too little attention to item bank usage. To address this, two new item selection strategies based on the DINA model, KLED and RHA, are introduced, and HA is also examined by simulation. The results show that PWKL and KLED have an advantage only in test efficiency; that stratifying KLED by attribute vector improves item bank usage somewhat, and that KLED extends more readily than ED to other diagnostic models with an explicit expression; and that HA, RHA, and RP-PWKL balance test validity and item bank usage well, although RP-PWKL requires a threshold on the maximum item exposure rate. Both new methods have some practical value for fixed-length and variable-length CD-CAT.

Cognitive diagnostic computerized adaptive testing (CD-CAT) is a popular mode of online testing for cognitive diagnostic assessment (CDA). The key to a CD-CAT program is its item selection method. Three of the most popular methods for selecting items in CD-CAT are based on Kullback-Leibler (KL) information, Shannon entropy (SHE), and expected discrimination (ED). These methods achieve high test efficiency, but they often lead to unbalanced item usage within a pool. A diagnostic test is usually not a high-stakes test, so item overexposure may not be a major concern. Item underexposure, however, wastes the time and money invested in developing each item, and a high test overlap rate leads to practice effects. The restrictive progressive method (RP-PWKL) and the restrictive threshold method (RT-PWKL) were proposed to balance item exposure control with measurement accuracy, but both suppress overexposure by adding a restriction that keeps the maximum exposure rate under a predetermined value, and the rationale for choosing that value deserves further consideration.

With these considerations in mind, the article proposes two item selection methods for CD-CAT based on the "Deterministic Input, Noisy And Gate" (DINA) model. First, by using KL information as the discrimination function of ED, KLED is proposed so that the approach can handle cognitive diagnostic models other than the DINA model. Second, following the idea of randomization strategies, in which the item is always selected at random from among the most informative items, a randomization halving algorithm (RHA) is proposed. For RHA, all items within the specified range are available for selection, rather than an arbitrary number of items or only one. Moreover, we show the connection among KLED, HA, and RHA: KLED can be regarded as a weighted HA method, weighted by the corresponding item parameters, and HA can be regarded as RHA without the random component between different item attribute vectors in the Q matrix of the item pool.

Two simulation studies are then carried out, one using a simulated item bank and the other based on items calibrated from real data. Eight item selection strategies are considered: random, posterior-weighted KL (PWKL), RP-PWKL, RT-PWKL, ED, the halving algorithm (HA), KLED, and RHA. In addition, VRP-PWKL and VRT-PWKL are proposed for variable-length CD-CAT as extended versions of RP-PWKL and RT-PWKL. Simulations for fixed-length and variable-length CD-CAT are conducted with the eight methods, and the results are compared in terms of the pattern or attribute correct classification rate, error classification rate, item exposure rate, and test overlap rate. The results show that RHA, HA, RP-PWKL, VRP-PWKL, and VRT-PWKL use the item bank in a more balanced way at the cost of a slight decrease in the correct classification rate of knowledge states, and that RHA, HA, VRP-PWKL, and VRT-PWKL can be used for variable-length CD-CAT. Although the simulation results are encouraging, further studies of CD-CAT are suggested, for example with different cognitive diagnostic models.
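The abstract names the halving algorithm and its randomized variant but gives no formulas. As a rough illustration only, the Python sketch below assumes the standard DINA response function and one common form of a halving index: the posterior mass of knowledge states whose ideal response to an item is 1, multiplied by its complement, so that an even split scores highest. Ties among the best-scoring items are broken at random. The function names, the tolerance `tol`, the slip and guess values, and the toy Q matrix are all hypothetical and are not taken from the paper; the paper's RHA may define its selection range differently.

```python
import itertools
import random


def dina_eta(alpha, q):
    """Ideal response: 1 if every attribute required by q is mastered in alpha."""
    return int(all(a >= r for a, r in zip(alpha, q)))


def dina_prob(alpha, q, slip, guess):
    """P(correct | alpha) under DINA: 1 - slip if the ideal response is 1, else guess."""
    return 1.0 - slip if dina_eta(alpha, q) else guess


def halving_index(posterior, q):
    """Assumed halving index: p * (1 - p), where p is the posterior mass of
    knowledge states with ideal response 1. It peaks at 0.25 for an even split."""
    p = sum(w for alpha, w in posterior.items() if dina_eta(alpha, q))
    return p * (1.0 - p)


def select_item(posterior, q_matrix, used, rng, tol=1e-9):
    """Pick at random among unused items whose index is within tol of the best
    (an illustrative randomization rule, not the paper's exact RHA)."""
    candidates = [j for j in range(len(q_matrix)) if j not in used]
    scores = {j: halving_index(posterior, q_matrix[j]) for j in candidates}
    best = max(scores.values())
    top = [j for j in candidates if best - scores[j] <= tol]
    return rng.choice(top)


def update_posterior(posterior, q, slip, guess, response):
    """Bayes update of the knowledge-state posterior after one scored response."""
    weighted = {}
    for alpha, w in posterior.items():
        p = dina_prob(alpha, q, slip, guess)
        weighted[alpha] = w * (p if response == 1 else 1.0 - p)
    total = sum(weighted.values())
    return {alpha: w / total for alpha, w in weighted.items()}


# Toy setup: 2 attributes, uniform prior over the 4 knowledge states.
states = list(itertools.product((0, 1), repeat=2))
posterior = {alpha: 1.0 / len(states) for alpha in states}
q_matrix = [(1, 0), (0, 1), (1, 1), (1, 0)]  # hypothetical item bank
rng = random.Random(2014)

first = select_item(posterior, q_matrix, set(), rng)
posterior = update_posterior(posterior, q_matrix[first], slip=0.1, guess=0.2, response=1)
print(first, posterior)
```

With the uniform prior in this toy bank, the single-attribute items split the posterior evenly and tie for the best score, so random tie-breaking spreads usage across them instead of always choosing the same item. That spreading is the intuition behind the more balanced item bank usage the abstract reports.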

Authors: 汪文义 (Wang Wenyi), 丁树良 (Ding Shuliang), 宋丽红 (Song Lihong)

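Affiliations: College of Computer Information Engineering, Jiangxi Normal University; College of Elementary Education, Jiangxi Normal University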

Source: 《心理科学》 (Journal of Psychological Science), indexed in CSSCI, CSCD, and the Peking University Core list; 2014, No. 1, pp. 212-216 (5 pages)

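Funding: National Natural Science Foundation of China (30860084, 31160203, 31100756, 31360237); National Social Science Foundation of China (12BYY055); National Education Science Planning Project (CCA110109); Ministry of Education Humanities and Social Sciences Project (09JJCXLX012, 10YJCXLX049, 11YJC190002); Ministry of Education Humanities and Social Sciences Research Youth Fund (13YJC880060); Jiangxi Province Social Science Research "Twelfth Five-Year" (2012) Planning Project (12JY07); Jiangxi Province Education Science 2013 General Project (13yB032); Jiangxi Provincial Department of Education Science and Technology Program (GJJ11385, GJJ10238, GJJ13207, GJJ13226, GJJ13208); National Education Examinations Research Planning Project (2009JKS2009); Specialized Research Fund for the Doctoral Program of Higher Education (20113604110001); Jiangxi Normal University Youth Growth Fund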

Keywords: CD-CAT (cognitive diagnostic computerized adaptive testing), item selection methods, item bank usage, halving algorithm

Classification: B841 [Philosophy and Religion: Basic Psychology]; TP391.72 [Automation and Computer Technology: Computer Application Technology]
