期刊文献+

基于基尼指数的双目标CD-CAT选题策略 被引量:6

A new dual-objective CD-CAT item selection method based on the Gini index
下载PDF
导出
摘要 双目标CD-CAT的测验结果既可用于形成性评估也可用于终结性评估。基尼指数可度量随机变量的不确定性程度,值越小则随机变量的不确定程度越低。本文用基尼指数度量被试知识状态类别以及能力估计置信区间后验概率的变化,提出基于基尼指数的选题策略。Monte Carlo实验表明与已有的选题策略相比,新策略的知识状态分类精度和能力估计精度都较高,同时能有效兼顾题库利用均匀性,并能快速实时响应,且受认知诊断模型和被试知识状态分布的影响较小,可用于实际测验中含多种认知诊断模型的混合题库。 Existing literature has shown that dual-objective CD-CAT testing can facilitate the achievement of measurement objectives for both formative and summative assessments.And the Gini Index can be used as a measurement for the degree of uncertainty of random variables since a smaller Gini value indicates a lower degree of uncertainty.Hence,this paper proposed a Gini-Index-based selection method for dual-objective CD-CAT,and it measured the changes in the posterior probability of knowledge state and confidence interval for latent traits estimation.By adopting the Bayesian Decision Theory,the potential information of participants could be detected based on participants’responses and changes in posterior probability distribution of two the random variables.Monte Carlo Simulation was used to test the performances of the selection method based on Gini,ASI,IPA and JSD,respectively.The item banks measured 5 attributes consisting of 250 items in total,and each item measured 3 attributes at most.The true knowledge state of each participant was generated by HO-CDM and Multivariate Normal Models(both means were 0 and covariance coefficient was 0.8 and 0.2,respectively).G-DINA,DINA and R-RUM were adopted as the cognitive diagnostic models and the item bank of each of these three models included both CDM and 2PL parameters.Specifically,CDM parameters were generated by a G-DINA package in R software with the slipping and guessing parameters randomly selected from uniform distribution in a range from 0.05 to 0.25.The 2PL parameters were estimated by factoring in the responses elicited from 3,000 participants’responses to all items in item banks using the mirt package.Four indexes,namely the pattern match ratio,root mean square error of latent trait,chi-square value and time needed for item selection,were adopted in comparing the efficiency of different item selection methods.The value for each index was the mean of 10 repeated simulations of 1,000 participants’responses to all item bank.The results showed that(1)The Gini and IPA selection methods had similar performance in terms of pattern match ratio,root mean square error of latent trait and chi-square value.Both methods were high in precision measurement and low in sensitivity to CDM and the distribution of participants’cognitive patterns,making both methods applicable to the item banks featuring a mixture of cognitive diagnosis models.By comparison,the Gini method outperformed slightly the IPA method in pattern match ratio and time needed for item selection in which the Gini method was only one-tenth that of the IPA method;(2)Both the Gini and ASI selection methods were weighted linear combination approaches.The performances of the two methods were very close in the short test.In the long test,however,although time needed for item selection using the ASI method was only one-third that of the Gini method,the latter was superior to the former in terms of measurement accuracy and chi-square value;(3)Although the JSD method outperformed the Gini method in terms of uniformity of item bank usage and time needed for item selection,its measurement accuracy was far less than the latter.To summarize,the Gini,IPA and ASI selection methods all have good measurement accuracy and hence are all recommended for short tests.For medium and long tests with a limited number of attributes and a smaller item bank,the Gini and IPA selection methods are recommended.As the number of attributes and item bank size grow,the Gini method is recommended.When there are high correlations among different attributes,as well as a large number of attributes and big item bank size,the ASI and JSD selection methods are recommended with the ASI method slightly outperforming the JSD method in measurement accuracy.
作者 罗芬 王晓庆 蔡艳 涂冬波 LUO Fen;WANG Xiaoqing;CAI Yan;TU Dongbo(School of Psychology,Jiangxi Normal University,Nanchang 330022,China;College of Computer Information Engineering,Jiangxi Normal University,Nanchang 330022,China)
出处 《心理学报》 CSSCI CSCD 北大核心 2020年第12期1452-1465,共14页 Acta Psychologica Sinica
基金 国家自然科学基金(61967009,31660278,31760288,31960186) 江西省教育厅科学技术研究项目(GJJ150356,GJJ160282)资助。
关键词 认知诊断 项目反应理论 基尼指数 双目标CD-CAT 选题策略 cognitive diagnostic items response theory Gini index dual objective CD-CAT selection method
  • 相关文献

参考文献5

二级参考文献59

  • 1陈平,丁树良,林海菁,周婕.等级反应模型下计算机化自适应测验选题策略[J].心理学报,2006,38(3):461-467. 被引量:38
  • 2林海菁,丁树良.具有认知诊断功能的计算机化自适应测验的研究与实现[J].心理学报,2007,39(4):747-753. 被引量:21
  • 3Cheng Y. When cognitive diagnosis meets computerized adaptive testing. Psychometrika, 2009, 74(4) : 619-632.
  • 4Cheng Y, Chang H. The modified maximum global discrimination index method for cognitive diagnostic computerized adaptive testing. Paper presented at the 2007 GMAC Conference on Computerized Adaptive Testing, McLean, USA, 2007.
  • 5McGlohen, M. K. The application of cognitive diagnosis and computerized adaptive testing to a large-scale assessment. Un- published doctoral thesis, University of Texas at Austin, 2004.
  • 6McGlohen M K, Chang H. Combining computer adaptive testing technology with cognitively diagnostic assessment. Behavioral Research Methods, 2008, 40 (3) : 808-821.
  • 7Tatsuoka K K, Tatsuoka M M. Computerized cognitive diagnostic adaptive testing:. Effect on remedial instruction as empirical vali- dation. Journal of Educational Measurement, 1997, 34 (1) : 3-20.
  • 8Xu X, Chang H, Douglas J. A simulation study to compare CAT strategies for cognitive diagnosis. Paper presented at the annual meeting of National Council on Measurement in Education, Montreal, Canada, 2003.
  • 9涂东波.项目自动生成的小学儿童数学问题解决认知诊断CAT编制.江西师范大学博士学位论文.2009.
  • 10文剑冰.规则空间模型在诊断性计算机化自适应测验中的应用.香港中文大学博士学位论文.2003.

共引文献50

同被引文献47

引证文献6

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部