可修改答案的认知诊断计算机化自适应测验研究被引量：2

The Research of Reviewable CD-CAT

下载PDF

导出

摘要允许修改答案的认知诊断计算机化自适应测验(Reviewable Cognitive Diagnostic Computerized Adaptive Testing,RCD-CAT),有利于更准确诊断被试的知识状态,题目口袋法(Item Pocket,IP)为被试提供了缓存作答并修改的机会,改进的题目口袋法(Modified IP,MIP)对IP内修改的题目重新计分。模拟研究比较了IP、MIP、stocking Ⅰ和stocking Ⅱ在RCD-CAT效果,结果发现:stocking设计的效果最优,其中stocking Ⅱ的效果略优于stocking Ⅰ,IP法和MIP法判准率要低于传统CD-CAT,stocking设计在RCD-CAT具有较好的应用前景。 Combining cognitive diagnosis with computerized adaptive testing, cognitive diagnostic computerized adaptive testing （CD-CAT） aims to more efficiently and accurately diagnose examinees＇ mastery status of a group of discretely defined skills, or attributes than paper ＆ pencil tests. While it is a natural thing for examinees to review their answers and possibly change them in paper and pencil-based tests, the same thing is less common to happen in CD-CAT since it could deteriorate the measurement efficiency. The absence of review opportunities on operational CD-CAT creates a dilemma for test developers as examinees need to review and change answers during the test in order to achieve more accurate estimates of their true ability. Item Pocket （Han, 2013） is a method of reviewable computerized adaptive testing （RCAT）. This method provides test takers with Item Pocket （IP） into which they can place items for later review and response change. Test takers can skip answering items by putting them in the IP, but the shortcoming of IP is that the capacity is not easy to control, if the capacity is too large that will result a comparatively large estimation error. Based on IP method, the study proposes a new IP method called modified IP （MIP）, employing a new scoring method in IP. Compared with IP, stocking （1997） design causes greater restrictions for examinee behavior. In stocking design I, examinees are instructed in advance that they will be permitted to revise answers to fixed number of items, under stocking design II, the testing is divided into separately sections and examinees are informed in advance of testing that they will be permitted to revise answers to items only within a section. The advantage of design II is that it simultaneously restricts examinee control over the actual item presented because revised responses from previous sections influence the section of items in subsequent sections. CD-CAT is a further development of the CAT, but they are very different in some ways. In order to verify the above methods in Reviewable CD-CAT （RCD-CAT）, two Monte Carlo simulation studies with different experimental conditions were conducted here, the interim and final states of knowledge were estimated using the maximum likelihood estimation （MLE） method, a group of 5,000 examinees were simulated for this study, and the tests were then created from an item pool of 300 items. These experimental conditions were cognitive diagnosis model （DINA and R-RUM）, the number of attributes （5 and 7）, item selection strategies （KL, PWKL, HKL and MPWKL）, and the fixed test length CD-CAT （10 and 20 items respectively）. Monte Carlo simulation results s^aowed that：（1）When using the DINA model, MIP and IP methods had very similar classification accuracy, however, while using R-RUM model MIP method had higher classification accuracy than IP method. Furthermore, both MIP and IP had low classification accuracy than traditional CD-CAT; （2）Stocking design had a higher classification accuracy than the other methods in all simulations, and stocking design II was slightly better than the stocking design I. In a word, RCD-CAT is more consistent with traditional examination habits, in addition, it can also improve classification accuracy. This study will help to provide theory and method support for future research and practical application.

作者高旭亮汪大勋韩雨婷蔡艳涂冬波

机构地区江西师范大学心理学院

出处《心理科学》 CSSCI CSCD 北大核心 2017年第3期721-727,共7页 Journal of Psychological Science

基金国家自然科学基金(31660278 31300876 31100756 31360237) 江西省高校人文社科项目(XL1507 XL1508) 东北师范大学应用统计教育部重点实验室开放课题(KLAS130028614) 武汉市卫计委支撑课题(WG16C0)的资助

关键词 CD-CAT 可修改答案题目口袋 stocking设计 CD-CAT, answer change, item pocket, stocking design

分类号 B841 [哲学宗教—基础心理学]

引文网络
相关文献

同被引文献12

1汪玲玲,陈平,辛涛,衷克定.基于BP神经网络的认知诊断计算机化自适应测验实现[J].北京师范大学学报（自然科学版）,2015,51(2):206-211. 被引量：8
2陈平,李珍,辛涛.认知诊断计算机化自适应测验的题库使用均匀性初探[J].心理与行为研究,2011,9(2):125-132. 被引量：18
3汪文义,丁树良,游晓锋.计算机化自适应诊断测验中原始题的属性标定[J].心理学报,2011,43(8):964-976. 被引量：32
4余丹,潘奕娆,丁树良,杨庆红.计算机化自适应诊断测验新的选题策略[J].江西师范大学学报（自然科学版）,2011,35(5):548-550. 被引量：7
5涂冬波,蔡艳,戴海琦.基于DINA模型的Q矩阵修正方法[J].心理学报,2012,44(4):558-568. 被引量：40
6涂冬波,蔡艳,戴海琦.认知诊断CAT选题策略及初始题选取方法[J].心理科学,2013,36(2):469-474. 被引量：15
7辛涛,刘拓.认知诊断计算机自适应测验中选题策略的新进展[J].南京师大学报（社会科学版）,2013(6):81-87. 被引量：3
8毛秀珍,辛涛.多维计算机化自适应测验:模型、技术和方法[J].心理科学进展,2015,23(5):907-918. 被引量：12
9戴步云,张敏强,焦璨,黎光明,朱华伟,张文怡.基于CD-CAT的多策略RRUM模型及其选题方法开发[J].心理学报,2015,47(12):1511-1519. 被引量：9
10郭磊,郑蝉金,边玉芳,宋乃庆,夏凌翔.认知诊断计算机化自适应测验中新的选题策略:结合项目区分度指标[J].心理学报,2016,48(7):903-914. 被引量：14

引证文献2

1孙小坚,王钰彤,张世夷,辛涛.认知诊断计算机自适应测验中平衡属性收敛的新方法[J].心理科学,2019,42(5):1236-1244. 被引量：4
2高旭亮,汪大勋,蔡艳,涂冬波.基于混合模型(Mixed-CDMs)视角的CD-CAT及其应用研究[J].心理科学,2019,42(1):194-201. 被引量：4

二级引证文献8

1董艳云,马晓梅,孟亚茹.混合模型在英语听力诊断测评中的应用——基于Mixed-CDMs与G-DINA模型的对比分析[J].现代教育技术,2020,30(3):52-58. 被引量：3
2唐倩,毛秀珍,何明霜,何洁.认知诊断计算机化自适应测验的选题策略[J].心理科学进展,2020,28(12):2160-2168. 被引量：3
3罗芬,王晓庆,蔡艳,涂冬波.基于Gini指数的认知诊断计算机化自适应选题策略[J].心理科学,2021,44(2):440-448. 被引量：2
4陈海文,泰中华.认知诊断评估研究述评:现状与展望[J].大众标准化,2021(6):55-57.
5孙小坚,郭磊.考虑题目选项信息的非参数认知诊断计算机自适应测验[J].心理学报,2022,54(9):1141-1154.
6孙小坚,刘彦楼,王诗梦,辛涛,宋乃庆,周蔓.认知诊断测验中基于信息矩阵的多群组DIF检验[J].心理科学,2022,45(3):710-717.
7唐小娟,丁树良,俞宗火.题目属性向量平衡策略的认知诊断测验设计[J].心理科学,2022,45(6):1466-1474. 被引量：2
8李心钰,陆宏.教育评价理念的演变及测评工具的审视与应用[J].现代教育技术,2023,33(4):74-82. 被引量：4

1余显文,向荣尧.整合资源实现IP远教资源的最佳效益[J].中国教育技术装备,2005(10):55-56.
2群群.智能大闯关[J].阅读,2011(1):66-73.
3雪莲.苏州天宫信息技术有限公司[J].东方文化周刊,2016,0(42):48-48.
4钱铮,钱雪菲.大学生成长服务平台使用情况的调研及对策——以江苏理工学院为例[J].苏州教育学院学报,2016,33(6):103-105. 被引量：1
5北师大版计分表[J].读写月报（新教育）,2009(2):44-44.
6阿紫.你是坚强的孩子吗[J].初中生辅导,2006(7):42-44.
7你会不会交朋友?[J].大舞台,2001(13):64-65.
8刘若娟.奇怪的考试制度[J].课堂内外（小学版）,2010(11):7-7.
9苏教版计分表[J].读写月报（新教育）,2009(2):20-20.
10人教版计分表[J].读写月报（新教育）,2009(2):66-66.

心理科学

2017年第3期

浏览历史

内容加载中请稍等...

可修改答案的认知诊断计算机化自适应测验研究被引量：2

同被引文献12

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

可修改答案的认知诊断计算机化自适应测验研究 被引量：2

同被引文献12

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

可修改答案的认知诊断计算机化自适应测验研究被引量：2