Abstract
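Existing item selection strategies in CD-CAT emphasize test efficiency and pay too little attention to item pool usage. To address this problem, two new item selection strategies, KLED and RHA, are introduced on the basis of the DINA model, and HA is also examined in a simulation study. The results show that PWKL and KLED have an advantage only in test efficiency; that stratifying KLED by attribute vector improves item pool usage somewhat, and that KLED generalizes more readily than ED to other diagnostic models that have an explicit expression; and that HA, RHA, and RP-PWKL balance test validity and item pool usage fairly well, although RP-PWKL requires a threshold to be set for the maximum item exposure rate. Both new item selection methods have some practical value in fixed-length and variable-length CD-CAT.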
Cognitive diagnostic computerized adaptive testing (CD-CAT) is a popular mode of online testing for cognitive diagnostic assessment (CDA). The key component of a CD-CAT program is its item selection method. Three of the most popular methods select items on the basis of Kullback-Leibler (KL) information, Shannon entropy (SHE), and expected discrimination (ED). These methods achieve much better test efficiency, but they often lead to unbalanced item usage within a pool. Because a diagnostic test is usually not a high-stakes test, item overexposure may not be a major concern. Item underexposure, however, wastes the time and money invested in developing each item, and a high test overlap rate leads to the effects of intensive practice. Although the restrictive progressive method (RP-PWKL) and the restrictive threshold method (RT-PWKL) have been proposed to balance item exposure control with measurement accuracy, both suppress overexposure by adding a restriction that keeps the maximum exposure rate under a predetermined value, and the rationale for that maximum exposure rate deserves further consideration. In view of these issues, this article proposes two item selection methods for CD-CAT based on the "Deterministic Input, Noisy And Gate" (DINA) model. First, KLED is proposed by using KL information as the discrimination function of ED, so that it can handle cognitive diagnostic models other than the DINA model. Second, following the idea of randomization strategies, in which the item is always selected at random from among the most informative items, the randomization halving algorithm (RHA) is proposed. In RHA, all items within the specified range are available for selection, rather than an arbitrary number of items or only a single item.
Moreover, we show the connections among KLED, HA, and RHA: KLED can be regarded as a weighted HA method, weighted by the corresponding item parameters, and HA can be regarded as RHA without a random component between different item attribute vectors in the Q matrix of the item pool. Two simulation studies are then carried out, one using a simulated item bank and the other using items calibrated from real data. Eight item selection strategies are considered: random selection, posterior-weighted KL (PWKL), RP-PWKL, RT-PWKL, ED, the halving algorithm (HA), KLED, and RHA. In addition, VRP-PWKL and VRT-PWKL are proposed for variable-length CD-CAT as extended versions of RP-PWKL and RT-PWKL. Simulation studies for fixed-length and variable-length CD-CAT are conducted with the eight methods, and the results are compared in terms of the pattern or attribute correct classification rate, error classification rate, item exposure rate, and test overlap rate. The simulation results show that RHA, HA, RP-PWKL, VRP-PWKL, and VRT-PWKL use the item bank in a more balanced way at the cost of a slight decrease in the correct classification rate of knowledge states, and that RHA, HA, VRP-PWKL, and VRT-PWKL can be used for variable-length CD-CAT. Although the simulation results are encouraging, further studies of CD-CAT are needed, for example with different cognitive diagnostic models.
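The relationship between HA and a randomized variant can be illustrated with a minimal Python sketch. It rests on assumptions that are not taken from the article: the halving criterion is read as "choose the item whose mastering patterns carry posterior mass closest to 0.5", and the randomized variant draws uniformly among items within a hypothetical tolerance `tol` of the best score. All function and parameter names are illustrative, and the article's actual RHA range rule and KLED weighting are not reproduced here.

```python
import random

def dina_eta(alpha, q):
    """Ideal response under DINA: 1 if alpha has every attribute q requires."""
    return int(all(a >= r for a, r in zip(alpha, q)))

def dina_prob(alpha, q, slip, guess):
    """P(correct | alpha) under DINA: 1 - slip if eta = 1, otherwise guess."""
    return 1.0 - slip if dina_eta(alpha, q) else guess

def halving_scores(posterior, q_matrix):
    """Per item: |posterior mass of patterns with eta = 1, minus 0.5|."""
    scores = []
    for q in q_matrix:
        mass = sum(p for alpha, p in posterior.items() if dina_eta(alpha, q))
        scores.append(abs(mass - 0.5))
    return scores

def select_ha(posterior, q_matrix, available):
    """Deterministic halving: the available item with the smallest score."""
    scores = halving_scores(posterior, q_matrix)
    return min(available, key=lambda j: scores[j])

def select_randomized(posterior, q_matrix, available, tol, rng):
    """Assumed randomized variant: uniform draw among near-best items.

    `tol` is a hypothetical tolerance, not a parameter from the article.
    """
    scores = halving_scores(posterior, q_matrix)
    best = min(scores[j] for j in available)
    candidates = [j for j in available if scores[j] <= best + tol]
    return rng.choice(candidates)

# Toy example: two attributes, uniform posterior over the four patterns.
posterior = {(0, 0): 0.25, (0, 1): 0.25, (1, 0): 0.25, (1, 1): 0.25}
q_matrix = [(1, 0), (0, 1), (1, 1)]  # attributes required by each item
available = [0, 1, 2]

ha_item = select_ha(posterior, q_matrix, available)
random_item = select_randomized(posterior, q_matrix, available,
                                tol=0.0, rng=random.Random(0))
```

In this toy pool, items 0 and 1 both split the posterior evenly, so the deterministic rule always returns the first of them while the randomized rule can return either one. That is one way a random component can spread exposure across equally informative items. Weighting the scores by slip and guess parameters, as the abstract describes for KLED, is left out of this sketch.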
Source
《心理科学》
CSSCI
CSCD
Peking University Core Journals (北大核心)
2014, No. 1, pp. 212-216 (5 pages)
Journal of Psychological Science
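Funding
National Natural Science Foundation of China (30860084, 31160203, 31100756, 31360237)
National Social Science Foundation of China (12BYY055)
National Education Sciences Planning Project (CCA110109)
Ministry of Education Humanities and Social Sciences Project (09JJCXLX012, 10YJCXLX049, 11YJC190002)
Ministry of Education Humanities and Social Sciences Research Youth Fund Project (13YJC880060)
Jiangxi Provincial Social Science Research "Twelfth Five-Year" (2012) Planning Project (12JY07)
Jiangxi Provincial Education Science 2013 General Project (13yB032)
Jiangxi Provincial Department of Education Science and Technology Program (GJJ11385, GJJ10238, GJJ13207, GJJ13226, GJJ13208)
National Education Examinations Research Planning Project (2009JKS2009)
Specialized Research Fund for the Doctoral Program of Higher Education (20113604110001)
Jiangxi Normal University Youth Growth Fund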