The Use of Response Time in Item Selection of Computerized Adaptive Testing (结合题目作答时间的计算机化自适应测验选题方法). Cited by: 3.

Abstract: Computer-based tests make it possible to record each examinee's item response time (RT). As an important source of auxiliary information, RT has substantial value for test development and administration, especially in the field of computerized adaptive testing (CAT). This paper briefly reviews and comments on the applications of RT in CAT item selection and analyzes the practical feasibility of these techniques. Finally, it discusses the problems that remain in applying RT to CAT item selection and directions for further research.

The computer-based test enables the examinee's response time (RT) to be recorded accurately. As an important source of auxiliary information, RT has important potential value for test development and management, especially in the field of Computerized Adaptive Testing (CAT). With the collection of RT, the CAT assessment process can be further improved in terms of precision, fairness, and cost. It is widely known that item selection is the key step of CAT, the step that reflects its "adaptive" character. Traditional CAT item selection algorithms do not consider RT information; this is unfavorable for test management and may lead to biased assessment results. This paper briefly and systematically introduces the application of RT to item selection in CAT and analyzes the feasibility of these techniques in practice, so that readers gain a clear understanding of the potential value of RT in CAT. Since item selection in CAT is based on the examinee's ability estimate (except for the selection of the initial items), improving ability estimation can also be regarded as an indirect improvement of item selection. This paper therefore divides the relevant methods into two categories: (1) indirect improvement of item selection through RT-assisted ability estimation, and (2) direct improvement of item selection through RT-based item selection criteria.

In general, most tests are a mixture of speed and power components, and RT carries information about both examinee ability and item characteristics. Over the past decades, many models for RT and response accuracy (RA) have been proposed (e.g., Thissen, 1983; Wang & Hanson, 2005; van der Linden, 2007), which makes it possible to use RT to improve the accuracy of ability estimation in CAT and, in turn, to improve item selection (van der Linden, 2008). Examinees with the same ability level may need different amounts of time to complete an item (van der Linden et al., 1999), and an examinee's response times may also differ across items because some items are more time-consuming than others (Veldkamp, 2016). As a result, examinees take different amounts of time to complete a test. However, most standardized tests impose a fixed time limit for practical administration purposes; when candidates are under time pressure, they may increase their response speed at the expense of accuracy (Entink et al., 2009), which leads to biased ability estimation. For a test whose main goal is to evaluate ability, it is therefore necessary to eliminate the influence of the speed factor, which is also more consistent with the unidimensionality assumption of IRT. Conventional item selection methods do not take this into account, and RT information should be introduced into the item selection process to address this problem (van der Linden et al., 1999; Fan et al., 2012). With the development of measurement theory and technology, researchers hope to obtain richer diagnostic information about an examinee from a test, rather than a single location on an abstract scale, and the use of RT is a promising way to achieve this.
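The two categories described in the abstract can be made concrete with the modeling tradition the paper cites: van der Linden's (2007) lognormal model treats log response times as normally distributed around an item's "time intensity" minus the examinee's "speed", and RT-aware selection rules such as the maximum information per time unit criterion discussed by Fan et al. (2012) weigh an item's Fisher information against its expected time cost. The sketch below is only an illustration of that idea under assumed parameter names, pool structure, and values; it is not the authors' implementation, and the function and variable names (select_item, tau_hat, etc.) are hypothetical.

Illustrative sketch (Python):

import numpy as np

# Illustrative only: a 2PL response model plus a lognormal response-time model
# (in the spirit of van der Linden, 2007), combined into a selection rule that
# maximizes Fisher information per expected second (in the spirit of Fan et al., 2012).

def fisher_information_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability level theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def expected_rt_lognormal(tau, alpha, beta):
    """Expected response time under the lognormal RT model:
    ln T ~ N(beta - tau, 1/alpha^2), hence E[T] = exp(beta - tau + 1/(2*alpha^2))."""
    return np.exp(beta - tau + 1.0 / (2.0 * alpha**2))

def select_item(theta_hat, tau_hat, pool, administered):
    """Return the index of the unadministered item that maximizes information
    per expected second; `pool` holds arrays 'a', 'b' (IRT) and 'alpha', 'beta' (RT).
    All names here are assumptions made for this example."""
    info = fisher_information_2pl(theta_hat, pool["a"], pool["b"])
    e_rt = expected_rt_lognormal(tau_hat, pool["alpha"], pool["beta"])
    criterion = info / e_rt                  # information per expected time unit
    criterion[list(administered)] = -np.inf  # never reselect administered items
    return int(np.argmax(criterion))

# Minimal usage example with a small simulated item pool.
rng = np.random.default_rng(0)
pool = {
    "a": rng.uniform(0.8, 2.0, 50),      # discrimination
    "b": rng.normal(0.0, 1.0, 50),       # difficulty
    "alpha": rng.uniform(1.5, 2.5, 50),  # time discrimination
    "beta": rng.normal(4.0, 0.5, 50),    # time intensity (log seconds)
}
print(select_item(theta_hat=0.2, tau_hat=0.1, pool=pool, administered={3, 17}))

Under a rule of this kind, items that are informative but very time-consuming are penalized, which is one way RT can directly shape item selection (the paper's second category); the estimated speed parameter could likewise be folded into the ability update itself, corresponding to the indirect category.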
Authors: Guo Zhichen (郭治辰); Wang Daxun (汪大勋); Cai Yan (蔡艳); Tu Dongbo (涂冬波). School of Psychology, Jiangxi Normal University, Nanchang, 330022.
Source: Journal of Psychological Science (《心理科学》; CSSCI, CSCD, Peking University Core Journal), 2021, No. 5, pp. 1241-1248 (8 pages).
Funding: Supported by the National Natural Science Foundation of China (31960186, 31760288, 31660278).
Keywords: computerized adaptive testing; item response time; ability estimation; item selection method; item exposure; test time

References: 4

Secondary references: 27

  • 1 Azzalini, A. (1985). A class of distributions which includes the normal ones. Scandinavian Journal of Statistics, 12, 171-178.
  • 2 Baker, F. B., & Kim, S. H. (2004). Item response theory: Parameter estimation techniques. New York: Marcel Dekker.
  • 3 Kang, T., & Cohen, A. S. (2007). IRT model selection methods for dichotomous items. Applied Psychological Measurement, 31, 331-358.
  • 4 Klein Entink, R. H., Fox, J. P., & van der Linden, W. J. (2009). A multivariate multilevel approach to the modeling of accuracy and speed of test takers. Psychometrika, 74, 21-48.
  • 5 Klein Entink, R. H., van der Linden, W. J., & Fox, J. P. (2009). A Box-Cox normal model for response times. British Journal of Mathematical and Statistical Psychology, 62, 621-640.
  • 6 Lee, Y. H., & Chen, H. (2011). A review of recent response-time analyses in educational testing. Psychological Test and Assessment Modeling, 53, 359-379.
  • 7 Li, F. M., Cohen, A. S., Kim, S. H., & Cho, S. J. (2009). Model selection methods for mixture dichotomous IRT models. Applied Psychological Measurement, 33, 353-373.
  • 8 Loeys, T., Rosseel, Y., & Baetens, K. (2011). A joint modeling approach for reaction time and accuracy in psycholinguistic experiments. Psychometrika, 76, 487-503.
  • 9 Meyer, J. P. (2010). A mixture Rasch model with item response time components. Applied Psychological Measurement, 34, 521-538.
  • 10 Meng, X. B., Tao, J., & Chang, H. H. (2015). A conditional joint modeling approach for locally dependent item responses and response times. Journal of Educational Measurement, 52, 1-27.
