摘要
在文献回顾和参考外军有关资料的基础上,根据项目反应理论和空间能力测验的有关理论编制试题库。首先采用纸笔测验的形式进行预实验,探讨采用IRT理论编制CAT拼图测验的可行性。然后,在预实验的基础上对试题进行修订并扩充试题数量,编制计算机辅助测验。选择三参数Logistic模型,采用铆题等值设计,分7份不同的试卷在全国征兵心理检测的过程中对55777名应征公民进行施测。根据测试结果,对题目进行分析,选择高质量的题目构成CAT试题库,采用a系数分层抽样的方法控制曝光率,并采用不同的测验终止策略编制CAT拼图测验。最后用WAIS智力测验积木分测验和三门功课的考试成绩为效标,通过72名被试对CAT拼图测验进行效度验证。结果显示该测验符合项目反应理论三参数Logistic模型的假设,各题目参数比较理想,所编制的测验具有较好的信度和效度,可用于应征公民心理选拔的实践。
According to the literature review and references of psychological testing about foreign armies, a spatial ability item bank was developed based on the principles of item response theory (IRT) and theories about spatial abilities. At first, we conducted a Paper and Pencil (P&P) test as a pilot study to explore the possibility of developing a picture assembling test using computerized adaptive testing (CAT) form. And correlation of this P&P test and task performance subscale of Soldier Performance Assessment Scale was analyzed. Then at the basis of the pilot study, a computer-administrated test (as a part of Nationwide Psychological Testing for Recruitment) was completed to modify and enlarge the item bank further. In this test, 7 different test forms, which were linked using anchor items, were developed and tested. Data was analyzed under 3 Parameter Logistic Model (3PL) using Bilog-MG software. Qualified items were selected to form the item bank, and a CAT form picture assembling test was developed. At last, we conducted validation test using subtest of block design in WAIS and scores of three curriculums as criterion. From the results, we could see that the CAT form picture assembling test could satisfy the three assumptions of 3PL model, including unidimensionality, local independence and no speeding. And discrimination, location and guessing parameters, which were estimated using marginal maximum likelihood estimation (MMLE), were satisfying. So were ability parameters, estimated using Bayes expected a posterior estimation (EAPE). Furthermore, both test's and items'information functions were good. On the other side, item exposure can be controlled properly by strategy of using maximum information procedure and a-stratified method in this test. And on the basis of current item bank, using fixed-length stopping rule is appropriate for Nationwide Psychological Test for Recruitment. As for the validation study, we found that results of P&P form of picture assembling test had significant positive correlations with task performance subscale of Soldier Performance Assessment Scale. Scores of CAT form had significant positive correlations with subtest of block design in WAIS and examination scores of math and physics, but had no significant correlation with scores of Chinese. However, a good CAT test need a good and large item bank to be based on, so the item bank in this study should be enlarged continuously.
出处
《心理学报》
CSSCI
CSCD
北大核心
2009年第2期167-174,共8页
Acta Psychologica Sinica
基金
中国博士后科学基金资助课题(基金号20080431368)