This study examined how to interpret item discriminations that differ between the individual and cluster levels, focusing on the patterns and magnitudes of item discriminations under a 2PL multilevel IRT model across a variety of simulation conditions. The consistency between the mean of the individual-level ability estimates and the cluster-level ability estimates was evaluated by their correlation. The two were highly correlated when the pattern of item discriminations was the same at the individual and cluster levels. The magnitudes of the item discriminations themselves had little effect on the correlations as long as the patterns were the same at the two levels. However, the correlation was lower when the patterns of item discriminations differed between the individual and cluster levels. In that case, the mean of the estimated individual-level abilities was not necessarily a good representation of the cluster-level ability.
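To make "different discriminations at the two levels" concrete, here is a minimal sketch of one common way to write a two-level 2PL response probability. The parameterization (separate within-cluster and between-cluster discriminations sharing a single intercept) is an assumption for illustration, not necessarily the exact model used in the study, and the function name and all numeric values are hypothetical.

```python
import math


def p_correct(theta_within, theta_between, a_within, a_between, b):
    """Probability of a correct response under an ASSUMED two-level 2PL form.

    logit = a_within * theta_within + a_between * theta_between - b
    theta_within  : individual deviation from the cluster mean ability
    theta_between : cluster-level ability
    a_within, a_between : item discriminations at the two levels
    b : item intercept (difficulty-like) parameter
    """
    logit = a_within * theta_within + a_between * theta_between - b
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical item: more discriminating between clusters than within them.
p_average = p_correct(0.0, 0.0, a_within=0.8, a_between=1.6, b=0.0)
p_strong_cluster = p_correct(0.0, 1.0, a_within=0.8, a_between=1.6, b=0.0)
```

Under this form, an item can separate clusters well while separating individuals within a cluster poorly, or the reverse, which is the situation the abstract describes as differing patterns.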
This paper studies techniques for reducing item exposure by using automatic item generation methods. The known test item calibration method estimates item parameters from statistical data collected during prior testing of examinees. A disadvantage of this calibration method is item exposure, which occurs when test items become familiar to the examinees. To reduce item exposure, an automatic item generation method is used in which item models are constructed from already calibrated test items without losing the already estimated item parameters. A technique for extracting item models from already calibrated, and therefore exposed, test items is described; test item development specialists can use it to integrate automatic item generation principles with existing testing applications.
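The idea of an item model built from a calibrated item can be sketched as a template whose generated instances carry the parent item's parameters. This is only an illustration under stated assumptions: the `ItemModel` class, its fields, and the example stem are hypothetical, and the assumption that every generated instance inherits the parent's parameters unchanged is a simplification.

```python
import itertools
from dataclasses import dataclass


@dataclass(frozen=True)
class ItemModel:
    """Hypothetical item model extracted from one calibrated item."""
    stem: str        # template text with named placeholders
    variables: dict  # placeholder name -> list of allowed values
    a: float         # calibrated discrimination of the parent item
    b: float         # calibrated difficulty of the parent item

    def generate(self):
        """Yield one item per combination of placeholder values."""
        names = sorted(self.variables)
        value_lists = [self.variables[name] for name in names]
        for values in itertools.product(*value_lists):
            binding = dict(zip(names, values))
            # Assumption: generated items keep the parent's parameters.
            yield {"text": self.stem.format(**binding), "a": self.a, "b": self.b}


model = ItemModel(
    stem="What is {x} + {y}?",
    variables={"x": [2, 3], "y": [5, 7]},
    a=1.1,
    b=-0.4,
)
items = list(model.generate())
```

Each generated item has different surface text, so examinees are less likely to have seen it, while the stored parameters remain those estimated for the parent item.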
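Assessing children's early mathematical ability is important for research on the development of mathematical ability. This study revised the Research-Based Early Math Assessment (REMA) and tested its reliability and validity. The participants were 313 children from two kindergartens in Shanghai, and the Rasch model from item response theory was used to examine the reliability and validity of the REMA. The results showed that the REMA had good reliability and an essentially unidimensional ability structure. The Wright map indicated that the scale as a whole suits examinees of medium to high ability. The infit and outfit indices of all items fell between 0.5 and 1.5, consistent with the Rasch model. Early mathematical ability showed moderate to high correlations with mathematics-related approaches to learning (correlation coefficients between 0.34 and 0.61). The study indicates that the REMA has good reliability and validity and is suitable as an effective tool for assessing the mathematical ability of preschool children aged 3 to 6.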
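For readers unfamiliar with the fit indices mentioned above, the following is a minimal sketch of the standard infit and outfit mean-square statistics for a single dichotomous Rasch item. It assumes person abilities and the item difficulty are already estimated; the function names and the sample values are hypothetical, and the study's own software and scoring may differ.

```python
import math


def rasch_p(theta, b):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses, thetas, b):
    """Return (infit, outfit) mean squares for one dichotomous item.

    outfit: unweighted mean of squared standardized residuals
    infit : sum of squared residuals divided by the sum of variances
    """
    sq_resid, variances = [], []
    for x, theta in zip(responses, thetas):
        p = rasch_p(theta, b)
        sq_resid.append((x - p) ** 2)
        variances.append(p * (1.0 - p))
    infit = sum(sq_resid) / sum(variances)
    outfit = sum(r / w for r, w in zip(sq_resid, variances)) / len(sq_resid)
    return infit, outfit


def within_range(value, low=0.5, high=1.5):
    """Check a mean square against the 0.5 to 1.5 range cited in the abstract."""
    return low <= value <= high


# Hypothetical data: four examinees of equal ability, item difficulty 0.
infit, outfit = item_fit([1, 0, 1, 0], [0.0, 0.0, 0.0, 0.0], b=0.0)
```

Values near 1 indicate responses about as variable as the model predicts; the 0.5 to 1.5 bounds are the ones reported in the abstract.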