Sequential chemical extraction procedure has been widely used to partition particulate trace metals into various fractions and to describe the distribution and the statue of trace metals in geo environment. One sequen...Sequential chemical extraction procedure has been widely used to partition particulate trace metals into various fractions and to describe the distribution and the statue of trace metals in geo environment. One sequential chemical extraction procedure was employed here to partition various fractions of Mn in soils. The experiment was designed with quality controlling concept in order to show sampling and analytical error. Experimental results obtained on duplicate analysis of all soil samples demonstrated that the precision was less than 10%(at 95%confidence level). The accuracy was estimated by comparing the accepted total concentration of Mn in standard reference materials (SRMs) with the measured sum of the individual fractions. The recovery of Mn from SRM1 and SRM2 was 94.1%and 98.4%, respectively. The detection limit, accuracy and precision of the sequential chemical extraction procedure were discussed in detailed. All the results suggest that the trueness of the analytical method is satisfactory.展开更多
As direct measure of learners' communicative language ability, performance assessment (typically writing and speaking assessment) claims construct validity and a strong power for predictive utility of test scores....As direct measure of learners' communicative language ability, performance assessment (typically writing and speaking assessment) claims construct validity and a strong power for predictive utility of test scores. However, it is also of common concern that the subjectivity of rating process and the potential unfairness for test takers who encounter different writing prompts and speaking tasks would constitute threats to reliability and validity of test scores, especially in those large-scale and high-stakes tests. Therefore, appropriate means for quality control of subjective scoring should be held essential in test administration and validation. Based upon raw scores from one administration of speaking test in PETS Band3 held in Hangzhou, the present study investigates and models possible sources of score variability within the framework of Many-Facet Rasch Model (MFRM). MFRM conceptualizes the possibility of a examinee being awarded a certain score as a function of several facets — examinee ability, rater severity, domain difficulty and step difficulty between the adjacent score categories and provides estimates of the extent to which the examinee's test score is influenced by those facets. Model construction and data analysis was carried out in FACETS Version 3.58, computer program for conducting MFRM analysis. The results demonstrate statistically significant differences within each facet. Despite the generally acceptable rater consistency across examinees and rating domains, fit statistics indicate some unexpected rating patterns in certain raters such as inconsistency and central tendency, to be avoided through future rater training. Fair scores for each examinee are also provided, minimizing the variability due to facets other than examinees' ability. MFRM manifests itself as effective in detecting whether each test method facet functions as intended in performance assessment and providing useful feedback for quality control of subjective scoring.展开更多
文摘Sequential chemical extraction procedure has been widely used to partition particulate trace metals into various fractions and to describe the distribution and the statue of trace metals in geo environment. One sequential chemical extraction procedure was employed here to partition various fractions of Mn in soils. The experiment was designed with quality controlling concept in order to show sampling and analytical error. Experimental results obtained on duplicate analysis of all soil samples demonstrated that the precision was less than 10%(at 95%confidence level). The accuracy was estimated by comparing the accepted total concentration of Mn in standard reference materials (SRMs) with the measured sum of the individual fractions. The recovery of Mn from SRM1 and SRM2 was 94.1%and 98.4%, respectively. The detection limit, accuracy and precision of the sequential chemical extraction procedure were discussed in detailed. All the results suggest that the trueness of the analytical method is satisfactory.
基金Educational Measurement Research Project sponsored by 2006 National Education Science Research Plan of National Education Examinations Authority~~
文摘As direct measure of learners' communicative language ability, performance assessment (typically writing and speaking assessment) claims construct validity and a strong power for predictive utility of test scores. However, it is also of common concern that the subjectivity of rating process and the potential unfairness for test takers who encounter different writing prompts and speaking tasks would constitute threats to reliability and validity of test scores, especially in those large-scale and high-stakes tests. Therefore, appropriate means for quality control of subjective scoring should be held essential in test administration and validation. Based upon raw scores from one administration of speaking test in PETS Band3 held in Hangzhou, the present study investigates and models possible sources of score variability within the framework of Many-Facet Rasch Model (MFRM). MFRM conceptualizes the possibility of a examinee being awarded a certain score as a function of several facets — examinee ability, rater severity, domain difficulty and step difficulty between the adjacent score categories and provides estimates of the extent to which the examinee's test score is influenced by those facets. Model construction and data analysis was carried out in FACETS Version 3.58, computer program for conducting MFRM analysis. The results demonstrate statistically significant differences within each facet. Despite the generally acceptable rater consistency across examinees and rating domains, fit statistics indicate some unexpected rating patterns in certain raters such as inconsistency and central tendency, to be avoided through future rater training. Fair scores for each examinee are also provided, minimizing the variability due to facets other than examinees' ability. MFRM manifests itself as effective in detecting whether each test method facet functions as intended in performance assessment and providing useful feedback for quality control of subjective scoring.