英文摘要 |
The purpose of this study is to survey four ways for computing the composite score of four tests from a test battery, which was simulated using the item parameters taken from a test item banking project. More than ten approaches of constructing composite scores can be found in the literature. Among them, reliability weighting, IRT weighting, principle component analysis and multiple regression were investigated in this study. With the data from Chinese, Math, Science, and Social Science achievement tests, the weights given by raw score and IRT trait score using reliability weighting method are very similar. Treating all items from four subject areas as items of a test and calibrating the trait level with 3PL model, the resulted trait scores has a Pearson correlation coefficient of .969 and .982, respectively, with the principal component score and composite score obtained from traits from that four achievement tests for the fifth grader, and .971 and .990, respectively, for the sixth graders.Finally, both the principal component scores, obtaining from the three subscales from a group intelligence test, and predicted individual intelligence test score correlated highly with the observed score of the individual intelligence test, near .95 for both cases. |