Testing the test: Using Rasch person scores
by H. P. L. Molloy (Toyo University Faculty of Business Administration)
Abstract

The results of analyzing a set of tests used for measuring classroom progress in EFL reading are reported. Some 353 students in Japanese university English classes took one of four tests in 2008 and 2009. Student ability scores were derived under three analytic situations: all students together, the 271 students from 2008 only, and the 82 students from 2009 scored with items anchored at the levels set the previous year. Student scores differed noticeably across the three analytic situations, showing that item linking is necessary for intergroup comparisons. In other words, if you're not using the same tests, you cannot compare students.

Keywords: language testing, Rasch measurement, test-score correlation, classroom testing, criterion-referenced testing, test-item linking

Table 1. Number of students who took each test on each occasion

Occasion | Test A | Test B | Test C | Test D |
---|---|---|---|---|
Fall 2008 | 70 | 65 | 67 | 69 |
Fall 2009 | 19 | 20 | 19 | 24 |
Total | 89 | 85 | 86 | 93 |

Table 2. Correlation coefficients for test scores under three testing situations for the same students

Class | Year 1-anchored r | Year 1-anchored ρ | Year 1-anchored τ | Year 1-new r | Year 1-new ρ | Year 1-new τ | Anchored-new r | Anchored-new ρ | Anchored-new τ |
---|---|---|---|---|---|---|---|---|---|
Class A (n=39) | 0.95 | 0.96 | 0.86 | 0.97 | 0.95 | 0.85 | 0.90 | 0.90 | 0.75 |
Class B (n=35) | 0.95 | 0.97 | 0.96 | 0.85 | 0.97 | 0.86 | 0.91 | 0.92 | 0.76 |
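
As a rough illustration of how three coefficients like those in Table 2 can be computed for one class under two scoring situations, the following Python sketch uses only the standard library. It assumes that r, ρ, and τ denote the Pearson, Spearman, and Kendall coefficients. The five ability scores are invented for illustration and are not data from the study, and all function and variable names are hypothetical. The Kendall function computes tau-a, which applies no correction for ties; the variant used in the study is not stated.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def kendall_tau_a(x, y):
    """Kendall tau-a: (concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for i, j in combinations(range(len(x)), 2):
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    total_pairs = len(x) * (len(x) - 1) / 2
    return (concordant - discordant) / total_pairs


# Invented ability scores (logits) for five students; NOT data from the study.
year1_scores = [-1.2, -0.4, 0.1, 0.8, 1.5]
anchored_scores = [-1.0, -0.5, 0.6, 0.3, 1.7]

r_value = pearson(year1_scores, anchored_scores)
rho_value = spearman(year1_scores, anchored_scores)
tau_value = kendall_tau_a(year1_scores, anchored_scores)
print(round(r_value, 2), round(rho_value, 2), round(tau_value, 2))
```

In the invented data, two students swap places between the two situations, so the rank-based coefficients fall below 1 even though the scores remain close. This is the same pattern as in Table 2, where τ is often lower than r and ρ for the same pair of situations.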

Table 3. Average absolute deviations of Winsteps ability scores for each class under each combination of testing situations

Class | Year 1-anchored | Year 1-new | Anchored-new |
---|---|---|---|
Class A (n=39) | 0.14 | 0.11 | 0.21 |
Class B (n=35) | 0.13 | 0.10 | 0.21 |
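
One plausible reading of the statistic in Table 3 is the mean, over students, of the absolute difference between each student's ability score under two scoring situations. The Python sketch below works under that assumption; the exact formula used in the study is not given here. The scores are invented for illustration, and the function name is hypothetical.

```python
def mean_absolute_difference(scores_a, scores_b):
    """Mean of |a - b| over paired scores for the same students."""
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("score lists must be non-empty and of equal length")
    total = sum(abs(a - b) for a, b in zip(scores_a, scores_b))
    return total / len(scores_a)


# Invented ability scores (logits) for five students; NOT data from the study.
year1_only = [-1.2, -0.4, 0.1, 0.8, 1.5]
anchored = [-1.0, -0.5, 0.3, 0.6, 1.7]

deviation = mean_absolute_difference(year1_only, anchored)
print(round(deviation, 2))
```

Unlike a correlation coefficient, this statistic is expressed in the same units as the ability scores, so it shows how far an individual student's score typically moves between situations even when the rank order of students barely changes.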