NJ Teacher Evaluation: Math Fail #1
Do the Broad-funded interns at the New Jersey Department of Education understand math?
I ask because their disastrous teacher evaluation proposals, announced with great fanfare last week, betray an embarrassing misunderstanding of the fundamentals of mathematics. It will take a few posts to catalog them all, but let's start with this:
A large portion of a "tested" teacher's evaluation will now include a metric called a "Median Student Growth Percentile," or mSGP. In a previous post, I showed how SGPs are woefully inappropriate for use in teacher evaluation, because they are purely descriptive measures: they do not measure how a teacher contributes to student learning.
But even if we put aside the problems of SGPs, there are still obvious problems with mSGPs; obvious, that is, to anyone with a basic understanding of the difference between a medianand a mean.
Here's how NJDOE describes the use of mSGPs, from the proposed regulations they released last week:
Proposed N.J.A.C. 6A:10-4.2(b) describes which teachers will receive a median student growth percentile, and proposed N.J.A.C. 6A:10-4.2(c) explains how the score will be calculated by the Department. The Department will provide a list of all courses that fall within a standardized-tested grade or subject for the purpose of student growth percentile. For instance, a third grade math class would not be used for student growth percentile because second graders do not take the standardized assessment under the current NJASK schedule and therefore growth from one year to the next cannot be measured. Additionally, teachers must teach a course for at least 60 percent of the time between the start of the year and the time the standardized test is administered and they must have at least 20 students attributed to his or her name through the school district’s course roster data system. If a teacher does not have at least 20 individual student growth percentile scores in a given academic year, up to three years of student data must be used to reach the minimum requirement of 20 students.
Proposed N.J.A.C. 6A:10-4.2(c) explains that the Department will calculate the student growth percentile by finding the median of all students who were enrolled in a course or a group within a course. [emphasis mine]
How would this work? Let's imagine a 4th Grade class of 21 students taught by Ms. Jones. Each of her students is assigned an SGP from 1 to 99 based on the math section of the NJASK-4. Here are their SGPs, in ranked order:
40 40 41 42 46 47 48 49 49 50 50 59 68 78 88 91 92 92 93 95 97
Next door to Ms. Jones is Ms. Smith, who also teaches 21 Fourth Graders. Here are her students' SGPs, again in rank order:
Ms. Smith's Class
2 5 11 14 17 22 23 26 37 44 50 51 52 53 54 55 55 59 59 61 62
Again, let's stay away from the question of whether Ms. Jones is a "better" teacher than Ms. Smith, and instead look at each class's "growth." Which one of these classes "grew" more? Most of us would say Ms. Jones's class did...
But not the NJDOE.
See, our reformy overlords are not interested in judging the classes teachers by their average SGPs, or the mean. They want to judge the classes teachers by the median: the SGP that the middle child in the rank order received. What would that be?
Ms. Jones's Class
40 40 41 42 46 47 48 49 49 50 50 59 68 78 88 91 92 92 93 95 97
Ms. Smith's Class
2 5 11 14 17 22 23 26 37 44 50 51 52 53 54 55 55 59 59 61 62
I highlighted the median score for each class; they are the same. Even though Ms. Jones's class had much greater average or mean growth than Ms. Smith's, they had the same median growth.
Let's graph it:
Ms. Jones class is blue; Ms. Smith's is red. Notice that the lower half of Jones's class is clustered just below 50, while the upper half of Smith's class is clustered just above 50.
Is this a typical example? I don't know - and neither does the NJDOE. We haven't tested this system, so we have no idea what we're going to find when it's rolled out next year. That makes it even more important that the NJDOE not lock school districts into one interpretation of test scores. At the very least, we should look at the median, the mean, and the standard deviation of SGPs for each tested teacher.
But that would deny the NJDOE their opportunity to take personnel decisions out of the hands of local administrators and put them into an untested, mathematically illiterate system. Which seems to be the entire point...
More math fails coming up.
*NOTE - Dear NJDOE: Did you notice how when I averaged the scores, I expressed the average in the correct number of significant digits. Kind of important, don't you think?
Stand by...
This blog post has been shared by permission from the author.
Readers wishing to comment on the content are encouraged to do so via the link to the original post.
Find the original post here:
The views expressed by the blogger are not necessarily those of NEPC.