The Legal Consequences of Mandating High Stakes Decisions Based on Low Quality Information: Teacher Evaluation in the Race-to-the-Top Era
In this article, we explain how overly prescriptive, rigid state statutory and regulatory policy frameworks regarding teacher evaluation, tenure and employment decisions outstrip the statistical reliability and validity of proposed measures of teaching effectiveness. We begin with a discussion of the emergence of highly prescriptive state legislation regarding the use of student testing data within teacher evaluation systems, specifically for purposes of making employment decisions. Next, we explain the most problematic features of those policies, which include a) requirements that test-based measures constitute fixed, non-negotiable weight in final decisions, b) that test-based measures are used to place teachers into categories of effectiveness by applying numerical cutoffs beyond the precision or accuracy of the available data, and c) that professional judgment is removed from personnel decisions by legislating (or regulating) specific actions be taken when teachers fall into certain performance categories. In the subsequent section, we point out that different types of measures are being developed and implemented across states, and we explain that while value-added metrics in particular are, in fact designed to estimate a teacher’s effect on student outcomes, descriptive growth percentile measures are not designed for making such inference and thus have no place in making determinations regarding teacher effectiveness. We also explain that, due to the properties of value-added estimates, they have no place in making high-stakes decisions based on rigid policy frameworks like those described herein. We evaluate the legal implications of rigid reliance on measures of teaching effectiveness that a) lack reliability and b) may be entirely invalid.
Metrics powered by PLOS ALM