This article has been retrieved
times since September 6, 2002
Education Policy Analysis Archives | ||
Volume 10 Number 36 |
September 6, 2002 |
ISSN 1068-2341 |
|
Editor: Gene V Glass College of Education Arizona State University
|
Research and Rhetoric on Teacher Certification:
|
|
Abstract |
|
In October, 2001, the Baltimore-based Abell Foundation issued a report purporting to prove that there is "no credible research that supports the use of teacher certification as a regulatory barrier to teaching" (Walsh, 2001, p. 5). (Note 2) The Abell Foundation paper argued against Maryland's efforts to strengthen teacher preparation requirements and defended the continuation of a local short-term alternative route into teaching that had come under criticism. Suggesting that "educators, policymakers, the media, and the public mistakenly equate teacher quality with teacher certification" (p. 1), Kate Walsh, the author of the paper, complained that efforts to improve education for poor and minority children in Baltimore by the state and local superintendents of schools and by local advocacy organizations foolishly sought to secure more fully certified teachers for their schools. She cited as wrong-headed newspaper articles raising concerns, for example, that: "Least prepared teachers are at worst city schools: One-third lack basic credentials for certification," (p. 1). Calling misguided the efforts of a Baltimore community group that released a study which "bemoaned the fact that more uncertified teachers were teaching in the city's high-poverty, predominantly African-American schools than the city's whiter, more affluent schools" (p. 2), the paper sought to demonstrate that these inequalities in access to certified teachers are not problematic if certification can be discounted as a determinant of achievement. The Abell Foundation proposed that Maryland should 1) "eliminate the coursework requirements for teacher certification" and require only a bachelor's degree and a passing score on an appropriate teacher's exam; 2) "report the average verbal ability score of teachers in each school district and of teacher candidates graduating from the State's schools of education;" and 3) "devolve its responsibility for teacher qualification and selection to its 24 public school districts," delegating all hiring authority to individual school principals (pp. vii-viii). Although these ideas might seem indefensible to those who are engaged in research regarding teacher preparation and recruitment, the U.S. Secretary of Education echoed these recommendations in his Annual Report on Teacher Quality (USDOE, 2002), a report on the national state of teacher quality required under the 1998 reauthorization of Title II of the Higher Education Act. In this report, the Secretary argued that teacher certification systems are "broken," imposing "burdensome requirements" for education coursework that make up "the bulk of current teacher certification regimes" (p. 8). The report argues that certification should be redefined to emphasize higher standards for verbal ability and content knowledge and to de-emphasize requirements for education coursework, making attendance at schools of education and student teaching optional and eliminating "other bureaucratic hurdles" (p. 19). The report suggests that its recommendations are based on "solid research." However, only one reference among the report's 44 footnotes is to a peer-reviewed journal article (which is misquoted in the report); most are to newspaper articles or to documents published by advocacy organizations, some of these known for their vigorous opposition to teacher education. (Note 3) For the recommendation that education preparation be eliminated or made optional, the Secretary's report relies exclusively on the Abell Foundation's paper. Though written as a local rejoinder to Maryland's efforts to strengthen teacher preparation and certification, it appears to have become a foundation for federal policy. This article includes the response I wrote to Walsh's paper (Note 4) when it was first issued, with some additions that respond to a reply she issued with Michael Podgursky (Note 5) and a briefer version of her report recently printed in Education Next, a magazine put out by the Hoover Institution (Walsh, 2002). In order to make a case for her agenda, Walsh attacks all research that has found relationships between teachers' preparation and their measured effectiveness, including students' achievement. She characterizes much of the education research as "flawed, sloppy, aged and sometimes academically dishonest" (p. 13), a characterization that more aptly describes her own paper, which consistently misrepresents the statements of researchers, the findings of studies, and the evidence base for her claims. She claims to have reviewed all of the studies ever cited by proponents of teacher education. In fact, a large number of the references in the paper and appendix are not directly on the topic of teacher education, and many studies of teacher education effects are not included in the report. Furthermore, her paper does not actually review most of the studies it mentions. An original report appendix listing studies shrank from 175 in July, 2001 to fourteen in the version of the report released in October, 2001 selected according to no obvious criteria and omitting many of the most prominent studies on the topic. (Note 6) The "reviews" in a now separate appendix published on the foundation's website are generally not careful assessments of research methods or findings but a list of complaints and random observationssometimes accurate but often notabout various aspects of the studies or how they have been cited by others. (A number of examples are included below.) All studies have limitations, and some are too problematic to be relied upon, including a number that Walsh relies upon for her own assertions. However, Walsh's paper, which is littered with inaccuracies, misstatements, and misrepresentations, sheds little light on the research or its implications for teacher education and certification. In what follows I discuss the inaccuracies in Walsh's account, the actual findings of many of the studies she purports to review, and the findings of other studies she chooses to ignore, as well as the implications of her proposals for teachers, their knowledge, and the students they teach. In the course of the paper, I review some of the studies that have found influences of teacher education and certification on student achievement at the levels of the individual teacher (e.g. Goldhaber & Brewer, 2000; Hawk, Coble, & Swanson, 1985; Monk, 1994); the school (Betts, Rueben, & Danenberg, 2000; Fetler, 1999); the school district (Ferguson, 1991; Strauss & Sawyer, 1986); and state (Darling-Hammond, 2000c). The convergence of findings in analyses using different units of analysis reinforces the strength of the inferences that might be drawn from any single study.
What are the Arguments?The Abell Foundation report admits that teacher qualifications make a difference but it also tries to make a case that "the backgrounds and attributes characterizing effective teachers are more likely to be found outside the domain of schools of education. The teacher attribute found consistently to be most related to raising student achievement is verbal ability.... usually measured by short vocabulary tests..." (p. v). Later in the report, Walsh suggests that subject matter knowledge may be an additional criterion for hiring secondary teachers, but not for elementary teachers. Walsh objects to the state requirements regarding content coursework in each of the core academic areas for elementary teachers, since many who want to enter through the alternative Resident Teacher program in Maryland have had trouble meeting these requirements. Walsh then tries to dismiss all studies that find evidence that knowledge about teaching also makes a different for teacher performance, or to claim that studies finding positive effects of teacher education or certification are either too old, too small, too highly aggregated, or dependent on evidence about teacher performance other than student achievement or are not really about certification after all, even if their authors say they are. She often does this by misrepresenting the studies' actual methods and findings, as I detail below. While there are legitimate concerns to be raised about various studies in the literatureon all sides of the questionthis article does not shed much light on them. A thorough review of the quality and accurately portrayed findings of the several bodies of research that bear on this question would be a service to this field. Unfortunately, this document's inaccuracies and misinterpretations make it of little use in this regard. In what follows, I address five major issues regarding the Abell report and the research base on teaching and teacher education:
Evidence IgnoredWhile the Abell Foundation report claims that teachers do not need professional knowledge in order to teach, the field has been moving rapidly to codify the ways in which teaching knowledge makes a difference in student learning. For example, the National Reading Panel of the National Institute of Child Health and Human Development last year published a major review of carefully controlled research which found that children's reading achievement is improved by systematic teaching of phonemic awareness, guided repeated oral reading, direct and indirect vocabulary instruction with careful attention to readers' needs, and a combination of reading comprehension techniques that include metacognitive strategies. The report notes that teacher education is critical to the success of reading instruction with respect to both instruction in phonemic awareness and more complex comprehension skills: Knowing that all phonics programs are not the same brings with it the implication that teachers must themselves be educated about how to evaluate different programs to determine which ones are based on strong evidence and how they can most effectively use these programs in their own classrooms. It is therefore important that teachers be provided with evidence-based preservice training and ongoing inservice training to select (or develop) and implement the most appropriate phonics instruction effectively. (p. 11) Teaching reading comprehension strategies to students at all grade levels is complex. Teachers not only must have a firm grasp of the content presented in the text, but also must have substantial knowledge of the strategies themselves, of which strategies are most effective for different students and types of content and of how best to teach and model strategy use.... (Data from the studies reviewed on teacher training) indicated clearly that in order for teachers to use strategies effectively, extensive formal instruction in reading comprehension is necessary, preferably beginning as early as pre-service (National Reading Panel, 2000, pp. 15-16). Studies have documented that professional training can be effective in providing teachers with the strategies that enable them to teach these complex comprehension skills, and teachers who receive such training significantly improve students' reading outcomes (e.g, Duffy, Roehler, Sivan et al., 1987; Duffy & Roehler, 1989, regarding explicit strategy instruction; Palincsar & Brown, 1989, regarding reciprocal teaching). Similar insights in our understanding of how to develop student proficiency in mathematics and science, and how to develop teachers' skills for doing so, have recently emerged. For example, recent analyses of the National Assessment of Educational Progress (NAEP) which control for student characteristics and a number of measures of school inputs have found that students whose teachers have majored in mathematics or mathematics education, who have had more pre- or in-service training in how to work with diverse student populations and more training in how to develop higher-order thinking skills, and who engage in more hands-on learning do better on the NAEP mathematics assessments. Similarly, students whose teachers have majored in science or science education and who have had more pre- or in-service training in how to develop laboratory skills and who engage in more hands-on learning do better on the NAEP science assessments (Weglinsky, 2000). (Note 8) A recent review commissioned by the Department of Education, which was carefully vetted by a panel of researchers, disagreed with the Abell Foundation's conclusions. This review, which analyzed 57 studies that met specific research criteria and were published after 1980 in peer-reviewed journals, concluded that the available evidence demonstrates a relationship between teacher education and teacher effectiveness (Wilson, Floden, & Ferrini-Mundy, 2001). The review shows that empirical relationships between teacher qualifications and student achievement have been found across studies using different units of analysis and different measures of preparation and in studies that employ controls for students' socioeconomic status and prior academic performance. It is ironic that just as the field is learning more about how to prepare teachers to teach children effectively, the Abell Foundation suggests that we truncate teacher education and end the certification policies that would encourage and enable teachers to acquire this knowledgeor at least that we do so for the children of the poor, who also attend school in districts with minimal resources for professional development. The unanswered question is, How are teachers to learn what is known about how to teach well if there are no expectations, incentives, or supports for them to do so?
Unfounded ClaimsWhile ignoring these serious questions, Walsh makes a number of claims that are not supported either by the research she presents or by other evidence in the field. These include the following:
The Effectiveness of Certified and Uncertified TeachersFor her proposition that "new teachers who are certified do not produce greater student gains than new teachers who are not certified," Walsh cites seven studies, none of which provides support for this proposition, and five of which actually provide evidence that contradicts her claim. Three of the studies (Bliss, 1992; Stoddart, 1992; Lutz & Hutton, 1989) include no data on student achievement at all, although Walsh elsewhere dismisses all other studies that do not use student achievement data as the dependent variable. (In a reply to my response, Walsh and Podgursky (2001) note that these studies have been deleted in a newly printed version, along with some studies Walsh cited that were not peer reviewed, "so that the report ... does not appear to convey a double standard" (p. 15)). Six of the studies Walsh cites actually deal with alternatively certified rather than uncertified teachersthat is, teachers who had undertaken teacher education at the post-baccalaureate level in university- or school district-based programs that rearrange the way teacher education is delivered. The findings across the studies are mixed, but none of them shows that uncertified teachers do as well as certified teachers, and one of them shows that this is clearly not true. Several of the studies point instead to the value of teacher education: The more positive findings are found for the alternatives that provide more complete preparation.
Evidence about Preservice Teacher EducationFor the proposition that "there is little evidence that the content and skills taught in preservice education coursework is (sic) either retained or effective" (p. 7), Walsh cites two articles (Murnane, 1983; Veenman, 1984) from among the many dozens of studies of teacher education that could have been retrieved from the peer-reviewed literature, had she done a search. Both of these are very old pieces, published long before recent reforms in teacher education. Neither of them makes any statement in support of Walsh's claim.
The Influence of Verbal Ability on Teacher EffectivenessThere is little disagreement about the fact that verbal ability and subject matter knowledge influence teacher effectiveness, although Walsh tries to set up a straw man by suggesting, inaccurately, that some researchers, including myself, have argued otherwise. (See the section on "Misrepresentations of Research" below.) There are two areas of real disagreement, however. One is whether verbal ability alone is the only or best measure of teacher effectiveness. The other is how to evaluate the size of relative contributions of various kinds of knowledge to teacher effectiveness. As examples cited earlier illustrate, the literature on teacher characteristics and their effects on teacher performance has been a captive of the measures most likely to be available in large data sets at any moment in time. While there are many studies evaluating the influences of teachers' standardized test scores, especially measures of verbal or general academic ability, because these variables have been readily available in large-scale data sets since the 1960s, data on teachers' course-taking backgrounds or teacher education experiences have been included in large data sets only since the early 1990s. Thus, there are more studies finding influences of variables that have most often been measured. Finally, most of the studies that have included measures of verbal ability or content knowledge have not included measures of teacher education or certification. In a recent review, Wayne and Youngs (in press) found five studies that observed relationships between measures of teachers' verbal or general academic ability and student achievement and that met the standard of having controlled for students' socioeconomic status and prior achievement. Four of these studies employed data sets from the 1960s and 1970s and none of the five included measures of teacher education or certification. Looking across studies in these different eras, in many cases, the relative effect sizes of verbal ability measures are no larger than those of teacher education and certification measures in the studies that use these instead.
The Academic Ability of Teachers who Lack CertificationAnother argument made by those who would eliminate certification is that an unconstrained market would allow the recruitment of individuals with higher verbal or general academic ability who do not now enter teaching. While it is probable that some individuals would choose to teach if they did not have to prepare, it is not clear that most of these entrants would be more academically able, that they would be better teachers, or that they would stay long in teaching. It is also unlikely that given current wages, individuals who are now preparing for much higher-paying careers in medicine, the law, engineering, and other professions that require much more onerous preparation and licensing processes would choose teaching as a career simply because they did not have to be certified. Labor market contexts are relevant to this question. The qualifications of individuals preparing for teaching improved noticeably between the early 1980s and the early 1990s in terms of both academic attainment and ability measures, in part because of the changes in admissions requirements to teacher education adopted by states and universities but also likely because of the substantial increases in real wages for teachers that occurred during the 1980s. Whereas prospective teachers were disproportionately drawn from the bottom quartile of college students in the early 1980s (Lanier & Little, 1986), both grades and test scores improved for teacher candidates by the 1990s. The Recent College Graduates Survey, which tracks college graduates into the labor market, found that the grade point averages of newly qualified teachers in 1990 were higher than those of the average college graduate, with 51% earning a GPA of 3.25 or better as compared to 40% of all graduates (Grey et al., 1993). However, average GPAs were significantly lower for the 15% of college graduates entering teaching who were neither certified nor eligible for certification. Most of the uncertified entrants (57%) had grade point averages below 3.25, and 20% had GPAs below 2.25. Attrition was also high for the untrained candidates. By the time of the survey (one year later), only one-third of the uncertified entrants were still engaged in teaching as their primary jobs (Grey et al., 1993). In addition, the Educational Testing Service found that among 270,000 test-takers in 1995 through 1997, college admissions test scores were highly correlated with initial teacher licensing scores (Praxis I and Praxis II), and the lowest average scores on both kinds of tests were those held by individuals who entered teaching without preparation (Gitomer, Latham, and Zimek, 1999). (Walsh describes this 14% of the sample as an "error" in the study since the individuals had not enrolled in a teacher education program; she misunderstands the fact that these Praxis test-takers were the entrants to teaching who used emergency or alternative routes. (Note 20) Prepared teachers scored much higher than unprepared teachers. While students who prepare to enter fields other than teaching have higher average test scores on measures like the SAT than do those preparing to enter elementary school teaching, there is no significant difference for prospective secondary teachers, most of whom earn a disciplinary degree along with their teaching certificate. The narrowing of this gap between prospective teachers and others is likely a function of the more rigorous admissions requirements for teacher education enacted in most states and the growth in wages between the early 1980s and the mid-1990s. Finally, the study found that graduates of NCATE-accredited colleges of education passed the Praxis subject matter tests for teacher licensing at a significantly higher rate than did graduates of unaccredited programs, boosting their chances of passing the examination by nearly 10 percent (Gitomer, Latham, and Zimek, 1999). Walsh suggests that this higher Praxis pass rate might simply reflect the fact that NCATE schools could be located in states with low cutoff scores. However, additional analyses of the data by ETS and another independent study (Note 21) indicate that this is not the case. A more likely explanation is that NCATE's requirements that colleges demonstrate how they screen applicants for general ability and that they ensure strong content backgrounds translate into somewhat greater attention to these matters in institutions that are accredited. These data suggest that standards may increase the general as well as specialized qualifications of prospective teachers. They do not suggest that removal of certification requirements brings higher ability individuals into teaching or keeps them there. It is important to recognize that labor market incentives operate among individuals actually entering teaching. For example, several studies of alternative certification programs found that the academic records of recruits varied substantially by teaching field, with alternatively-certified candidates in high demand shortage fields, such as mathematics and science, having much poorer academic records than candidates in other fields and than candidates from traditional teacher education programs in those same fields (see Natriello & Zumwalt, 1992, re: New Jersey; Lutz and Hutton, 1989 re: Dallas; Stoddart, 1992, re: Los Angeles). It is unlikely that eliminating requirements for training would increase the career attractions to teaching for academically able candidates as much as increased wages would. Meanwhile, eliminating training requirements could result in a less well-qualified teaching force, especially if the elimination of certification standards not only reduced the knowledge of entrants but also reduced pressures for competitive wages. The Private School ArgumentFinally, a claim sometimes made by opponents of teacher certification, including Walsh, is that private schools are more effective than public schools, and that this is becauseor at least is not impeded bythe fact that private school teachers are not certified. There are two major problems with the private school "proof": First, there are conflicting findings about the relative effectiveness of public and private schools, with credible evidence on both sides of the question. Second, most private school teachers are certified and an even larger majority have specific preparation for teaching, even when they have not sought certification. On the effectiveness of private schools, Walsh cites Coleman, Hoffer, & Kilgore (1982), who examined data from the first wave of High school and Beyond surveys, conducted in 1980, and found evidence of higher performance for comparable students in Catholic and other private schools as compared to public schools. The researchers attributed their findings primarily to differences in student behavior across school sectors, measured by variables like lower rates of absenteeism, cutting class, and fighting, along with factors like more time spent on homework and higher individual student attendance. They also found that achievement was actually higher for comparable students who were in public schools that had these characteristics. Subsequent studies have produced findings that favor both public and private schools after controlling for student characteristics and school organization (Bryk & Lee, 1992; Lee & Bryk, 1988; Lee, Dedrick, & Smith, 1991). Most studies have pointed to variables like school and class size, school organization, and curriculum differentiation as critical variables in determining both public and private school effectiveness. When these factors are controlled, public school students often do as well or better than private school students in schools with similar features. Furthermore, differences in the preparation of public and private school personnel are not as large as many people assume. More than 30 states certify private school personnel (Feistritzer, 1984), and, when Coleman did his analysis, more than 85% of private and parochial school teachers were certified, as compared to about 95% of public school teachers (NCES, 1985). This has changed only slightly in the years since. Although certification is not required for private school teachers in all states, only 34% of private school teachers in 1993-94 (the most recent year for which national data are available), were not certified in their primary assignment field. Some of these teachers were certified in fields other than their primary assignment field. Many undertook teacher preparation, even though they did not apply maintain a state license or certificate. In 1993-94, public and private school teachers were almost equally likely to have received an undergraduate degree in education (68.9% for public vs. 61.5% for private elementary teachers and 19.8% for public vs. 19.3% for private secondary teachers) (NCES, 1997, p. 25). The education degree as an indicator of preparation is quite partial, since the education degree has waned as certification increasingly requires a content degree with an education minor or credential. The percentage of 1992-93 bachelor's degree recipients who had taken education courses was 87.1% for public school teachers and 71.6% for private school teachers, (Note 22) and the average number of education credits earned was 37.4 for public school teachers as compared to 35.2 for private school teachers (NCES, 1997, table A-51). (Note 23) Public school teachers were also more likely to have taken subject matter degrees in their teaching fields than private school teachers. For example, 66% of public school mathematics teachers held a major or minor in the field, as compared to 58% of those in private school. (Goldhaber and Brewer, 2000 reported a similar finding.) The same differentials hold in other fields to somewhat lesser extents. The greater content preparation of public school teachers is likely a function of the fact that certification has required increasing amounts of subject matter coursework in the field to be taught, thus leveraging stronger content preparation for public school teachers in states where private school teachers are not required to hold certification. Almost all states now require certified teachers to hold at least a minor in the field to be taught, and many require a major in the field. Finally, even if it were true that untrained teachers were unusually effective in some private schools for students of comparable initial achievement levelsa point about which there is no published evidenceit would be a large leap of faith to assume that such teachers would be equally effective in schools where many students have much greater educational needs and students are not pre-selected for their academic ability, their positive school attendance and behavior, and their parents' income and interest in education. There are very large differences in the populations of students attending public and private schools in the United States, (Note 24) which have important implications for teachers' knowledge and skills. It is one thing for a teacher to offer information in whatever manner comes instinctively to students who are academically able, have learned to learn independently, and are well-supported at home by educated parents, tutors, and other supports for their learning. It is quite another thing to teach by the seat of the pants when students do not have these learning supports at home and may present a variety of language and learning differences. Being effective with students who need substantial support for their learning requires greater diagnostic ability and knowledge of how to present information and structure experiences in ways that help them become successful. Systematic knowledge about how to organize curriculum and reach students with special learning needs is most needed in the schools that serve most students with these needs.
Other Misrepresentations of Research FindingsThe remainder of Walsh's review continues the kind of misrepresentations documented above, appearing to rely on the belief that readers will read its accusations, but will not read or understand the research itself. Although she prepared a draft appendix with 192 studies that sought to critique many of the studies she dismisses (often inaccurately), it was not published with the report. Appendix B, to which the reader is repeatedly referred for reviews, includes only 14 studies. Throughout the report, the reader is referred to this appendix for critiques of studies that do not appear there. The selection of research included in the published version of the report's appendix is very strange. Many strong studiessome of the key citations in the fieldare omitted, along with the flawed rationales for dismissing them that now appear in a separately-published appendix. Some much less important and less well-designed studies are included, with the apparent goal of critiquing their size or designs as though they represented the dozens of studies not mentioned or excluded. Thus, the paper does not include information regarding most of the studies Walsh claims she has reviewed and does not provide evidence for her claim that, of all the studies cited in support of teacher education and certification, "none bear up to scrutiny." Here are just a few additional examples of major misrepresentations.
Methodological IssuesOne of the ways that Walsh seeks to make much of the research on teacher education disappear is by suggesting that it is inappropriate to cite studies that are older, smaller, use measures of performance other than student achievement scores, are aggregated at a level above the classroom, or are published in venues other than peer-reviewed journals. As noted above, Walsh uses a double standard in selecting research to reject when it finds evidence of the influence of teacher education on student learning and research to cite for her own purposes. While she discounts the findings of many dissertation studies and technical reports because they were not published in peer-reviewed journals, in making her own claims, she cites at least 15 studies that were not published in peer-reviewed journals or technical report series and at least 20 that were published before 1980, including some that she elsewhere dismissed from consideration because she did not like specific findings. For findings she likes, she also cites several that use supervisory ratings as the only measures of teacher effectiveness and others that she later dismisses for aggregation bias. Sometimes she represents the studies' findings accurately; sometimes not. Many of the studies she cites for various propositions do not contain the findings for which they are citedor, in several cases, any data on the question at all. I would not argue, as Walsh does, that none of these studies have value as contributions to the literature. However, the double standard she applies in using studies of different eras, sizes, aggregation levels, dependent variables, and publication statuses perhaps proves the point that to evaluate the weight of evidence in a field it is often necessary to triangulate findings that used different methods, over different time periods, and at different levels of aggregation to see where there is an accrual of evidence over time and across methods. Of course it is important to do this with appropriate attention to the methodological strengths and weaknesses of various studies and lines of research. Unfortunately, Walsh often does this poorly, appearing to misunderstand critical research design issues. Below, I discuss the issues of study size and design, level of aggregation, choice of dependent variable (including the use of supervisory ratings of teacher performance), age, and venue of publication. Study Size and DesignIn one part of her review, Walsh bemoans the lack of experimental research. She then rejects the results of studies with experimental designs because of their smaller sample sizes and cites almost exclusively non-experimental correlational studies, whichthough largerlack direct controls for the variables of interest and must rely on statistical manipulations of data to account, indirectly, for these other influences. This kind of correlational research is, of course, legitimate for staking out broad possibilities in relationships among variables, but it has its own limitations. Many of the more carefully controlled experimental designs can in fact offer more solid evidence about effects, because the "treatment" they are studying is known and the samples can be better controlled than is true for large correlational studies that use proxies and statistical controls rather than direct observation of the phenomena of interest. Medical research, for example, typically uses small sample experimental research as the basis for establishing the possibilities of effects, while using large correlational studies as rough indicators of possible relationships that then require further examination. Single case studies of clinical findings are part of the medical research base along with small experiments sometimes carefully controlled and sometimes not, larger clinical trials, and correlational studies looking at broad tendencies. The usefulness of small, experimental and quasi-experimental studiesincluding those that Walsh cites and sometimes dismisses (and other times embraces, depending on her reading of and agreement with the findings)is not in the definitiveness of their individual findings but in their contribution to a larger body of work from which a preponderance of evidence can be examined. Although medical researchers generally consider correlational studies to comprise a weaker source of evidence about effects than smaller experimental designs, they recognize that mixed methods of research serve complementary purposes. Of course, one of the reasons correlational studies must be interpreted with caution is that there is always the question of what direction the correlations may point, sometimes referred to as "reverse causation." There is also the problem that variables in these studies are frequently crude proxies for the actual measures of interest and may either fail to capture the intended construct or in fact be reflecting the influences of other unmeasured variables. As noted above, many of the variables that can arguably be said to reflect constructs of interest are highly correlated with one another. Furthermore, many of the variables of interest are not well-represented in large data sets. Thus it is critical to represent in any review of research a range of studies that can tease apart the different relationships of interest with a range of measures. Level of AggregationAnother criticism used to dismiss some studies' findings as irrelevant is the charge of "aggregation bias." For example, Walsh dismisses studies that include favorable findings about the value of teacher education in which data are aggregated at the level of the school or district, although she, herself, cites similarly aggregated data for her conclusion that verbal ability matters most (e.g. Coleman, 1966; Ferguson, 1991; Strauss & Sawyer, 1986). More important, this critique misses a crucial point about how research results accrue and are triangulated to look at possible relationships among conditions and outcomes. Just as individual level data about health practices and outcomes inform medical research, for example, so do highly aggregated data at the level of cities, counties, and even countries when researchers seek to understand why, for example, women in some nations have low levels of breast cancer or men have low levels of heart disease. Studies at different levels of aggregation provide different kinds of insights about the phenomena under study. In building a corpus of research on any topic, a wide array of research strategies and levels of analyses are used. It is true that the size of measured effects of different variables can vary at different levels of the system; however, it is not always clear in which way the bias will operate. Often, the general direction of the results holds at different levels of the system, even if effect sizes differ. For example, in their Alabama study, Ferguson and Ladd (1996) found the effects on student achievement of teachers' test scores, masters degrees, and experience held at both the district and school levels in terms of both significance and directionality. There are pros and cons of both kinds of analyses. On the one hand, disaggregated data can exhibit greater measurement error. On the other hand some analysts have argued that omitted variables may bias the coefficients of school input variables upward when data are aggregated to the district or state level (Hanushek, Rivkin, & Taylor, 1995). However, this generalization does not always prove true. For example, although Summers and Wolfe (1975) found that selectivity ratings of each teacher's undergraduate institution were important in explaining 6th grade students' achievement when examined at the individual teacher level, this relationship disappeared with they aggregated the college ratings and other school inputs into school-level averages. This contradicts the assumption about the usual direction of aggregation bias. Of course, omitted variables can bias results at any level of the system. Sometimes, especially when the goal of a study is to evaluate broad trends and policy influences, it is important to have data aggregated and analyzed at multiple levels. For interpreting the weight of evidence on a particular issue, the most important question is whether consistent results are found at different levels of aggregation. Just as Walsh cites highly aggregated data as well as less aggregated data on the question of the influences of verbal ability, so the studies examined here reveal influences of measures of teacher education and certification on student achievement at the levels of state (Darling-Hammond, 2000c), school district (Ferguson, 1991; Ferguson & Ladd, 1996; Strauss & Sawyer, 1986), school (Ferguson & Ladd, 1996; Fetler, 1999), and individual teacher (Goldhaber & Brewer, 2000; Hawk, Coble, & Swanson, 1985; Monk, 1994). Measures for Assessing Teacher PerformanceWalsh argues that studies using various ratings of student performance other than student achievement test scores should be discounted, noting that supervisory ratings "can be too subjective to measure teacher quality accurately" (p. 20). As support for this, she cites in her appendix a review of research on teacher evaluation I conducted with colleagues at the RAND Corporation (Darling-Hammond, Wise, & Pease, 1983). While her statement of why I cited the review in another article is completely inaccurate, (Note 29) she is correct when she notes that teacher evaluations by principals and other school-based supervisors have been found to lack strong reliability. Our study of evaluation practices noted that this has been a function of principals' lack of time, inadequate expertise for evaluating all teaching situations, insufficient evaluation training, and inappropriate instrumentation. However, this critique does not extend to ratings of performance that are based on structured observations conducted by trained, expert raters that have been developed and demonstrated to have high reliability. Some of the studies Walsh dismisses use systematic ratings systems by trained observers (e.g. Ferguson & Womack, 1993; Guyton & Farokhi, 1987). The extent to which ratings of performance should be considered or discounted depends on who conducts the rating process, with what training and instrumentation, under what conditions, and with what efforts to enhance reliability. Age of StudiesThe age of studies is also a legitimate but not determinative issue. Studies do not become invalid merely because they are old. While Walsh argues that many older studies using large data sets lacked certain kinds of variables as controls, this does not stop her from citing many of these studies for propositions with which she agrees. More important, the designs of some older studies are at least as strong as some of the more recent studies, and weak studies exist now as then. There is not a strong relationship between study vintage and quality. It is certainly true that teacher education programs and certification requirements have changed over time, so that inferences from studies conducted in one era do not automatically generalize to others; the extent to which one can learn something of use from a study depends on how well the variables are defined and on a knowledge of their relevance to more recent conditions as well as on the strengths and limits of its methodology. Vintage does influence the prevalence of studies of certain kinds. With respect to studies of the effects of teacher education and certification, a large number of studies were conducted in the high-demand era of the 1960s and '70s when there was great variability in entry pathways and much interest in the topic. It is also true that federal funding for educational research was substantially larger before 1980 than it was during the severe budget cuts of that decade. In addition, in times of relatively low demand, like most of the 1980s, virtually all teachers were certified and there was too little variability to find effects of this variable in large-scale studies. Few studies were concerned with these issues and few data sets had measures of teacher education variables. Interest and data on this topic have just begun to return in the 1990s. Those who are interested in the extent to whichand the ways in whichdifferent kinds of preparation may matter for teacher performance and student learning can and should be informed by earlier studies where they are applicable to the questions under study. Publication VenueAlthough Walsh is incorrect in her statement that dissertations are not retrievable (there are library systems for doing so, if sometimes less than convenient), it is legitimate to suggest that the kind of review they have received is often more variable, and may be less strenuous depending on the university and department, than for many peer-reviewed journals. There are certainly some universities whose dissertation review process is more rigorous than some journals, but the reverse is also certainly true. The same variability in review stringency is true for conference papers and technical reports. However, Walsh herself cites a substantial number of unreviewed papers in support of various positions she takes. There are different schools of thought about how to treat these papers in reviews. Some would argue, as does Veenman (1984), a reviewer cited by Walsh, that the use of all identified studies is justified for a review that seeks to delineate global trends where large numbers of findings are similar (p. 166). Others would argue that papers that have not been published with peer review should be used only when the review includes a critique of each study's methods. Still others might argue, as Walsh does (at least rhetorically if not in practice), that such studies should be excluded from consideration. I accept the point that it is a useful common ground to rely on research published in peer-reviewed journals, and I restrict the analysis in this paper to those studies. Even with this criterion, there is substantial evidence to be weighed and discussed.
Who is Affected by this Debate?The critical issue here is not the protection of researchers' reputations or the turf of schools of education but the protection of students, especially low-income students and students of color who are disproportionately taught by unprepared and uncertified teachers. As Walsh's paper shows in her references to data on the disparities in access to qualified teachers for students in Baltimore, the children most affected by these arguments are economically and educationally disadvantaged children in central cities who are substantially abandoned by the funding and hiring protections that should operate to provide a foundation for their education. These are the students whose education is most undermined by their lack of access to teachers who have the knowledge and skills to ensure that they learn to the new high standards the society and the state demand. What the statistics on the lack of certified teachers actually mean on the ground is that many of Baltimore's most educationally vulnerable childrenmost of them African Americanare taught in their elementary school years by teachers who have had no training in how to teach them to read, much less to develop other basic and higher order skills they must have to succeed in school and life. When they fail to learn, they begin the tortuous process of educational failure that will end for many of them in dropping out or being unable to pass the state tests that would grant them a diploma. This then launches a life spent either in a marginal part of the economy that barely yields subsistence wages or, as is true for more than 50% of high school dropouts, in the inability to gain any job at all. In today's economy, these young people are fated to become part of the growing criminal justice system, as incarceration is increasingly linked to inadequate education. More than half of the growing number of inmates in the United States are functionally illiterate and cannot gain access to today's labor market. This is not unrelated to the fact that so many low-income students have been taught by teachers who never learned how to teach them to read. Illogical Policy ConclusionsThe disparities in access to qualified teachers in Maryland are a function of a state school finance system that has underfunded Baltimore's schools for decades, along with inadequate incentivesfor example, service scholarships, forgivable loans, and recruitment attractions like salaries and housing assistanceto encourage individuals to acquire strong training and then teach in high-need fields and locations. The Abell Foundation report does not argue for more equitable funding for the schools that serve Maryland's poor and minority students or for stronger incentives to attract well-prepared teachers to these schools. In fact, the report cites approvingly a paper prepared to stave off an equity lawsuit in Maryland (Hanushek, 1996b) which argues against district investments in smaller class sizes or higher salaries in Baltimore, asserting that "Baltimore City would not benefit from additional resources as much as it could benefit by better school management." (Note 30) The Abell Foundation report argues that the enormous disparities in resources and qualified teachers between Baltimore and other districts are not a problem because teacher certification does not mean anything, and that in fact the solution is to do away with certification altogether. In suggesting that devolving all hiring decisions to principals is the answer to the problem of recruitment for the schools serving minority and poor children, Walsh ignores the fact that, even if all principals had infinite information at their disposal about the likely effectiveness of teachers and made wise, fully informed choices (two assumptions that have been challenged by some research on teacher selection practices), principals do not control the major levers for addressing the problems of unequal supply: unequal district revenues, noncompetitive teacher salary levels, and the policies that govern recruitment and preparation that would allow them to seek out and hire the individuals they might most want to recruit. Eliminating certification requirements would eliminate pressures for competitive wages or recruitment incentives for teachers, since an open marketplace in a resource-constrained public sector could resolve shortages by lowering standards. In addition, eliminating certification requirements would eliminate evidence about disparities in students' opportunities to learn, for if there are no minimum standards, there will be no evidence of differences in the extent to which they have been achieved by teachers working with different groups of students. This would in turn reduce pressures for the creation of policies to rectify these inequities. Finally, eliminating such standards would remove the mechanisms states have been developing and improving to be sure that teachers know their content well, know how to teach the content to students, know how to teach fundamental skills like reading, and have the ability to meet the special needs of learners who may have learning disabilities that require distinct teaching strategies, whose first language is not English, or who simply struggle with certain kinds of academic tasks and need diagnostic assistance. The outcome of Walsh's argument, were it to be successful in the policy community, would be continued inequality in funding, depressed salaries for teaching in high-need areas, continued lack of access for poor children to a stable teaching force of well-qualified teachers by any definition, and tragic loss of a productive future for students who are underserved. To be sure, certification is but a proxy for the subject matter knowledge and knowledge of teaching and learning embodied in various kinds of coursework and in the evidence of ability to practice contained in supervised student teaching. It is true that certification is a relatively crude measure of teachers' knowledge and skills, since the standards for subject matter and teaching knowledge embedded in certification have varied across states and over time, are differently measured, and are differently enforced from place to place. The quality of preparation in both university programs and other alternatives has varied as well, although a number of states have made substantial recent headway in strengthening teachers' preparation and reducing this variability. Given the crudeness of the measure, it is perhaps remarkable that so many studies have found significant effects of teacher certification. This does not mean that we should be sanguine about certification policies. There are questions about the quality of tests, courses, and institutions that are the subject of study and action across the country (see, for example, Darling-Hammond, Wise, & Klein, 1999). The answer to flaws that may be perceived, however, is not to eliminate or undermine the pathways that enable and require teachers to gain knowledge and students to have access to teachers who have the knowledge they need. If teacher knowledge and skill about both content and how to teach it is important, as substantial evidence suggests it is, the most sensible policy goal is to work to improve preparation opportunities and certification standards so that they increasingly approximate what teachers need to know and do in order to be successful with diverse students. As Levin (1980) noted, certification is a critically important exercise in the economics of information that should be a target of continual improvement: (T)he facts that we expect the schools to provide benefits to society that go beyond the sum of those conferred upon individual students, that it is difficult for many students and their parents to judge certain aspects of teacher proficiency, and that teachers cannot be instantaneously dismissed, mean that somehow the state must be concerned about the quality of teaching. It cannot be left only to the individual judgments of students and their parents or the educational administrators who are vested with managing the schools in behalf of society. The purpose of certification of teachers and accreditation of the programs in which they received their training is to provide information on whether teachers possess the minimum proficiencies that are required from the teaching function. Because this is an exercise in the provision of information, it is important to review the criteria for setting out how one selects the information that is necessary to make a certification or accreditation decision (p. 7). ConclusionKate Walsh has dismissed or misreported much of the existing evidence base in order to argue that teacher education makes no difference to teacher performance or student learning and that students would be better off without state efforts to regulate entry into teaching or to ensure certain kinds of teachers' learning. While she argues for recruiting bright people into teaching (and who could disagree with that?), her proposals offer no incentives for attracting individuals into teaching other than the removal of preparation requirements. While this proposal is couched as the elimination of "barriers" to teaching, evidence suggests that lack of preparation actually contributes to high attrition rates and thereby becomes a disincentive to long-term teaching commitments and to the creation of a stable, high ability teaching force. Lack of preparation also contributes to lower levels of learning, especially for those students who most need skillful teaching in order to succeed. The evidence from research presented here and elsewhere makes clear that the policies Walsh endorses could bring harm to many children, especially those who are already least well served by the current system. Those who make such arguments for eliminating one of the few protections these children have should bear the burden of proof for showing how what they propose could lead to greater equity and excellence in American schools. Notes1. The research assistance of Lisa Marie Carlson is gratefully acknowledged. 2. "Teacher Certification Reconsidered: Stumbling for Quality" is published through the Abell Foundation website: www.abellfoundation.org. The version of the report that was publicized and published on this website in October, 2001 is the basis for this response. The report has since been amended. In a reply to my response posted to the Abell Foundation website, Walsh noted that some of the errors I pointed out have been removed in the hard copy version the foundation published in December 2001. 3. In addition to the Abell Foundation, these include the Fordham Foundation, which has issued a "manifesto" urging the elimination of teacher education and certification requirements. 4. See The Research and Rhetoric on Teacher Certification: A Response to Teacher Certification Reconsidered, at http://www.nctaf.org. 5. See Teacher Certification Reconsidered: Stumbling for Quality, A Rejoinder (November, 2001) at www.abellfoundation.org. 6. A separate appendix is published on the Abell Foundation website. Soem of its entries have changed as criticisms of the report have been lodged. 7. See, for example, footnote 18 on p. 13 where Walsh refers readers to Appendix B for analysis of six studies, only two of which (Guyton & Farokhi, 1987; Monk, 1994) are actually included there. Appendix B of the published version of Walsh's report includes only 14 of 192 studies originally included in her draft of July 23, 2001 and does not include most of the key studies on the topic. A longer appendix was later added to the Abell Foundation website. Readers who consult with that document will find that many of the studies listed are not concerned with teacher education but are cited for other reasons related to one of Walsh's own arguments; many others are not reviewed because they were not retrieved or were deemed too old or too small; still others are "reviewed" only in the sense that complaints are made about them or about the way they were cited by another researcher. 8. In a reply to my response, Walsh and Podgursky (2001) suggest that Wenglinsky referred only to in-service education. However, the NAEP questions Wenglinsky analyzed for evidence of teacher learning covered coursework or professional development teachers had encountered before or after entering teaching. The stem for these questions was in each case one of the following: "During the past five years, have you taken courses or participated in professional development activities in any of the following?" or "Have you ever received training in any of the following, either in courses or in-service education?" 9. Another study by the California Commission on Teacher Credentialing found the attrition rates of Los Angeles Teacher Trainees who dropped out before they entered teaching to be quite high. Of the first cohort, 80.3% completed the first year of training and only 64.6% completed the second year and received a clear credential the year after (Wright, McKibbon, and Walton, 1987). This 35% attrition rate prior to graduation from the program added to the 53% attrition rate of those who completed the program but left the district within the subsequent 7 years (Stoddart, 1992) left only about 30% of the original cohort in the district after 7 years. 10. In her Education Next article, Walsh (2002) lists a set of studies with sample sizes of up to 55 teachers as "too small to produce results that are reliable or that can be generalized to the larger population," (on-line version, p. 9). However, in her reply to me (Walsh and Podgursky, 2001, p. 14), she states that because Miller, McKenna, & McKenna's study was a matched pair study, a "gold standard of research," its small numbers (18 teachers for examining student achievement effects) are justified. Yet just pages earlier in the same document (p. 8), she and Podgursky criticize another matched pair study (Hawk, Coble, & Swanson, 1985) which has a larger sample (36 teachers) and stronger design for evaluating student achievement (Miller et al. drop most of their teachers and the matched comparison design when they evaluate student test scores) as lacking statistical controls (also missing in the Miller et al. study) and failing to adjust for pre-test scores of students (Miller, McKenna and McKenna do not even present the pre-test scores of students). The Hawk et al. study, which Walsh originally cited approvingly as an argument for content knowledge is now dismissed by Podgursky as "small and not well-controlled" to avoid having to acknowledge its results, which find positive effects of teacher certification on student achievement. 11. Personal communications with economist Susanna Loeb and statistician William Billet. 12. As one of dozens of examples of general sloppiness, neither the Goldhaber and Brewer study nor the Hawk, Coble, and Swanson study cited by Walsh for this proposition even treated the question of whether "the most distinct problem in schools serving poor children is the number of teachers who are teaching subjects in which they have no expertise." Neither study examined or reported on the socioeconomic status of students or the distribution of teachers in schools serving different children. 13. As the study clearly states, California uses emergency permits for those who lack either subject matter competence or pedagogy or both. The requirement for a clear credential is passage of both subject matter competence and a set of pedagogical requirements, whether these are completed in a "traditional" or an "alternative" program, which in California would be an internship model requiring the candidates to meet the same standards as traditional programs. In fact, the composition of the emergency permit pool in California is nearly the opposite of what Walsh seems to surmise. This pool includes many teachers who have passed the subject matter test (or alternative content course requirements) in mathematics but who have not completed teacher education requirements. It also includes many teachers who have passed a basic skills test but have not completed either the subject matter or teacher education requirements for a clear credential. It includes very few individuals who have completed teacher education requirements but who have not completed subject matter requirements, since demonstration of subject matter competence is a prerequisite for entering the student teaching or internship portion of teacher education in California. Furthermore, experienced teachers who may be teaching math out of field would generally have been included in Fetler's data set as credentialed, since out of field teaching is not monitored by the state through the data set he used. 14. The original appendix was included in Walsh's draft dated July 23, 2001. Her final complete appendix published in October, 2001 modifies this statement only slightly, stating, "The author's principal and clear lament is the lack of subject matter knowledge in mathematics, with little mention at all of education coursework that may be lacking." 15. High-minority schools were defined as those with more than 90% students of color; low-minority schools had fewer than 30%. High-achieving schools were defined as those in the top quartile of achievement on the SAT-9 tests used by the state; low-achieving schools were those in the bottom quartile. 16. Rosalind Rossi, "Teacher woes worst in poor schools," Chicago Sun Times, October 10, 2001. 17. Walsh states that, "L. Darling-Hammond ... presents a chart using an ambiguous term 'Teacher Qualifications' which accounted for nearly half of the student achievement gains." (p. 17). The chart to which Walsh alludes actually referred to another study by Ferguson (1996) and was clearly labeled as such. Another chart next to this one was drawn directly from a table in the Greenwald, Hedges, and Laine study, and was also clearly marked. 18. In a later response to my reply (Walsh & Podgursky, 2001), Walsh notes that she cited Ferguson & Womack in error and meant to cite Ferguson and Ladd (1996). However, this study is one she should have discounted due to its level of aggregation if she were adhering to her own standards for evaluating research. 19. One odd criticism is that the institution, Arkansas Tech, has "low entrance requirements, making it unlikely that enough variance in student ability, background and coursework is present to reflect a broader population. The variance may be too narrow or at least skewed." Walsh seems to be unaware that the variance in student ability measures is usually much larger in large state universities like this one than it is in more selective colleges, thus making some kinds of inferences more, rather than less supportable. The more appropriate question about single institution studies is whether they may generalize to unlike institutions, a legitimate point that Walsh does not raise, and that should be answered by conducting studies within and across institutional contexts. 20. Some may also have been those teachers who needed to take the Praxis as an entrance examination for a post-baccalaureate teacher education program. 21. The ETS re-analysis is soon to be published. An earlier analysis of the federal Baccalaureate and Beyond data base found that 1993 graduates of NCATE-accredited teacher education programs were about 50% more likely to have scored above the 50th percentile on SAT and ACT tests than graduates of non-NCATE teacher education programs (Shotel, 1998). NCATE graduates had also taken more social science, computer science, advanced foreign language credit, pre-college mathematics, and teaching coursework and fewer remedial English courses than non-NCATE graduates, with other areas being approximately equal (Shotel, 1998). 22. The proportions who had taken other kinds of liberal arts coursework also differed little. For example, the proportion of 1992-93 bachelor's degree recipients who had taken college coursework in mathematics at the level of calculus and above was 18.3% in public schools and 16.9% in private schools; science was 77.2% vs. 73.5% (table A-51). 23. These statistics pertain to the youngest teachers in public and private schools: 1992-'93 bachelors degree recipients hired by 1993-94. These teachers are the least likely to be certified, even though they have taken education coursework at rates nearly as high as public school teachers. This suggests that many of these teachers may have prepared to teach but did not seek or secure state certification. In 1993-94, NCES reports that about 36% of private school teachers held no certificate in their primary assignment field (the data are not presented regarding their certification in another field other than the primary teaching assignment). The rates of non-certification ranged from 27% for those with 20 or more years of teaching experience to 51% for those with 3 or fewer years of teaching experience (NCES, 1997, table A3.14a). 24. For example, while most private school students (52%) attend schools that are less than 10% minority, only 31% of public school students do (NCES, Digest of Education Statistics, 1999, p. 71, table 60 and p. 119, table 99). African American and Latino students are at least 50% more likely to attend public than private schools. (NCES, 1997, Table A2.13). Most low-income students and students of color now attend public schools in urban public school districts. 25. Walsh objects to a composite "education and performance" variable created by the authors, which included the amount of education coursework, student teaching grade, GPA, and science teaching experience. 26. In Walsh's original appendix, this study is further critiqued because the reviewer was not clear on the meaning of the term "out-of-field" in the study when referencing elementary school teachers. The article defined the proportion of "well-qualified teachers" as the proportion holding state certification and the equivalent of a major (either an undergraduate major or masters degree) in the field taught. For elementary teachers, the equivalent of a major was defined an elementary education degree for generalists who teach multiple subjects to the same group of students or as degree in the field taught for elementary specialists (e.g. reading, mathematics or mathematics education, special education). The study defined "out-of-field" for elementary teachers in the same way it was defined for secondary teachers: holding less than a minor or the equivalent in the fields described above (elementary education in the case of generalists or the specialist field (e.g. reading or mathematics in the case of specialists). 27. For some mysterious reason, Walsh also tries to make a point that I differentiate (wrongly in her view) between cognitive ability or IQ and verbal ability (see her footnote 14, p. 8), despite the fact that this is a standard distinction in the literature made by many of the analysts Walsh herself quotes for support of the importance of verbal ability measures. Few measurement experts would argue that IQ, as it was defined and measured in the 1940s and '50s, represents the same construct as verbal ability, as Walsh seems to be invested in proving. 28. Walsh makes a hash of the research cited here on the relationship between teacher test scores and measures of teacher effectiveness, striving to prove that studies which found largely insignificant positive and negative relationships between NTE scores and student achievement at least did not find significant negative relationships. Since there is little disagreement about the value of having teachers demonstrate their basic skills and subject matter knowledge through either coursework or testing, I do not review each of these older studies here. 29. In her separately-published appendix, Walsh states that, "In 1999, Darling-Hammond summarized the main point of this article as a call for using student achievement as the measure of teacher quality." In fact, in Darling-Hammond (1999), I cited this review for an entirely different point. I cited it for the proposition that "Teachers' abilities to structure material, ask higher order questions, use student ideas, and probe student comments have also been found to be important variables in what students learn." 30. Cited in the separately-published appendix entry 88, p.50. References Andrew, M. & Schwab, R.L. (1995). Has reform in teacher education influenced teacher performance? An outcome assessment of graduates of eleven teacher education programs. Action in Teacher Education, 17: 43-53. Andrews, J.W., Blackmon, C.R., & Mackey, J.A. (1980). Preservice performance and the National Teacher Examinations. Phi Delta Kappan, 61(5): 358-359. Ashton, P. & Crocker, L. (1987). Systematic study of planned variations: The essential focus of teacher education reform. Journal of Teacher Education, 2-8. Ayers, J.B., & Qualls, G.S. (1979). Concurrent and predictive validity of the National Teacher Examinations. Journal of Educational Research, 73 (2): 86-92. Begle, E.G. (1979). Critical variables in mathematics education: Findings from a survey of the empirical literature. Washington, DC: Mathematical Association of American and National Council of Teachers of Mathematics. Begle, E.G. & Geeslin, W. (1972). Teacher effectiveness in mathematics instruction. National Longitudinal Study of Mathematical Abilities Reports No. 28. Washington, DC: Mathematical Association of America and National Council of Teachers of Mathematics. Berliner, D.C. (1986). In pursuit of the expert pedagogue, Educational Researcher (August/September): 5-13. Betts, J.R., Rueben, K.S., Danenberg, A. (2000). Equal resources, equal outcomes? The distribution of school resources and student achievement in California. San Francisco: Public Policy Institute of California. Bliss, T. (1992). Alternative certification in Connecticut: Reshaping the profession. Peabody Journal of Education, 67(3): 35-54. Bowles, S., & Levin, H.M. (1968). The determinants of scholastic achievement- An appraisal of some recent evidence. Journal of Human Resources, 3: 3-24. Bradshaw, L. & Hawk, P. (1996). Teacher Certification: Does It Really Make a Difference in Student Achievement? Greenville, NC: Eastern North Carolina Consortium for Assistance and Research in Education. Bryk, A.S. & Lee V.E. (1992). Are politics the problems and markets the answer? An essay review of "Politics, markets and America's schools." Economics of Education Review, 11(4): 439-451. Byrne, C.J. (1983). Teacher knowledge and teacher effectiveness: A literature review, theoretical analysis and discussion of research strategy. Paper presented at the meeting of the Northwestern Educational Research Association, Ellenville, NY. Carroll, J.B. (1975). The Teaching of French as a Foreign Language in Eight Countries. New York: John Wiley and Sons. Coleman, J.S., Campbell, E.Q., Hobson, C.J., McPartland, J., Mood, A.M., Weinfeld, F.D., & York, R.L. (1966). Equality of Educational Opportunity. Washington, DC: U.S. Government Printing Office. Coleman S. J., Hoffer T., and Kilgore, S. (1982). Cognitive outcomes in public and private schools. Sociology of Education, 55 (2-3): 65-76. Darling-Hammond, L. (1992). Teaching and knowledge: Policy issues posed by alternate certification for teachers. Peabody Journal of Education, 67(3): 123-154. Darling-Hammond, L. (1997). Doing What Matters Most: Investing in Quality Teaching. NY: National Commission on Teaching and America's Future, Teachers College, Columbia University. Darling-Hammond, L., Wise, A.E., & Klein, S.P. (1999). A license to teach. San Francisco: Jossey-Bass. Darling-Hammond, L. (2000a). Reforming teacher preparation and licensing: Debating the evidence. Teachers College Record, 102, (1): 28-56. Darling-Hammond, L. (2000b). Solving the Dilemmas of Teacher Supply, Demand, and Standards: How We Can Ensure a Competent, Caring, and Qualified Teacher for Every Child. NY: National Commission on Teaching and America's Future. Darling-Hammond, L. (2000c). Teacher quality and student achievement. Education Policy Analysis Archives, 8(1): http://epaa.asu.edu/epaa/v8n1.html Darling-Hammond, L., Berry, B., & Thoreson, A. (2001). Does Teacher Certification Matter? Evaluating the Evidence. Educational Evaluation and Policy Analysis, 23(1): 57-77. Darling-Hammond, L., Hudson, L., & Kirby, S. (1989). Redesigning Teacher Education: Opening the Door for New Recruits to Science and Mathematics Teaching. Santa Monica: The RAND Corporation. Darling-Hammond, L., Wise, A.E. & Pease, S.R. (1983). Teacher evaluation in the organizational context: a review of the literature. Review of Educational Research, 53: 285-237. Denton, J.J., & Peters, W.H. (1988). Program Assessment Report: Curriculum Evaluation of a Non-traditional Program for Certifying Teachers. Texas A&M University, College Station, TX. Druva, C.A., & Anderson, R.D. (1983). Science teacher characteristics by teacher behavior and by student outcome: A meta-analysis of research. Journal of Research in Science Teaching, 20(5): 467-479. Duffy, G. & Roehler, L. (1989). The tension between information-giving and mediation: Perspectives on instructional explanation and teacher change. In J. Brophy (ed.), Advances in research on teaching, Vol. 1. Greenwich, CT: JAI. Duffy, G., Roehler, L., Sivan, E., Rackliffe, G., Book, C., Meloth, M., Vavrus, L., Wesselman, R., Putnam, J., & ; Bassiri, D. (1987). Effects of explaining reasoning associated with using reading strategies. Reading Research Quarterly, 22(3): 347-368. Evertson, C., Hawley, W., & Zlotnick, M. (1985). Making a difference in educational quality through teacher education. Journal of Teacher Education, 36 (3), 2-12. Feistritzer, C.E. (1984). The Making of a Teacher. Washington, DC: National Center for Education Information. Ferguson, R.F. (1991). Paying for public education: New evidence on how and why money matters. Harvard Journal of Legislation, 28(2): 465-498. Ferguson, R.F. & Ladd, H.F. (1996). How and why money matters: An analysis of Alabama schools. In Helen Ladd (ed.) Holding Schools Accountable, pp. 265-298. Washington, DC: Brookings Institution. Ferguson, P. & Womack, S.T. (1993). The impact of subject matter and education coursework on teaching performance. Journal of Teacher Education, 44 (1): 55-63. Fetler, M. (1999). High school staff characteristics and mathematics test results. Education Policy Analysis Archives, 7(9): http://epaa.asu.edu/epaa/v7n9.html Gitomer, D.H., Latham, A.S., & Ziomek, R. (1999). The Academic Quality of Prospective Teachers: The Impact of Admissions and Licensure Testing. Princeton, NJ: Educational Testing Service. Goe, L. (forthcoming). Legislating equity: The distribution of emergency permit teachers in California. Berkeley: Graduate School of Education, University of California, Berkeley. Goldhaber, D.D. & Brewer, D. J. (1998, October). When should we reward degrees for teachers? Phi Delta Kappan, 134-138. Goldhaber, D.D. & Brewer, D.J. (2000). Does teacher certification matter? High school certification status and student achievement. Educational Evaluation and Policy Analysis, 22: 129-145. Greenwald, R., Hedges, L.V., & Laine, R.D. (1996). The effect of school resources on student achievement. Review of Educational Research, 66: 361-396. Grey, L., Cahalan, M., Hein, S., Litman, C., Severynse, J., Warren, S., Wisan, G., & Stowe, P. (1993). New Teachers in the Job Market: 1991 Update. Washington, DC: U. S. Department of Education, Office of Educational Research and Improvement. Guyton, E. & Farokhi, E. (1987). Relationships among academic performance, basic skills, subject matter knowledge and teaching skills of teacher education graduates. Journal of Teacher Education (Sept-Oct.): 37-42. Haney, W., Madaus, G., & Kreitzer, A. (1987). Charms talismanic: testing teachers for the improvement of American education. In E.Z. Rothkopf (Ed.) Review of Research in Education, 14: 169-238. Washington, DC: American Educational Research Association. Haney, W. (2000). The myth of the Texas miracle in education. Education Policy Analysis Archives, 8 (41): http://epaa.asu.edu/epaa/v8n41/ Hanushek, E. (1971). Teacher characteristics and gains in student achievement: Estimation using micro data. The American Economic Review 61(2): 280-288. Hanushek, E. (1992). The trade-off between child quantity and quality. Journal of Political Economy, 100: 84-117. Hanushek, E.A., Rivkin, S.G., &Taylor, L.L. (1995). Aggregation bias and the estimated effects of school resources. Rochester, NY: University of Rochester, Center for Economic Research. Hanushek, E. (1996b). School Resources and achievement in Maryland. Baltimore, MD: Maryland State Department of Education. Hawk, P., Coble, C.R., & Swanson, M. (1985). Certification: It does matter. Journal of Teacher Education, 36(3): 13-15. Hellfritzch, A.G. (1945). A factor analysis of teacher abilities. Journal of Experimental Education, 14: 166-169. Henke, R., Chen, X., & Geis, S. (2000). Progress through the teacher pipeline: 1992-93 college graduates and elementary/secondary school teaching as of 1997. Washington, DC: National Center for Education Statistics, U.S. Department of Education. Ingersoll, R. (1998). The problem of out-of-field teaching. Phi Delta Kappan, (June): 773-776. Jelmberg, J. (1995). College-based teacher education versus state-sponsored alternative programs. Journal of Teacher Education, 47(1), 60-66. (Jan-Feb 1996). Laczko-Kerr, I. & Berliner, D. (2002). The effectiveness of Teach for America and other under-certified teachers on student academic achievement: A case of harmful public policy. Educational Policy Analysis Archives, 10(37). Available: http://epaa.asu.edu/epaa/v10n37/. LaDuke, D.V. (1945). The measurement of teaching ability. Journal of Experimental Education, 14: 75-100. Lanier, J. and J. Little. (1986). Research on Teacher Education. In M. Wittrock (ed.), Handbook of Research on Teaching, Third Edition. New York: Macmillan. Lee, V.E. & Byrk, A.S. (1988) Curriculum tracking as mediating the social distribution of high school achievement. Sociology of Education, 61: 78-94. Lee, V.E., Dedrick, R.F., & Smith, J.B. (1991) The effect of the social organization of schools on teachers' self efficacy and satisfaction. Sociology of Education, 64: 190-208. Levin, H. M. (1980). Teacher certification and the economics of information. Educational Evaluation and Policy Analysis, 2 (4): 5-18. Little, J.W. (1999). Organizing schools for teacher learning. In L. Darling-Hammond and G. Sykes (eds.), Teaching as the Learning Profession, pp. 233-262. San Francisco: Jossey-Bass. Lutz, F.W. & Hutton, J.B. (1989). Alternative teacher certification: Its policy implications for classroom and personnel practice. Educational Evaluation and Policy Analysis, 11(3): 237-254. Miller, J.W., McKenna, M.C., & McKenna, B.A. (1998). A comparison of alternatively and traditionally prepared teachers. Journal of Teacher Education, 49(3): 165- 176. Mitchell, N. (1987). Interim Evaluation Report of the Alternative Certification Program (REA87-027-2). Dallas, TX: DISD Department of Planning, Evaluation, and Testing. Monk, D. (1994). Subject area preparation of secondary mathematics and science teachers and student achievement. Economics of Education Review, 12(2): 125-142. Monk, D. & King, J. (1994). Multi-level teacher resource effects on pupil performance in secondary mathematics and science. In R.G. Ehrenberg (ed.), Choices and Consequences. ILR Press, Ithaca, NY. Murnane, R.J. (1985). Do Effective Teachers have Common Characteristics: Interpreting the Quantitative Research Evidence. Paper presented at the National Research Council Conference on Teacher Quality in Science and Mathematics, Washington, DC Murnane, R.J. (1983). Understanding the sources of teaching competence: Choices, skills and the limits of training. Teachers College Record, 84(3): 564-569. National Center for Education Statistics (NCES) (1985). The Condition of Education, 1985. Washington, DC. U.S. Department of Education. National Center for Education Statistics (NCES) (1997). America's Teachers: Profile of a Profession. Washington, DC: U.S. Department of Education. National Center for Education Statistics (NCES) (2000). Digest of Education Statistics, 1999. Washington, DC: U.S. Department of Education. National Commission on Teaching and America's Future (1996). What Matters Most: Teaching for America's Future. New York: Author. National Reading Panel (2000). Teaching Children to Read: An Evidence-based Assessment of the Scientific Research Literature on Reading and Its Implications for Reading Instruction. Washington, DC: National Institute of Child Health and Human Development. Natriello, G. & Zumwalt, K. (1992). Challenges to an alternative route for teacher education. In Lieberman, A. (Ed.). The 91st Yearbook of the National Society for the Study of Education, Vol. 1, pp. 59-78. Chicago: University of Chicago Press. Quirk, T.J., Witten, B.J., & Weinberg, S.F. (1973). Review of studies of concurrent and predictive validity of the National Teacher Examinations. Review of Educational Research, 43: 89-114. Palincsar, A.S. & Brown, A.L. (1984). Reciprocal teaching of comprehension-fostering and comprehension-monitoring activities. Cognition & Instruction, 1: 117-175. Raymond, M., Fletcher, S., & Luque, J. (2001). Teach for America: An Evaluation ofTteacher Differences and Student Outcomes in Houston, Texas. CREDO, The Hoover Institution, Stanford University. Available: http://www.rochester.edu/credo Rostker, L.E. (1945). The measurement of teaching ability. Journal of Experimental Education, 14: 5-51. Schalock, D. (1979). Research on teacher selection. In D.C. Berliner (ed.), Review of Research in Education (vol. 7), Washington, DC: American Educational Research Association. Shields et al., Stanford Research International (SRI) (2000). The Status of the Teaching Profession, 2000: An Update to the Teaching and California's Future Task Force. Santa Cruz, CA: The Center for the Future of Teaching and Learning. Shotel, J.R. (Summer 1998). Does NCATE Make a Difference? Quality in Teacher Education. Washington, DC: George Washington University. Skinner, W.A. (1947). An Investigation of Factors Useful in Predicting Teaching Ability. University of Manchester. Master of Education thesis. Soar, R.S., Medley, D.M., and Coker, H. (1983). Teacher evaluation: A critique of currently used methods. Phi Delta Kappan, 65(4): 239-246. Stafford, D. & Barrow, G. (1994). Houston's alternative certification program. The Educational Forum, 58: 193-200. Stoddart, Trish (1992). An alternate route to teacher certification: Preliminary findings from the Los Angeles Unified School District Intern Program. Peabody Journal of Education, 67(3). Strauss, R.P. & Sawyer, E.A. (1986). Some new evidence on teacher and student competencies. Economics of Education Review, 5(1): 41-48. Summers, A.A., & Wolfe, B.L. (1975). Which school resources help learning? Efficiency and equality in Philadelphia public schools. The American Economic Review, 67(4): 639-652. Texas Center for Educational Research (2000). The Cost of Teacher Turnover. Austin, TX: Texas State Board for Teacher Certification (SBEC). U.S. Department of Education. (2002). Meeting the highly qualified teachers challenge: The Secretary's Annual Report on Teacher Quality. Washington, DC: U.S. Department of Education, Office of Postsecondary Education, Office of Policy Planning and Innovation. Veenman, S. (1984). Perceived problems of beginning teachers. Review of Educational Research, 54: 143-178. Vernon, P.E. (1965). Personality factors in teacher trainee selection. British Journal of Education Psychology (35): 140-149. Walsh, K. (2001). Teacher certification reconsidered: Stumbling for quality. Baltimore, MD: Abell Foundation. Available: http://www.abellfoundation.org. Walsh, K. (2002, Spring). The evidence for teacher certification. Education Next, 2(1): 79-84. Wayne, A.J., & Youngs, P. (under review). Teacher characteristics and student achievement gains: A review. Review of Educational Research. Wenglinsky, H. (2000). How teaching matters: Bringing the classroom back into discussions of teacher quality. Princeton, NJ: Educational Testing Service. Wilson, S., Floden, R., & Ferrini-Mundy (2001). Teacher Preparation Research: Current Knowledge, Gaps, and Recommendations. University of Washington: Center for the Study of Teaching and Policy. Wright, David P., Michael McKibbon, & Priscilla Walton (1987). The Effectiveness of the Teacher Trainee Program: An Alternate Route into Teaching in California. California Commission on Teacher Credentialing. About the AuthorLinda Darling-HammondSchool of Education Stanford University Email: ldh@leland.stanford.edu Linda Darling-Hammond is Charles E. Ducommun Professor of Education at Stanford University and was Founding Executive Director of the National Commission on Teaching and America's Future. Her research, policy, and teaching focus on teacher education and teaching quality, school restructuring, and educational equity. Among other writings, she is author of The Right to Learn, which received the Outstanding Book Award from the American Educational Research Association in 1998. | ||||||||||||||||||||||
Copyright 2002 by the Education Policy Analysis ArchivesThe World Wide Web address for the Education Policy Analysis Archives is epaa.asu.edu General questions about appropriateness of topics or particular articles may be addressed to the Editor, Gene V Glass, glass@asu.edu or reach him at College of Education, Arizona State University, Tempe, AZ 85287-2411. The Commentary Editor is Casey D. Cobb: casey.cobb@unh.edu . EPAA Editorial Board
|