Education Week

Panel Finds Few Learning Benefits in High-Stakes Exams

By Sarah D. Sparks, June 07, 2011

As Congress debates how to structure the next iteration of federal school accountability, a new national study has raised serious concerns about the effectiveness of test-based incentives to improve education.

A blue-ribbon committee of the National Academies’ National Research Council undertook a nearly decade-long study of test-based incentive systems, including the “adequate yearly progress” measures under the No Child Left Behind Act, high school exit exams, teacher merit-pay programs, and other testing-and-accountability initiatives. While the panel says it supports evaluating education systems and holding them accountable, on the whole it found the approaches implemented so far have had little or no effect on actual student learning, and in some cases have run counter to their intended purposes.

The results are likely to add fuel to ongoing debates across the country over how to fairly evaluate schools and teachers for student progress and whether to tie consequences for students and teachers to results from current forms of testing.

The study, released May 26, drew a mix of reactions.

“It’s an antidote to what has been the accepted wisdom in this country, the belief that performance-based accountability and incentive systems are the answer to improving education,” said Jon Baron, the president of the Washington-based Coalition for Evidence-Based Policy and the chairman of the National Board for Education Sciences, which advises the U.S. Department of Education’s research arm. “That was basically accepted without evidence or support in NCLB and other government and private-sector efforts to increase performance,” he said.

Eric A. Hanushek, an economics professor at Stanford University, said he was “stunned at how broad” the findings were. But he warned against using the committee’s critique of test-based incentives to throw out accountability systems in education altogether.

“Some form of accountability is undoubtedly useful, but you have to be careful with how you structure accountability systems,” Mr. Hanushek said. “What we’ve done to date hasn’t been perfect; there are lots of obvious flaws in either results or program structure to date. As we go into the future, we should learn from our results.”

Jim Bradshaw, a spokesman for the Education Department, said in an email: “This report confirms what we already know: the accountability system in No Child Left Behind is broken and needs fixing this year. We need better assessments, college- and career-ready standards, and a more fair, focused, and flexible accountability system because children only get one shot at a world-class education.”

Preventing Gaming

One critical flaw the study focused on was that test-based systems often use the same tests to gauge student progress and evaluate the system as a whole, with insufficient safeguards and monitoring to prevent educators or students from gaming the system to produce high scores disconnected from learning.

Committee on Incentives and Test-Based Accountability

Michael Hout (Chair)*
Sociology Chairman
University of California; Berkeley

Dan Ariely
Professor of Psychology and Behavioral Economics
Duke University; Durham, N.C.

George P. Baker III
Professor of Business Administration
Harvard Business School; Boston

Henry Braun
Professor of Education and Public Policy; Director of the Center for the Study of Testing, Evaluation, and Educational Policy
Boston College; Chestnut Hill, Mass.

Anthony S. Bryk (until 2008)
President
Carnegie Foundation for the Advancement of Teaching; Stanford, Calif.

Edward L. Deci
Professor of Psychology and Social Sciences; Director of the Human Motivation Program
University of Rochester; Rochester, N.Y.

Christopher Edley Jr.
Professor and Dean of Law
University of California; Berkeley

Geno J. Flores
Former Chief Deputy Superintendent of Public Instruction
California Department of Education

Carolyn J. Heinrich
Professor and Director of Public Affairs; Affiliated Professor of Economics
University of Wisconsin-Madison

Paul T. Hill
Research Professor; Director of the Center on Reinventing Public Education
University of Washington Bothell

Thomas J. Kane**
Professor of Education and Economics; Director of the Center for Education Policy Research
Harvard University; Cambridge, Mass.

Daniel M. Koretz
Professor of Education
Harvard University; Cambridge, Mass.

Kevin Lang
Professor of Economics
Boston University; Boston

Susanna Loeb
Professor of Education
Stanford University; Stanford, Calif.

Michael Lovaglia
Professor of Sociology; Director of the Center for the Study of Group Processes
University of Iowa; Iowa City

Lorrie A. Shepard
Dean and Professor of Education
University of Colorado at Boulder

Brian M. Stecher
Associate Director for Education
Rand Corp.; Santa Monica, Calif.

* Member, National Academy of Sciences
** Was not able to participate in the final committee deliberations due to a scheduling conflict.

SOURCE: National Academies

“Too often it’s taken for granted that the test being used for the incentive is itself the marker of progress, and what we’re trying to say here is you need an independent assessment of progress,” said Michael Hout, the sociology chairman at the University of California, Berkeley, and the chairman of the 17-member committee.

The panel, a who’s who of national experts in education law, economics, and social sciences, was launched in 2002 by the National Academies, a private, nonprofit quartet of institutions chartered by Congress to provide policy advice on science, technology, and health. Since its formation, the committee has been tracking the implementation and effectiveness of 15 test-based incentive programs, including:

• National school improvement programs under the No Child Left Behind Act and prior iterations of the Elementary and Secondary Education Act;

• Test-based systems of teacher incentive pay in Texas, Chicago, Nashville, Tenn., and elsewhere;

• High school exit exams such as those required by 28 states;

• Pay-for-scores programs for students in New York City and Coshocton, Ohio; and

• Experiments in teacher incentive pay in India and student and teacher test incentives in Israel and Kenya.

On the whole, the panel found the accountability programs often used assessments too narrow to accurately measure progress on program goals and used rewards or sanctions not directly tied to the people whose behavior the programs sought to change. Moreover, the programs often had inadequate checks in place to prevent manipulation of the system.

“It’s not that there’s no information in the objective performance measures, but they are imperfect, and including the subjective performance measures is also very important,” said Kevin Lang, an economics professor at Boston University. “Incentives can be powerful, but not necessarily in the way you would like them to be.”

As a result, educators facing accountability sanctions tend to focus on actions that improve test scores, such as teaching test-taking strategies or drilling students closest to meeting proficiency cutoffs, rather than improving learning. Such a response undercuts the tests’ validity, the report says.

As an example, the report points to New York’s requirement that all high school seniors pass the state regents’ exam before graduating from high school. The policy led to more students passing the tests, but scores on the lower-stakes National Assessment of Educational Progress, which was testing the same subjects, didn’t budge during the same time period.

“It’s human nature: Give me a number, I’ll hit it,” Mr. Hout said. “Consequently, something that was a really good indicator before there were incentives on it ... becomes useless because people are messing with it.”

In fact, the study found that, rather than leading to higher academic achievement, high school exit exams so far have decreased graduation rates nationwide by an average of 2 percentage points.

The study found a growing heap of evidence that schools and districts have tinkered with how and when students take exit exams as well as other high-stakes tests in order to boost scores on paper for students who do not know the material, or to prevent those students from taking the tests at all.

AYP and Academics

For similar reasons, school-based accountability mechanisms under the NCLB law have generated minimal improvement in academic learning, the study concludes. When the systems are evaluated not with the high-stakes tests subject to inflation but with outside tests, such as NAEP, student-achievement gains dwindle to about .08 of a standard deviation on average, mostly clustered in elementary-grade mathematics.

For perspective, an intervention considered to have a small effect size is usually about 0.1 of a standard deviation; a 2010 federal study of reading-comprehension programs found a moderately successful program had an effect size of .22 of a standard deviation.

Moreover, “as disappointing as a .08 standard deviation might be, that’s bigger than any effect we saw for incentives on individual students,” Mr. Hout said, noting that NCLB accountability measures school performance, not that of individual students.

Mr. Baron of the Coalition for Evidence-Based Policy said he was impressed by the quality of the panel’s research review, but unsurprised at the minimal results for various incentive programs.

Incorporating diverse types of studies, as the panel did, typically reduces the overall effects found for them, he noted.

“One of the contributions that this makes,” he said of the study, “is that it shows that looking across all these different studies with different methodologies and populations, some in different countries, there are very minimal effects in many cases, and in a few cases larger effects. It makes the argument that details matter.”

Committee members see hopeful signs in the 2008 federal requirement that state NAEP scores be used as an outside check on achievement results reported by districts and states, as well as the broader political push to incorporate more diverse measures of student achievement in the version of the ESEA that will revise the No Child Left Behind edition.

“It’s a message to all of us to slow down and think this through,” Jack Jennings, the president of the Center on Education Policy, in Washington, said of the findings. “We put all this weight on these tests that just weren’t designed for these things.”

He said the study is likely to focus lawmakers’ attention on the nearly $400 million Race to the Top assessment grants, in which state consortia are developing testing systems to go along with the new common-core state standards. “There’s a lot riding on how these consortia do,” Mr. Jennings said.

A version of this article appeared in the June 08, 2011 edition of Education Week as Panel Finds Few Learning Benefits in High-Stakes Exams
