Education Week

Experts Hope Federal Funds Lead to Better Tests

By Stephen Sawchuk, August 5, 2009

No matter where teachers, state officials, and testing experts stand on the debate about school accountability, they generally agree that the United States’ current multiple-choice-dominated K-12 tests are, to use language borrowed from the No Child Left Behind Act, in need of improvement.

Now, federal officials are signaling that they expect the caliber of testing to change.

U.S. Secretary of Education Arne Duncan recently announced that he will set aside $350 million of the $4.35 billion in discretionary aid in the Race to the Top Fund to improve assessments.

Testing experts say that money could serve as a down payment for scaling up tests that would better measure students’ critical-thinking skills and improve teacher and student engagement in the assessment process. The catch, they warn, is that truly achieving that goal may force federal officials to rethink the current parameters around assessment and accountability in the NCLB law.

“Accountability testing is seen as a necessary evil to be minimized. It’s like going to the dentist. You have to do it, but it hurts,” said Randy E. Bennett, a distinguished presidential scholar at the Educational Testing Service, a nonprofit testing and research organization based in Princeton, N.J.

Sample Assessment

The proposed application requirements for the Race to the Top Fund define a “high quality” assessment as one that uses “a variety of item types, formats, and assessment conditions,” including performance-based tasks, to measure student achievement.

College- and Work-Readiness Assessment:
You advise Pat Williams, the president of DynaTech, a company that makes precision electronic instruments and navigational equipment. Sally Evans, a member of DynaTech’s sales force, recommended that DynaTech buy a small private plane (a SwiftAir 235) that she and other members of the sales force could use to visit customers. Pat was about to approve the purchase when there was an accident involving a SwiftAir 235. Your document library contains the following materials:

  • Newspaper article about the accident
  • Federal Accident Report on in-flight breakups in single-engine planes
  • Internal correspondence (Pat’s e-mail to you and Sally’s e-mail to Pat)
  • Charts relating to SwiftAir’s performance characteristics
  • Excerpt from magazine article comparing SwiftAir 235 with similar planes
  • Pictures and descriptions of SwiftAir Models 180 and 235

Sample Questions:
Do the available data tend to support or refute the claim that the type of wing on the SwiftAir 235 leads to more in-flight breakups? What is the basis for your conclusion? What other factors might have contributed to the accident and should be taken into account? What is your preliminary recommendation about whether or not DynaTech should buy the plane and what is the basis for this recommendation?

SOURCE: Council for Aid to Education

“The goal,” he said, “should be to make even the test as much of a learning experience as possible, so the student actually benefits from taking it, and teachers are given some important information for the purposes of instruction.”

Measuring Critical Thinking

The image of fill-in-the-bubble multiple-choice items has become all but inseparable from the NCLB law, which more than doubled the amount of federally mandated testing, requiring it annually in grades 3-8 and once in high school.

Multiple-choice items can efficiently discover whether a student has assembled discrete pieces of knowledge across a subject. The results are also typically highly reliable, meaning the error associated with the results is low, a desirable quality for high-stakes tests. And they are easy and cheap to score.

Such tests, though, are not ideal for identifying whether students can take multiple pieces of domain-specific knowledge and analyze, integrate, and apply them in unfamiliar contexts, Mr. Bennett said. And researchers familiar with international benchmarking argue that those critical-thinking skills are precisely the type that will be in demand as the global economy becomes increasingly knowledge-oriented.

“I think the tragedy is that things that are easy to test and teach lose relevance,” said Andreas Schleicher, the head of the indicators and analysis division for the Paris-based Organization for Economic Cooperation and Development. The OECD sponsors the Program for International Student Assessment, or PISA, which includes performance-based items.

“The feature that is central to PISA is that we’re not that interested in whether students can reproduce content knowledge,” Mr. Schleicher said, “but whether they can extrapolate what they know and apply it in novel situations.”

Performance-based tests designed to measure those abilities are common in specialized fields such as medicine, where examinees must diagnose and treat simulated patients. But the exams typically require scoring by humans, and for that reason are costlier than those that use exclusively multiple-choice questions. Their results paint a deeper picture of students’ understanding but are less mathematically reliable than those of multiple-choice tests.

Issues of both cost and reliability, testing experts say, explain why extended performance-based tasks have not penetrated K-12 assessment under the NCLB law.

Technology as Mediator?

What now seems to be an intractable choice between richer tasks and reliable data, though, could be mediated by advancements in technology that could improve access, cost, and reliability of performance-based testing, some experts argue.

And the federal funding, they say, could be the lever to support that work.

“It’s expensive to put [new item formats] into practice, and to the extent that infusion can help create not only prototypes of promising assessment but support some of the infrastructure needed to deliver them efficiently [it] will be an important legacy,” Mr. Bennett said.

Federal officials have not yet revealed the details on the funding, which will be awarded to states as part of the Race to the Top fund. But Secretary Duncan has intimated in public appearances that the funding will support assessments aligned to the common core of standards now in development.

Some standardized performance-based examples already exist, such as the College and Work Readiness Assessment, a computer-based test that is given primarily to high school freshmen and seniors in private schools.

The exam, run by the Council for Aid to Education, a New York City-based nonprofit group that works to improve access to higher education, includes a task that requires students to sift through various texts and sources of data and draw conclusions from them to support an argument.

“By and large, the real world doesn’t present itself as nice little abstract tasks with four options that you choose from,” said Richard J. Shavelson, a professor of education at Stanford University who helped design the assessment.

A typical College and Work Readiness Assessment question might present examinees with a dossier of materials relating to a child who had a roller-skating accident at school. The materials could include newspaper articles, technical reports about the skates, data about competitors’ products, sales figures, medical reports, and the number of documented accidents. Then, the student would be asked to analyze those materials and write a memo about whether the skates are truly dangerous, and to justify his or her conclusions drawing from the information.

Mr. Shavelson said he and other researchers have been investigating ways of reducing the complexity of such items for younger students.

The high costs of scoring such a complicated assessment with an almost unlimited number of answers, he added, could be mitigated by advancements in natural-language-processing software, essentially programming that proponents claim can judge written essays as accurately as human readers and reduce, though not eliminate, the need for costly human evaluation.

In addition, experts say, technology offers the ability to measure student understanding of concepts and processes involving critical thinking that have been notoriously difficult to assess using only multiple-choice items.

For the 2009 National Assessment of Educational Progress in science, officials assessed a subset of students using “interactive computer tasks.” Those items require students to engage in the entire process of scientific inquiry, in which they must participate in a simulated experiment, record data, and defend or critique a hypothesis.

One of the benefits of the computer-based tasks, said Mary Crovo, the deputy staff director of the National Assessment Governing Board, which sets policy for NAEP, is that computers can simulate tools that would be dangerous or impractical to replicate in an assessment context, or processes such as evolution that occur over long expanses of time.

The results, she added, will provide data not only on student aptitude but also on how students approached the tasks, such as whether they were able to deploy the appropriate tools and how many “test runs” they performed in their experiments.

Improving Instruction

Experts add that the infusion of federal cash could also provide more opportunities to devise tests that will better engage teachers in the cognitive science about how knowledge develops over time.

“We know that it’s not only the amount of knowledge that’s important, but the way it’s organized, and we don’t test knowledge organization at all, at least not directly,” Mr. Bennett said. “That’s a significant omission in the way we design our current assessments.”

One potential prototype for such a system is the ETS’ Cognitively Based Assessment of, for, and as Learning. The reading, writing, and mathematics tests are not made up of just one analytical, performance-based item, but incorporate the knowledge and skills that students must master to succeed in the more-complex tasks.

An assessment on fictional reading, for instance, might ask students to diagram the various structures of the plot, such as the conflict, rising action, and conclusion, before moving on to an analytical open-ended question. A nonfiction unit, in contrast, would ask students to weigh the reliability of different sources of information before asking them to integrate information across a series of related texts.

The ETS assessment also will include subunits that teachers can use in a non-high-stakes setting to help students home in on prerequisite content and skills. In Portland, Maine, where the ETS has developed and field-tested the system in collaboration with teachers in three middle schools, officials praised the level of teacher involvement in its design.

“The landmark piece of this whole project is how much teachers have helped design these assessments,” said Tom Lafavore, the district’s director of educational planning. “We are breaking down the bigger skills into smaller ones that we can check along the way.”

Purposeful Approach?

Still, assessment experts express some wariness about the new federal funding, saying it might not improve test design unless U.S. officials also consider the context in which such new assessments might be used.

If measures of higher-order, critical-thinking skills are to be part of an accountability system, for instance, federal officials will probably need to reconsider aspects of the No Child Left Behind law, they said. The law, the 2002 edition of the Elementary and Secondary Education Act, is overdue for reauthorization by Congress.

“If I told you to develop a much more energy-efficient car but you can’t change the materials, the engine, and the fuel it uses, you’re not going to get very far,” said Bill Tucker, the chief operating officer of Education Sector, a Washington-based think tank that has published on advanced testing techniques.

“It is an incredible opportunity,” he said of the federal aid, “but we could spend $350 million on the current state of the art and marginally make that better, or we could spend $350 million moving to the next generation of testing.”

Psychometricians point in particular to the constraints on testing placed by the federal law, which requires 95 percent of all students in each grade and each ethnic subgroup to be assessed. For efficiency, cost, and security reasons, each state typically conducts all its testing on the same day, in a narrow time frame.

“I think one thing that’s got to give is the idea of a short test,” said Mr. Bennett of the ETS. “You can’t cover a domain broadly, or enough of a domain deeply, if you give a short test, and you can’t give back information that’s going to be valuable to the teacher or student in terms of what to do.”

It might be possible to administer assessments in parts over the course of the year and to aggregate the results, rather than simply create longer tests, he suggested.

Another possible solution, experts say, would be to move to a system that samples student performance, rather than giving every student the same test form. Each student would take only a part of the exam, with results aggregated at a higher level.

Such a system, already used by NAEP and PISA, could keep costs down, mitigate schools’ technological limitations, and reduce overall testing times. But it has not been used for school accountability purposes, and would contravene the NCLB requirements that all students in a state take the same test, as well as complicate efforts to break out schools’ test-score results by racial or ethnic and income-level categories, among other areas.

“It’s a question of what your purpose is,” said Brian Stecher, the associate director of education at the RAND Corp., a Santa Monica, Calif.-based research and analysis group. “If you’re monitoring how well the system is performing, you don’t need a score on every kid. I think there is a way to strike a better balance.”

Ultimately, experts say, the federal agenda will likely determine how useful the new funding proves to be.

“Unless they’re very clear about the uses (accountability, instruction, evaluation), it’s very easy for this to get corrupted,” said Scott Marion, the associate director of the Dover, N.H.-based Center for Assessment, a test-consulting group. “I think you can easily waste this money if you’re not really careful about it.”

A version of this article appeared in the August 12, 2009 edition of Education Week as Experts Hope Federal Funds Lead to Better Tests
