Natalie Wexler challenges a fundamental assumption in American education: that reading is merely a set of transferable skills like finding the main idea, rather than a product of specific knowledge. Her analysis of a quietly abandoned experiment in Louisiana offers a rare, data-backed glimpse into what happens when tests finally measure what students actually know, rather than their ability to guess at unfamiliar topics.
The Knowledge Gap
Wexler begins by dismantling the standard model of reading assessment. She notes that while states claim to test what is taught, the reality is that standardized tests often rely on passages about obscure topics such as rugby, which assume background knowledge that many students, particularly those from less educated families, simply do not possess. "To be able to make an inference, for example, you need a certain threshold of relevant background knowledge," Wexler explains, highlighting how the current system penalizes students for gaps in their world knowledge rather than their reading ability.
This framing is crucial because it shifts the blame from the student's lack of skill to the test's lack of context. The author argues that the traditional approach creates a "superficial" instructional environment where teachers drill isolated skills instead of building robust understanding. As she puts it, "The problem, as then-state superintendent of Louisiana John White explained... is that those skills don't transfer from one context to another." This observation lands with particular force for busy professionals who understand that expertise in any field comes from deep domain knowledge, not just generic problem-solving techniques.
The Louisiana Experiment
The core of Wexler's piece is the story of Louisiana's "Innovative Assessment Pilot," a bold attempt to align testing with a content-rich curriculum. Instead of generic reading passages, the pilot tested students on specific units of history and literature they had just studied, including "warm" reads that connected thematically to the core text. The results were striking. A white paper from NWEA found that students felt less anxious and more engaged. More importantly, the achievement gap between economically disadvantaged students and their more affluent peers was "significantly smaller than on the LEAP."
Wexler writes, "The new test design may be 'leveling the playing field' by providing students a more equitable opportunity to show what they know." This is the piece's most compelling evidence: when the test rewards knowledge acquisition, it stops being a gatekeeper for the privileged and starts being a tool for equity. The experiment also transformed teacher behavior. Educators stopped wasting time on generic test prep and began focusing on the actual content. "We don't do that anymore," one teacher told White. "We devote our time to diving into the unit and making sure that students have a strong understanding, as much background knowledge as we can possibly give them."
Critics might note that the pilot's success relied heavily on the fact that 80% of Louisiana schools used the same curriculum, a level of standardization that is nearly impossible to replicate in other states with fragmented systems. However, Wexler acknowledges this limitation while arguing that the principle remains valid: if the test changes, the teaching changes.
"What gets tested gets taught," according to a timeworn but clearly evidence-based adage. If we continue to test illusory skills, that's what teachers will continue to focus on, to the continued detriment of many students.
The Political and Practical Hurdles
Despite the promising data, the experiment was quietly discontinued in 2024. Wexler attributes this to a mix of administrative turnover, the disruption of the pandemic, and the high cost of implementation. She points out a deeper structural issue: the disconnect between curriculum experts and the psychometricians who design tests. The latter prioritize statistical reliability over educational substance. "There's not great curiosity, in that technocratic worldview, of getting under the messy hood of, well, was it worth it?" Wexler quotes White asking. "Did kids learn the content that people need to be productive?"
The author suggests that the federal government could help by funding research into new assessment models, noting a potentially surprising political dynamic. She observes that the current Republican administration might be more open to such experimentation than previous Democratic ones, which often viewed testing reforms as threats to civil rights protections. This nuance is vital; it suggests that the path forward may not be ideological but rather pragmatic, requiring a coalition that values outcomes over process.
Bottom Line
Wexler's strongest argument is that the current testing regime is not just ineffective but actively harmful, forcing teachers to ignore the very content that would help disadvantaged students catch up. The piece's vulnerability lies in the immense logistical and political difficulty of scaling a content-based test across a fragmented national system. The question to watch is whether any state other than Louisiana is willing to take the risk of aligning its assessments with what students actually learn, rather than with what they can guess.