← Back to Library

Sbsq #28b: Maye vs. Stafford and yelp vs. Google

Nate Silver doesn't just crunch numbers; he dismantles the very definitions we use to measure greatness, whether on the gridiron or in a bustling city restaurant. In a piece that feels more like a masterclass in statistical literacy than a standard Q&A, Silver challenges the conventional wisdom surrounding the NFL MVP race and the reliability of crowd-sourced data, arguing that our obsession with "headline numbers" often blinds us to the true drivers of value.

The Hidden Value of Mobility

The most provocative claim in Silver's analysis is his insistence that the NFL MVP award should go to quarterback Drake Maye over the statistically dominant Matt Stafford. While traditional metrics favor Stafford's raw passing volume, Silver argues that these stats ignore the strategic reality of modern football. "Stafford's 'headline numbers' certainly look better — 4707 yards, 46 TDs versus Maye's 4394 and 31," Silver writes, acknowledging the surface-level appeal of the Rams' quarterback. However, he quickly pivots to a more nuanced evaluation, noting that "Maye is almost a full win ahead of Stafford in QBERT (5.7 WAR versus 4.8), a clear enough lead that you wouldn't say it's within margin-of-error range."

Sbsq #28b: Maye vs. Stafford and yelp vs. Google

Silver's methodology here is distinct because it refuses to treat a sack as merely a negative play; instead, it treats it as a missed opportunity for positive yardage that a mobile quarterback can sometimes salvage. He explains that "if you're going to account for sacks, what about scramble attempts that result in positive yardage?" This reframing is crucial. By counting any pass, sack, or rushing attempt as a "QB play," Silver levels the playing field. The result is a stark contrast in efficiency: "Maye had a considerably higher completion percentage — even Tom Brady never approached 72 percent for a full season — while also averaging considerably more yards per attempt."

The argument gains further weight when Silver isolates the impact of mobility on team strategy. He points out that "Stafford just doesn't run — at all," netting only one rushing yard all season, which "really limited the Rams' strategic options in some situations." In contrast, Maye's ability to scramble forces defenses to account for the run, creating space for everyone else. "Pitching a shutout in the rushing categories is enough to outweigh Stafford's slight edge in pure passing," Silver concludes. This is a sophisticated take that moves beyond the box score, echoing the evolution seen in baseball's "Wins Above Replacement" debates, where the value of a player's defensive range or baserunning speed was once ignored until advanced metrics proved their worth. Critics might note that MVP voting has historically favored volume and traditional accolades like All-Pro selections, making Silver's case a battle against entrenched bias rather than just data. Yet, the logic holds: if the award is for the most valuable player, a quarterback who expands the offense's playbook is inherently more valuable than one who does not.

Pitching a shutout in the rushing categories is enough to outweigh Stafford's slight edge in pure passing.

The Algorithm of Taste

Shifting from the gridiron to the dinner table, Silver applies the same skeptical lens to how we judge restaurants. He tackles the common frustration of discrepancies between Google and Yelp ratings, offering a data-driven explanation for why the two platforms diverge. "Google does tend to have considerably higher star ratings than Yelp on average," Silver notes, attributing this to structural incentives rather than random noise. He cites a source suggesting that Google's design encourages businesses to solicit reviews, often leading to "5-star ratings," while Yelp's "higher friction" to publish a review—requiring both a rating and text—filters out the casual, potentially biased feedback.

Silver's analysis goes deeper, identifying a demographic bias in Yelp's user base. He observes that Yelp attracts a "more specific 'foodie-centric' demographic," which in practice skews toward "relatively well-off (stereotypically white or Asian American) urban Millennials and Gen Xers." This creates a blind spot for certain types of establishments. "Often these are places in sort of an 'cute' upper middlebrow sweet spot and overweight customer experience relative to the quality of the food," he writes, while true "holes-in-the-wall" may be rated down if their service doesn't meet the platform's specific aesthetic standards. This is a vital insight for busy readers navigating a city like New York, where the "perfect" restaurant might be a trap for the uninitiated.

Silver's advice is pragmatic: "Pay more attention to the number of reviews than the rating." He argues that popularity is a strong signal, even if it attracts more haters. "There's a phenomenon wherein as a restaurant, or anything really, becomes more popular, it will attract more haters because it appeals to a broader clientele, and people will feel more comfortable ragging on it," he explains. Furthermore, he warns against the paralysis of choice in major markets, suggesting that "the perfect can be the enemy of the good." He advises readers to look for neighborhood specialties rather than chasing the highest-rated spot in the city, noting that "spontaneous experiences can often be highly enjoyable so long as you avoid outright tourist traps." This section serves as a reminder that data is only as good as the context in which it is collected, a lesson that applies to polling, sports, and dining alike.

The Flawed Logic of Selection

The final segment of Silver's commentary turns a critical eye toward the College Football Playoff selection committee, exposing a methodology that seems to defy basic statistical principles. He describes a system that appears to have "no idea what to do with the results of conference championship games," often ignoring decisive blowouts or double-counting losses in ways that distort the final rankings. "The final rankings appear to reflect so many bad uses of data that they seem better off going back to a BCS-style algorithm," Silver argues, suggesting that a pure algorithmic approach might actually be more transparent and fair than the current human-driven process.

The core of his frustration lies in the inconsistency. "Even if the CFP is going to use a methodology for selecting teams that isn't the best use of data, shouldn't they at least use the same methodology with each ranking, rather than changing the method?" he asks. This inconsistency undermines the credibility of the entire selection process, leaving fans and analysts to guess at the criteria. Silver's critique highlights a broader issue in sports governance: the tension between the desire for human judgment and the clarity of objective metrics. While the committee may argue that they are weighing "strength of schedule" or "head-to-head" results, their inability to apply these factors consistently suggests a lack of rigor. A counterargument might be that human committees can account for intangible factors like team momentum or injury context that algorithms miss, but as Silver points out, the current execution fails to demonstrate any such nuance.

Bottom Line

Nate Silver's commentary succeeds by refusing to accept surface-level narratives, whether they are about a quarterback's passing yards or a restaurant's star rating. His strongest argument is the demonstration that mobility and context are often undervalued in traditional metrics, a lesson that applies far beyond the NFL. The piece's greatest vulnerability is its reliance on proprietary metrics like QBERT, which, while logically sound, may struggle to gain traction against the inertia of traditional voting blocs. For the busy reader, the takeaway is clear: trust the data that accounts for the full scope of performance, not just the headlines.

Pitching a shutout in the rushing categories is enough to outweigh Stafford's slight edge in pure passing.

Deep Dives

Explore these related deep dives:

  • Wins above replacement

    The article heavily discusses QBERT's WAR (Wins Above Replacement) metric for comparing quarterbacks. Understanding the statistical foundations of WAR and how it's calculated across sports provides crucial context for evaluating the Maye vs. Stafford debate.

Sources

Sbsq #28b: Maye vs. Stafford and yelp vs. Google

by Nate Silver · · Read full article

Hey subscribers, we’ve got a busy week ahead:

Later today, we’ll update QBERT and ELWAY with the results of the divisional playoffs.

Tuesday is the anniversary of Trump’s inauguration. I’ll have a Substack Live with Matt Yglesias of Slow Boring at 1 p.m. Tuesday, where we’ll discuss that and other things.

I’m also planning to revisit my 113 Trump predictions column from a year ago, probably landing late this week. Spoiler alert: I think the predictions are reasonably well-calibrated based on what we know so far. But I also think that doesn’t tell the whole story. (I have a lot of “meta” thoughts about how it’s going and whether the resistance libs were right about everything.)

Finally, our generic ballot polling average is ready to go. We just have to finish up a few elements on the charts and the presentation. This is basically the kickoff to our midterm election coverage, so we think it’s kind of a big deal! That’s also landing this week. It will also have some extra goodies attached, such as state-by-state benchmarks and historical generic ballot data.

Today, however, is a federal holiday1 and the college football national championship game, so we’re going to have some fun. I have a few leftover questions from this month’s SBSQ2 — two about football and one about food.

Why Drake Maye, not Matt Stafford, should be NFL MVP

Yelp vs. Google and Nate’s 3 tips for scouting restaurants

What’s wrong with the College Football Playoff selection committee?

SBSQs usually run behind the paywall, but we’ve been a little heavy on paid content lately, so we’re gonna run this one free. Asking questions is a benefit for paid subscribers, however! As always, we’d encourage you to submit questions for SBSQ #29 in the comments below.

Why Drake Maye, not Matt Stafford, should be NFL MVP.

FlipperAndersonFan (not a real person3) asks:

Why are you and Bill Simmons stanning so hard for Maye? My boy Matty Stafford had more TDs, more yards. And the Rams faced a much tougher schedule. Approximately 85.4362 percent of Mays’s [sic] yards were against the Jets. Aren’t you supposed to be a stats guy? Stafford was a SUPER BOWL CHAMPION the same year that Maye was the second-string quarterback in the Duke’s Mayo Bowl.

The NFL’s MVP award is almost certainly going to come down to Maye and Stafford. Both of them led their teams ...