← Back to Library

How we calculate our pele ratings

Most soccer fans treat team rankings as a static scoreboard of past glory, but Nate Silver argues that true predictive power requires a living model that accounts for economics, geography, and the shifting tides of player age. This piece is notable not just for its technical depth, but for its radical claim: that a team's future performance is often better predicted by its GDP and the market value of its youth than by its last match result. For the busy reader, this reframes the entire 2026 World Cup not as a tournament of luck, but as a collision of long-term structural forces.

The Architecture of Prediction

Silver opens by dismantling the traditional view of team strength. "PELE stands for Predictive Elo with Lineup Equilibria," he writes, immediately signaling that this system prioritizes future probability over historical prestige. He contrasts this sharply with standard FIFA rankings, noting, "we're not interested in which teams are most 'deserving' of a particular slot. Rather, we're looking for factors that have a predictive impact." This distinction is crucial; it shifts the focus from moral judgment to statistical utility.

How we calculate our pele ratings

The model's core innovation lies in its "Lineup" component, which anchors ratings to real-world data rather than just match outcomes. Silver explains, "For years since 2005... team ratings are gradually 'nudged' toward team aptitude as estimated by these player values." This approach acknowledges that a team's potential is physically embodied in its roster. By integrating age data, the system anticipates trajectories: "younger teams are expected to improve, while older teams are expected to decline." This dynamic adjustment is what separates PELE from static historical lists.

However, this reliance on market values introduces a potential blind spot. Critics might note that Transfermarkt valuations can be speculative bubbles, potentially overvaluing young stars in wealthy leagues while undervaluing gritty veterans in less commercialized regions. Silver's model assumes the market is efficient, a premise that doesn't always hold in the chaotic world of international football.

The Geography of Home Advantage

Perhaps the most compelling section of Silver's analysis is his granular treatment of home-field advantage (HFA). He rejects the idea that playing at home is a uniform bonus, arguing instead that "HFA varies over time" and is heavily influenced by logistics. He points out that "traveling from Brussels to Amsterdam is less burdensome than flying from South Korea to Brazil," a nuance often ignored by simpler models.

The treatment of altitude is particularly striking. Silver notes, "As you may have experienced yourself, engaging in intensive physical activity at 10,000 feet is more than twice as hard as at 5000 feet." This leads to a nonlinear formula where teams like Bolivia gain a massive, physics-based edge. He further refines this by customizing HFA for each nation, observing that "teams in far-flung and war-torn places tend to have larger HFAs, while richer nations in Europe and the Middle East tend to have smaller ones." This economic and geopolitical layering adds a depth of realism that pure match-result models miss.

The impact of home-field advantage is underrated by other systems, yet it is often the difference between a draw and a victory.

Silver also tackles the weight of match importance, rejecting the binary of "friendly" versus "serious." He argues that "low-impact matches, such as friendlies, tend to predict performance in future low-impact matches," while high-stakes games predict high-stakes outcomes. By applying a multiplier system—where World Cup matches count 1.6x and friendlies only 0.5x to 0.7x—he creates a more accurate signal-to-noise ratio. This is a sophisticated move that respects the context of every single game played.

Historical Continuity and Discontinuity

The article's scope is staggering, covering nearly 50,000 matches since 1872. Silver faces the difficult task of defining what constitutes a "team" over a century of shifting borders. He draws a parallel to hockey history, noting that his system recognizes an "Original Ten" of pre-FIFA nations, "analogous to the NHL's 'Original Six'." This historical framing helps ground the data in a recognizable lineage.

However, the decisions on political continuity are where the model's philosophy shines. Silver writes, "FIFA regards West Germany as having inherited pre-WW2 Germany's football legacy... Our definitions are stricter and treat major changes in national boundaries as discontinuous." He treats the breakup of the Soviet Union and Yugoslavia as resets, yet tolerates minor splits like "Timor-Leste splitting from Indonesia." This approach prioritizes the economic and geographic reality of a nation over bureaucratic continuity. It's a bold choice that acknowledges that a team representing a dissolved state is fundamentally different from the state that replaced it.

Critics might argue that treating every border change as a discontinuity ignores the cultural continuity of footballing traditions. A team like Czechoslovakia had a distinct identity that didn't vanish simply because the political map changed. Silver's strict adherence to geography risks losing some of the soul of the sport in favor of data purity.

The Economics of the Starting Line

Finally, Silver addresses the starting conditions of the model. In traditional Elo systems, everyone starts equal. Silver rejects this, arguing that "which teams are strong at football is predictable" based on first principles. He introduces a "GDP prior," where a country's purchasing power parity and population size set the initial baseline. "We use aggregate GDP, not GDP per capita, so both population size and living standards per citizen matter," he explains. This ensures that a nation like Argentina doesn't start the simulation at the same level as American Samoa, reflecting the reality that football success is inextricably linked to the resources available to the nation.

This economic determinism is the model's most controversial yet defensible pillar. It suggests that the gap between the Global North and South in football is not just about talent, but about the structural capacity to develop it. As Silver puts it, "Even if you'd never seen a soccer game, you'd know that Argentina is much larger, has a much longer football legacy, and comes from a region where football plays a much more prominent role in the culture."

Bottom Line

Silver's PELE system is a triumph of contextual modeling, successfully weaving together economics, geography, and roster dynamics to create a forecast that feels alive. Its greatest strength is the refusal to treat football as an isolated sport, instead viewing it as a reflection of the nations that play it. The model's biggest vulnerability remains its reliance on market values and GDP as proxies for talent, which may struggle to account for the unpredictable magic of a team that punches above its weight. For the 2026 World Cup, this is the most rigorous lens through which to view the coming chaos.

Deep Dives

Explore these related deep dives:

  • The Expected Goals Philosophy Amazon · Better World Books by James Tippett

  • Elo rating system

    The article explains how PELE adapts the chess-derived Elo formula for soccer by introducing zero-sum updates and predictive adjustments rather than descriptive rankings.

  • Original Six

    This term is used to contrast the limited historical scope of traditional soccer rankings with PELE's unique ability to backdate ratings continuously to the first international match in 1872.

  • Timor-Leste independence

    The article likely cites Timor-Leste as a specific case study where a nation's recent political formation and lack of historical match data required the model to rely heavily on non-match factors like GDP and regional legacy.

Sources

How we calculate our pele ratings

by Nate Silver · · Read full article

PELE is Silver Bulletin’s rating system for international soccer teams. Each team gets two principal ratings: a PELE rating describing its overall skill level and a Tilt rating indicating its propensity toward attacking or defensive play. Based on these ratings, we can evaluate past match results and forecast future matches. PELE ratings are updated continuously and backdated to 1872 (!).

PELE is also the backbone of our 2026 World Cup forecasts, set to be published later. We’ll update this article with any World Cup-specific adjustments once they’re ready.

We’re extremely proud of PELE, but it was a lot of work, and it’s not our simplest model. This article describes the system in detail.

The basics of PELE.

PELE stands for Predictive Elo with Lineup Equilibria. We know it’s a little bit nerdy, but this backronym captures most of the essential features of the system:

Predictive means that the goal of PELE is to probabilistically forecast the outcome of future soccer games. These aren’t the FIFA rankings: we’re not interested in which teams are most “deserving” of a particular slot. Rather, we’re looking for factors that have a predictive impact. International football teams play relatively few important games, and some of the most predictive indicators don’t derive from match results alone.

Elo means that PELE shares many properties with an Elo rating system — and indeed, PELE ratings are designed to be comparable to Elo ratings such as the FIFA rankings or the World Football Elo Ratings.1 As with other Elo ratings systems, PELE ratings are updated iteratively at the end of each match, and updates are zero-sum. (If Brazil beats Bolivia 4-1, whatever gain Brazil makes in its PELE rating is offset by a loss of points for Bolivia.) However, PELE deviates from traditional Elo ratings in other important respects, as we’ll describe below.

Lineup means we use player market values and age data from Transfermarkt to help anchor PELE ratings. We look at the market values for the top 23 nationals2 with their respective club teams, with some soft positional constraints. For years since 2005 (when Transfermarkt’s coverage begins3) team ratings are gradually “nudged” toward team aptitude as estimated by these player values. Player ages, weighted by market value, also affect the system — younger teams are expected to improve, while older teams are expected to decline. PELE also uses market values to help calculate whether a team’s strengths ...