In a market obsessed with raw speed and massive memory bandwidth, Babbage makes a counter-intuitive claim: the most critical architecture for the next decade of AI might be the one quietly running in your pocket. While the industry fixates on high-cost, high-power server chips, Babbage argues that Qualcomm's decades-long investment in power-efficient mobile processing could finally solve the energy bottleneck choking data centers. This is not just a product launch analysis; it is a historical deep dive suggesting that the future of artificial intelligence may depend on scaling down rather than scaling up.
The Power Efficiency Pivot
Babbage opens by dissecting the recent announcement of the AI200 and AI250 servers, noting that the market's initial reaction was a mix of skepticism and a surprising $20 billion jump in Qualcomm's valuation. The author points out that while competitors rely on expensive and scarce High Bandwidth Memory (HBM), Qualcomm is betting on Low Power Double Data Rate (LPDDR) memory. "Qualcomm AI250 introduces an innovative memory architecture, offering a generational leap in effective memory bandwidth and efficiency for AI workloads," Babbage writes, highlighting the strategic shift toward cost and power management over pure peak performance.
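To make that trade-off concrete, here is a rough back-of-envelope sketch in Python. Every figure in it is a hypothetical placeholder rather than a vendor specification, but it shows the kind of capacity-per-dollar versus bandwidth-per-watt arithmetic on which the LPDDR-versus-HBM bet turns.

```python
# Illustrative comparison of the memory trade-off described above.
# ALL numbers are hypothetical placeholders, not vendor specs --
# substitute real figures to reproduce the analysis.

memory_options = {
    # name: capacity (GB), peak bandwidth (GB/s), cost (USD), power (W)
    "HBM (stacked)":   dict(capacity_gb=96,  bandwidth_gbs=3200, cost_usd=2400, power_w=30),
    "LPDDR (package)": dict(capacity_gb=768, bandwidth_gbs=800,  cost_usd=1600, power_w=20),
}

for name, m in memory_options.items():
    print(f"{name:>16}: "
          f"{m['capacity_gb'] / m['cost_usd']:.2f} GB per dollar, "
          f"{m['bandwidth_gbs'] / m['power_w']:.0f} GB/s per watt")
```

On placeholder numbers like these, HBM wins decisively on bandwidth per watt while LPDDR wins on capacity per dollar, which is precisely the trade Babbage suggests Qualcomm is making: accept lower peak bandwidth in exchange for far more, far cheaper memory per accelerator card.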
This framing is compelling because it addresses the industry's growing anxiety about energy consumption. As Babbage notes, "Power draw is now clearly a key constraint on the growth of AI datacenter." The argument suggests that the efficiency principles honed over twenty years in smartphones could be the secret weapon for servers. However, critics might note that the gap between mobile inference and massive data center training is vast; what works for a phone battery may not scale to a 200-megawatt facility without significant architectural changes.
"Qualcomm's Hexagon might just be the most important architecture that no-one is talking about."
The author draws a parallel to the rise of Arm processors in laptops, suggesting a similar trajectory for NPUs (Neural Processing Units). Babbage explains that while the industry often dismisses mobile chips as too weak for heavy lifting, the sheer volume of deployment offers a unique advantage. "The overwhelming majority of Hexagon NPUs, though, have shipped in smartphones using Snapdragon SoCs or in Qualcomm's cellular modems used by Apple amongst others. In fact Qualcomm has shipped billions of Hexagon cores over the last two decades. Hexagon NPUs are, quite literally, almost everywhere!" This ubiquity provides a mature, tested foundation that new entrants lack, even if the skepticism regarding previous server attempts remains a valid concern.
From Space Signals to Smartphone Chips
To understand the potential of this architecture, Babbage takes the reader on a historical tour, tracing the lineage back to the 1970s and the Linkabit Microprocessor (LMP). The narrative reveals that the roots of this technology lie in decoding signals from outer space, not in consumer electronics. "In 1973, only two years after Intel released the world's first microprocessor, the Intel 4004, Gilhousen began writing the microcode describing the operations of the LMP," Babbage writes, detailing how early designs were built for military command platforms like the Boeing EC-135.
This historical context is crucial. It reframes the Hexagon architecture not as a derivative mobile chip, but as a specialized tool born from the rigorous demands of telecommunications and signal processing. The author notes that Qualcomm built its first in-house DSP (Digital Signal Processor) around 1988-1989 specifically because off-the-shelf options from Texas Instruments and Motorola "weren't up to the job." This origin story explains why the architecture is so adept at handling the complex, real-time data streams required for modern AI inference.
The evolution from DSP to NPU is presented as a natural progression. Babbage observes that "Qualcomm's DSPs have changed so much that Qualcomm isn't calling them DSPs any more: they've evolved into a higher form of life in the shape of the AI-focused 'Neural Processing Unit' or NPU although it's important to note that they still perform 'DSP type' roles." This continuity suggests that the new AI accelerators are not untested experiments but the culmination of a specific engineering philosophy focused on specialized, efficient computation.
"The nature of that compute, and the power constraints inherent in cellular designs, means that a general purpose CPU architecture such as Arm or x86 isn't up to the task."
Babbage also addresses the elephant in the room: Qualcomm's history of failed server attempts. The author candidly admits that the company "attempted entry into the Arm server market twice before only to abandon its efforts," citing the Centriq project and an effort that "didn't even ship a single chip before giving up!" Yet the argument pivots to the idea that the current landscape is different. With the acquisition of Nuvia and the subsequent court victory over Arm, the company now has the freedom to deploy high-performance server CPU cores without legal encumbrance. "The original Nuvia designs themselves were high performance server CPU cores which might come in handy in 'AI servers'," Babbage suggests, implying that the combination of Nuvia's CPU power with Hexagon's efficiency could finally break the deadlock.
The Bottom Line
Babbage's strongest argument lies in reframing the AI hardware race: the winner may not be the one with the fastest chip, but the one that can deliver the most inference per watt at the lowest cost. The historical depth provided, from 1970s military comms to modern smartphone NPUs, gives weight to the claim that Hexagon is a mature, battle-tested architecture ready for the data center. However, the biggest vulnerability remains execution: the gap between shipping billions of mobile chips and successfully deploying a 200-megawatt data center in Saudi Arabia is a chasm of logistics and scale, and skeptics are right to question whether Qualcomm can cross it. The world is watching to see if efficiency can indeed trump raw power in the AI era.