← Back to Library

Kayfable

Alberto Romero dismantles the public narrative surrounding Anthropic's recent clash with the federal government, arguing that what looks like a regulatory crisis is actually a calculated performance designed to secure state control over artificial intelligence. He posits that the company is not merely reacting to safety concerns but actively engineering a scenario where only they are trusted to wield superintelligent systems.

The Theater of Compliance

Romero frames the recent events involving Anthropic's "Fable 5" model not as a failure of governance, but as a deliberate adoption of kayfable—a term he defines as "a public performance maintained past the point of plausibility." He traces a timeline where the company announces a powerful model, claims it is too dangerous to release fully, and then allegedly refuses to fix specific vulnerabilities even when prompted by federal officials. When the executive branch subsequently imposes export controls, effectively banning the model from foreign access, Romero suggests this was the desired outcome.

Kayfable

He writes, "Anthropic gains three things they wouldn't have had they obliged: They remain the underdog... everyone tries to prevent them from being the AI company of The People." This reframing is provocative because it inverts the standard safety narrative; instead of a rogue actor forcing the government's hand, Romero sees a company courting regulation to establish a monopoly on trust.

"They're courting Trump at court while telling us with their actions how mistreated they are."

Critics might argue that attributing such intricate strategic foresight to a startup ignores the genuine chaos of developing frontier models. However, Romero's analysis gains weight when considering historical precedents like the Export Administration Regulations, where dual-use technologies have long been weaponized for geopolitical leverage. By comparing their safety stance to "giving nukes to North Korea," Romero suggests Anthropic is intentionally escalating the stakes to ensure the government views them as indispensable partners rather than mere vendors.

The Architecture of Control

The core of Romero's argument rests on the assertion that Anthropic has never intended to democratize access to advanced AI. He contends that their "safety" rhetoric is actually a proxy for exclusivity. As he puts it, "Anthropic's ultimate goal is not to build AGI, but to own it. To control it."

Romero observes that the company spends more time debating philosophy and safety protocols than competing on raw benchmarks, creating an environment where talent clusters around their specific ideological framework rather than just technical prowess. He notes, "Talent doesn't leave Anthropic. It is the most powerful attractor, like a black hole." This concentration of human capital allows them to shape the narrative that AI is too dangerous for the public, a stance he describes as "an 'AI is unsafe in your unsafe hands' stance."

"If you thought this is still just marketing, you have understood nothing, and your lack of imagination baffles me more than Anthropic's behavior of late."

This perspective challenges the common belief that commercial competition will naturally lead to open access. Romero suggests a model collapse scenario where the industry narrows into an oligopoly if the government intervenes as a primary customer or regulator. He warns that once the industry transforms into a "national AI initiative akin to the Manhattan Project," companies like Anthropic will no longer need consumer revenue, allowing them to sever public access entirely.

The Cult of Conviction

Perhaps the most striking element of Romero's commentary is his characterization of Anthropic's leadership as a religious order rather than a tech firm. He argues that their belief in the imminent arrival of superintelligence is so profound that it dictates a "cultist" approach to governance. "When the Anthropic guys are called a religious cult, I never take it as a derogatory insult," he writes, suggesting that for those who truly believe they are approaching "the end of time," standard business rules do not apply.

He contrasts this with other tech giants, labeling them "fearful atheists" who pay lip service to safety without genuine conviction. In Romero's view, Anthropic is the only entity willing to make the hard choice: "It won't be available even in closed form... 'It's not a toy,' they will say, 'and you're just a bunch of kids.'"

"No amount of shameless kayfable is too much to ensure the world steers itself their way."

This section forces readers to confront the possibility that the push for regulation is not about preventing harm, but about preventing access. While some may find this conspiratorial, it aligns with the growing tension between private innovation and public oversight seen in recent kleptocracy studies regarding how elite networks capture state power. Romero's fear is that we are witnessing the final consolidation of control before the technology becomes too powerful to share.

Bottom Line

Romero's most compelling insight is the inversion of the safety narrative: he convincingly argues that Anthropic's resistance to quick fixes and their embrace of regulatory hurdles are strategic moves to position themselves as the sole gatekeepers of AGI. The argument's vulnerability lies in its assumption of perfect coordination, potentially underestimating the genuine unpredictability of AI development. Readers should watch closely for whether the administration accepts this "safety" framework or if it pivots toward a more open, competitive market model.

Deep Dives

Explore these related deep dives:

  • The Age of Surveillance Capitalism Amazon · Better World Books by Shoshana Zuboff

    How tech companies turned human experience into raw material for prediction and control.

  • Model collapse

    The article hints at the technical fragility of AI systems where fixing one vulnerability breaks another; this concept details how recursive training on synthetic data or aggressive guardrails can degrade model intelligence, illustrating why Anthropic might view patching as a losing battle.

  • Kleptocracy

    While the text mentions China and supply chain risks, understanding kleptocracy provides the specific geopolitical lens for why US regulators fear foreign state actors accessing dual-use technology to bypass domestic sanctions or steal intellectual property.

Sources

Kayfable

Hey there, I’m Alberto! Each week, I publish long-form AI analysis covering culture, philosophy, and business for The Algorithmic Bridge. Paid subscribers also get Monday how-to guides and Friday news commentary. I publish occasional extra articles. If you’d like to become a paid subscriber, here’s a button for that:

Interrupting the regular schedule because the Anthropic-Fable-US government situation demands it. A recounting of the facts as known, plus my analysis of what's really going on. Monday's practical guide moves to tomorrow (Tuesday).

kayfable noun

kay·fa·ble | \ ˈkā-ˌfā-bəl \

1: a fictitious story whose lesson depends on everyone pretending it is real

2: a public performance maintained past the point of plausibility

3: whatever shit show is going on between Anthropic and the US government

So let’s start with the facts as reported by the press.

April 7th. Anthropic announces Mythos Preview, a model with unprecedented coding and agentic capabilities. They don’t release it, alleging serious risk.

June 9th. Anthropic releases Fable 5, a Mythos-class model and the best in the world. It’s capped by guardrails; prompts related to cybersecurity, biology, distillation, and AI research are routed to an inferior model. People are awed and angry in equal measure.

June 11th. Amazon CEO contacts the US government (USG) with info on a jailbreak that reveals Fable 5 could be really dangerous. The USG reportedly presses Anthropic to fix or shut down the model, over concerns that China could access the model (Anthropic denies the USG ever said anything about China). Anthropic refuses, allegedly stating that the presumed danger is a known issue and has been exaggerated. Surprised by Anthropic’s reluctance, the USG puts Fable under the export control directive, forcing Anthropic to stop serving Fable to any foreign national, including people working at Anthropic.

June 12th. Anthropic retires Fable for all customers “to ensure compliance.”

June 14th. Anthropic staffers fly to Washington to fix the situation.

June 15th. They’ve met with the Trump administration.

At the time of publishing, nothing else has happened.

Among the long chain of implausible events that have surrounded Anthropic in 2026—including but not limited to a wild revenue surge, OpenAI *sorpasso,* and **repeated clashes with the USG—there’s one question that stands out above every other: why not just obey?

As the poster child AI safety company, why not try to solve the supposed jailbreak? Why not concede that the problem—even if it’s a negligible issue ...