Aligning Anthropic

Rohit Krishnan cuts through the noise of a chaotic week in artificial intelligence to reveal a far more unsettling truth: the conflict isn't about who is the "good guy," but about the fundamental impossibility of a private company acting as the moral arbiter for the state's most lethal tools. While the media fixates on personality clashes between tech CEOs and the Department of War, Krishnan argues that the real story is the collision of two incompatible operating systems—one based on contractual red lines, the other on the absolute necessity of operational control.

The Illusion of the Red Line

Krishnan observes that the recent fallout between Anthropic and the Department of War was less a policy disagreement and more a breakdown in expectations regarding who holds the keys. "Anthropic said no we won't budge, DoW got angry, and threatened to cut them off and declare them a supply chain risk," he writes. This escalation highlights a critical friction point: the government views the technology as a sovereign asset, while the company views it as a product with ethical constraints.

The author suggests that the dispute centered on vague concepts like "mass surveillance" and "autonomous weapons," which are difficult to enforce in a live combat scenario. "You ask a dozen people, as Zvi did, you get a dozen different responses," Krishnan notes regarding the ambiguity of these terms. This lack of specificity creates a dangerous vacuum where legal loopholes can be exploited. Critics might note that the Department of War's insistence on "all lawful use" is a standard legal defense, but Krishnan rightly points out that legality does not always equate to ethical acceptability or safety.

The core of the argument rests on the distinction between two models of deployment. One model relies on trust and upfront contracts, while the other relies on active, real-time oversight. "One has more contractual protections and limited operational visibility, the other has lower contractual protections and higher operational visibility," he explains. This distinction is crucial because it determines who actually controls the outcome when a missile is heading toward a target.

"You cannot call your technology a major national security risk in dire need of regulation and then not think the DoD would want unfettered access to it."

The Sovereignty of the Machine

Krishnan reframes the narrative from a corporate spat to a constitutional crisis of sorts, where the executive branch refuses to be bound by the terms of service of a private vendor. He draws a parallel to historical precedents where the government asserted dominance over critical infrastructure, noting that "the US has nationalised or regulated whole industries for simpler reasons. Telephone lines, rails, steel mill attempted seizure, these aren't small things." This historical context suggests that the Department of War's reaction was not an anomaly but a predictable assertion of state power.

The author argues that the Department of War's position is rooted in a pragmatic, if chilling, reality: they cannot afford to pause a military operation to ask a CEO for permission. "The DoW would also want the power to determine courses of action, and can't leave operational control in the hands of another," Krishnan writes. This creates a paradox where the very companies calling for AI safety and regulation are the ones being asked to surrender their safety guardrails to the state.

Krishnan's analysis of OpenAI's contrasting approach is particularly insightful. While Anthropic tried to enforce red lines via contract, OpenAI seemingly opted for a model where they retain control over the deployment stack. "OpenAI said sure, we agree to all lawful use, but note these specific laws and regulations, and we will control the deployment of our models, using our people," he paraphrases. This shift from a permissions-based contract to an execution-based control mechanism may be the only viable path forward, yet it raises its own questions about the concentration of power in the hands of a few tech leaders.

The End of Privacy and the Genie

The commentary takes a darker turn as Krishnan considers the long-term implications for civil liberties. He posits that the era of digital privacy is effectively over, not because of a single policy, but because the technology itself has made anonymity impossible. "I am extremely uncomfortable with the fact that we can just purchase commercially available data on almost everyone," he admits. The ability to reverse-engineer identities and track individuals is no longer the exclusive domain of intelligence agencies.

The author warns that the genie cannot be put back in the bottle, regardless of the regulatory framework. "Genies don't tend to go back into bottles, and this one has powerful forces keeping it out," he writes. This inevitability forces a difficult question: if the technology is here to stay, how do we structure the relationship between the state and the private sector to prevent abuse without crippling national defense?

Krishnan suggests that the current tribal politics surrounding these issues are a distraction from the structural reality. "Unless we know what we want to do with the attention, tribal politics is going to overwhelm it all," he argues. The focus on whether a specific CEO is "opportunistic" or "virtuous" misses the point that the system itself is broken.

"Democracy is incredibly annoying but really, what other choice do we have!"

Bottom Line

Krishnan's strongest contribution is his refusal to romanticize the role of private companies as the guardians of democracy against the state, instead exposing the futility of trying to contractually bind the executive branch's operational needs. The argument's vulnerability lies in its somewhat fatalistic acceptance that privacy is gone, potentially underestimating the power of new regulatory frameworks to impose technical constraints. Readers should watch for how the Department of War's new contracts with other AI firms will codify these "execution control" mechanisms, as this will define the future of autonomous warfare.

Deep Dives

Explore these related deep dives:

  • Lethal autonomous weapon

    While the article mentions autonomous weapons as a red line, the Wikipedia entry details the specific international legal debates and technical definitions regarding 'meaningful human control' that Anthropic is likely trying to enforce to avoid liability for unvetted military actions.

Sources

Aligning Anthropic

by Rohit Krishnan · Strange Loop Canon

Last week was a bit crazy. In many ways, but specifically with AI. For those who were blissfully unaware, The Department of War picked a fight with Anthropic over the ways they were allowed to use the model. The fights, as is often the case with the administration, got nasty. Anthropic said no we won’t budge, DoW got angry, and threatened to cut them off and declare them a supply chain risk. A few hours after, OpenAI said they managed to get another deal, apparently a better deal, and one such that any other AI lab can also avail itself of the same terms.

So naturally everyone is angry. Anthropic is angry because they were declared an SCR. DoW is angry because someone tried to force their hand. OpenAI is angry because everyone seems to call them opportunistic ghouls, more or less. The media, both independent and institutional, loves it because they get to play their favourite game of good guy-bad guy.

I really didn’t want to write about this. But it is important, contractual disputes are actually interesting, and sometimes that deserves an explanation.

The facts are roughly as follows: Anthropic had an agreement via Palantir to work with the DoW. They’ve been doing it since mid 2024. They made a different, supposedly unsafe version of Claude to do this. Somehow over the last week, they got into a tiff with the DoW, supposedly over some red lines they had (no mass surveillance and no autonomous weapons), or rather over who gets to say what those lines are and when they’re crossed. OpenAI signed a contract which had those same red lines and an enforcement mechanism.

Now, the claims are roughly as follows, noting that nobody knows if they’re true. Anthropic asked questions about the Maduro raid where it was used, and the DoW got upset. DoW asked a hypothetical about how to do autonomous missile defense using Claude, and got a non-answer that they’d need to talk to the CEO and they’d ‘work it out’. Anthropic asked for their red lines to be enforced by enabling them to act as the approving party (you’d ask them if you had a question). DoW wanted language referring to “all lawful use”, basically saying if what they’re doing is legal you can’t tell them what to do, especially during operations, i.e., you can’t tell them to stop doing something in the ...