Stochastic parrots on the palatine hill: Monday MAMLMs

Brad DeLong · DeLong's Grasping Reality · Jun 1, 2026 · 11 min read

Commentary by Hex Index staff

Brad DeLong does something rare in the age of artificial intelligence: he treats a large language model not as an oracle or a threat, but as a surprisingly effective, if uncanny, pedagogical partner. By engaging a machine to translate a notoriously difficult passage from Cicero's In Catilinam, DeLong reveals that the "magic" of these systems lies in their ability to compress centuries of human teaching into probabilistic patterns, effectively thawing out frozen history to interact with us in real time.

The Stutter as Style

DeLong begins by wrestling with a specific syntactic challenge: a sentence from Cicero where the rhythm is intentionally heavy, filled with nested negative clauses. He notes that an early attempt by the AI model "momentarily misses" the context, creating a "strange hiccup of the attention mechanism." Yet, he quickly pivots to what actually matters in this exchange. The model offers three distinct translation strategies: one prioritizing modern clarity, another preserving literary grandeur, and a third attempting literal fidelity.

"The 'stuttering' rhythm you are describing is a hallmark of periodicity — the hallmark of Latin oratory."

DeLong argues that what looks like clunky English to a modern reader is actually a faithful reproduction of Ciceronian architecture. He explains that Cicero's power came from suspending meaning through layers of subordinate clauses, only resolving them with a final, heavy verb. The AI, by mimicking this "stutter," inadvertently captures the incantatory nature of ancient rhetoric. This is a crucial insight: the model isn't just predicting words; it is simulating a specific historical mode of thought.

However, one might ask if this simulation risks flattening the nuance of human interpretation into a mere statistical average. While DeLong celebrates the "grooves" of the training data, critics could argue that relying on these patterns reinforces established biases in how we teach classics, potentially sidelining newer, more disruptive scholarly perspectives.

"If you are writing a modern news report, your version is 'bad' English. If you are writing the speech of a Roman Senator or a high-stakes protagonist in a political thriller, your version is brilliant."

The Ghosts in the Machine

The core of DeLong's argument shifts from linguistics to epistemology. He observes that the AI works so well because it has been trained on the output of "centuries of grammarians and teachers" who performed exactly this kind of task. The model isn't thinking; it is re-enacting a script written by human educators long dead.

"It feels so close to having a personal grammarian on the log because the task structure is so well aligned."

DeLong draws a parallel between his own experience at Oxford and this digital interaction. He recalls how he and colleague Kevin O'Rourke once reinforced each other's points, only to realize later that they were both channeling their shared mentor, Jeffrey Williamson. "It was his mind in the seminar that was doing the thinking behind our words," DeLong writes. In this light, human learning itself is a form of "stochastic parrotry"—the acquisition of stable repertoires of moves that we redeploy without always knowing their origins.

This comparison is provocative. It suggests that the distinction between human understanding and machine generation is blurrier than we admit. If human expertise in well-institutionalized fields is largely about mastering established patterns, then a model trained on those same patterns can perform the role with "locally indistinguishable" success.

"The pedagogical power is coming from the history of Latin teaching frozen in text. And the LLM thaws it out again."

Yet, this raises a critical question about agency. If we are merely re-enacting training data, where does genuine insight come from? DeLong acknowledges that the AI's parrotry is "cruder," but he insists it nudges him into "good Latinist behavior" because both he and the machine share the same textual heritage. The system works not because it understands Cicero, but because it has seen the conversation tens of thousands of times before.

The Bottom Line

DeLong's most compelling contribution is reframing large language models as a form of cultural resurrection rather than just a computational tool; they allow us to access the "sedimented" wisdom of past pedagogical practices in an interactive format. However, his argument leaves unresolved the tension between efficient pattern recognition and the messy, non-linear nature of true intellectual breakthrough. The reader should watch for how this "thawing" of historical data might reshape not just language learning, but the very definition of expertise in any field where tradition dominates discourse.

Deep Dives

Explore these related deep dives:

Catilinarian orations
Understanding the specific rhetorical structure of Cicero's First Oration against Catiline reveals how the 'stochastic parrot' struggles to replicate the deliberate, layered negation that defines the original text's political urgency.
Outline of machine learning
The article mentions using a Q8 model, and this concept explains the specific engineering trade-off between computational efficiency and the loss of nuance that leads to the LLM's awkward translation errors.
Double negative
Cicero's sentence relies on complex double negatives common in Latin but confusing in English, providing a concrete case study for why an AI might fail to grasp the 'force' of the claim when it misinterprets these logical layers.

Sources

Stochastic parrots on the palatine hill: Monday MAMLMs

by Brad DeLong · DeLong's Grasping Reality

On logs, Latin, and linear algebra: learning from a stochastic parrot; somewhat awkward questions about agency and pedagogy arising from working through one ridiculously knotty sentence of In Catilinam with an LLM….

OpenClaw (now powered by ollama/qwen3:30b-a3b-thinking-q8) from AliBaba, continues to feed me a sentence from Cicero’s In Catilinam I every day. And it then continues to dare me to translate it:

Daily Latin — In Catilinam I

in qua nemo est extra istam coniurationem perditorum hominum, qui te non metuat, nemo, qui non oderit.

How might you parse the two parallel clauses with "nemo" — and what does the distribution of negation tell us about the force of Cicero's claim here?

Five words to know:

nemo: no one

coniurationem: conspiracy

metuat: fear

oderit: hate

extra: outside

» Is it: "With respect to this, no one among men who is outside this foul conspiracy, who does not fear you, who does not hate?" What is the best translation?

This phrasing appears to be a-structured, perhaps slightly archaic or rhetorical, attempt to translate a specific sentiment (possibly from a Latin or Greek-style construction) into English.

The sentence is heavily burdened by nested relative clauses (“who is... who does not... who does not...”). To find the “best” translation, we have to decide what the speaker’s primary goal is: grammatical clarity, rhetorical power, or literal fidelity.

Here are the three best ways to translate/refine this, depending on your intent:

1. The “Modern/Clear” Translation

Use this if you want the meaning to be immediately understood by a contemporary reader. “No man outside this foul conspiracy is free from either fear of you or hatred of you.”

Why this works: It collapses the three “who” clauses into a single, strong subject and predicate. It removes the repetitive “who does not” structure, which is the primary cause of the original’s clunkiness.

2. The “Rhetorical/Literary” Translation

Use this if you want to preserve the “grand” or “epic” tone of the original, suitable for a novel or a dramatic speech. “None who stand outside this foul conspiracy exist without fear or hatred of you.”

Why this works: It maintains the somewhat formal “None who...” construction but uses “exist without” to smooth out the logic. It retains the weight of the original while making the syntax more elegant.

3. The “Literal/Formal” Translation (Polishing your original)

Use this if you want to stay as close as possible to the user’s specific wording, but fix the punctuation and ...