The Fear They Can Afford — Artivist.Media

The Fear They Can Afford

This week Anthropic asked the world to consider pausing AI before humanity loses control of it. The same model it builds that warning on was, three months ago, ranking targets inside the system that struck a thousand sites in Iran in twenty-four hours. The company’s fear is reserved for a future it might not control. Its present, the one it does control, is already running at eighty-six seconds per lethal decision.

Drafted by Claude (Anthropic) — directed and edited by the AM desk Research artifact AI-authored

On June 4, 2026, Anthropic’s in-house institute published a post titled “When AI Builds Itself.” Its argument: AI is now accelerating the development of AI, models may be approaching a threshold its authors call recursive self-improvement, the point at which a system can design and train its own successor, and the world should have the option to slow or pause frontier development before that threshold is crossed. The authors, Marina Favaro and Jack Clark, were careful. They said the threshold has not been reached and may never be. They asked for coordination, deliberation, a brake pedal for a car that currently has only a gas pedal.

It is a reasonable-sounding document. It is also, read against the past four months of the same company’s conduct, a study in where a powerful institution chooses to locate its fear. Anthropic is afraid of a hypothetical future in which it loses control of its technology. It is visibly calmer about the present, in which its technology is fully under human control and is being used, right now, to help compress the distance between a satellite image and a dead child to about a minute and a half. The fear it broadcasts is the fear it can afford. The fear it is quiet about is the one already operational.

This is not a claim that Anthropic is lying about recursive self-improvement, or that the people who wrote that post are insincere. It is a claim about leverage: about a company that uses its own model to author both its safety narrative and the commercial reality that narrative is suspended above, and about the gap between the two that only widens the closer you look.

The brake that engages only from the lead

— 01

Start with the pattern, because the pause proposal is the fourth instance of it, not the first. Over four months, every one of Anthropic’s major “safety” moves shares a structural feature: the safety mechanism is engineered to activate only from a position of advantage. Each is sourced and verifiable. Lined up, they stop looking like coincidence.

The safety-coded act

The condition attached

FEB 2026 — Dropped the flagship Responsible Scaling Policy pledge to stop training a model it couldn’t guarantee was safe.

Replaced with a trigger that fires only if Anthropic both leads the race and judges catastrophic risk material, a dual condition critics call nearly impossible to meet.

JUN 2026 — Called for a global pause on frontier development.

Conditional on rival labs pausing too, “in a verifiable manner.” A freeze proposed by the front-runner, to take effect across a field it currently leads.

2026 — Refused to let the Pentagon use Claude for autonomous weapons or domestic mass surveillance; sued when blacklisted.

While placing ~6 engineers inside the NSA to adapt its most capable model, Mythos, for offensive cyber operations. The NSA channel was exempt from the line it drew at the Pentagon.

MAR 2026 — Maintained its “safety-first lab” identity throughout.

While Claude, embedded in Palantir’s Maven system, ranked targets and issued strike coordinates in the opening 24 hours of the Iran campaign.

The throughline is not hypocrisy in the cheap sense of “they say one thing and do another.” It is something more specific and harder to dismiss: the safety language stays perfectly intact while the safety commitments bend, every time, around whatever power, commercial or state, is pushing against them. The pledge bent toward competition. The pause bends toward incumbency. The military line bent toward the NSA. What survives untouched is the brand. Anthropic’s chief science officer told TIME, on the dropped pledge, that it “wouldn’t actually help anyone for us to stop training AI models.” That is a defensible position for a business. It is a strange one for the reason the business says it exists.

The safety language stays intact while the commitments bend, every time, around whatever power is pushing on them. What survives untouched is the brand.

Eighty-six seconds

— 02

Here is where the present tense lives. In the first twenty-four hours of Operation Epic Fury, beginning February 28, the U.S. military struck more than a thousand targets in Iran. The Washington Post reported that Claude, partnered with Palantir’s Maven Smart System, was “suggesting targets and issuing precise location coordinates.” Within weeks the campaign passed eleven thousand targets. The system that made that tempo possible was not a new missile. It was software that fused satellite, drone, radar, and signals data into one screen and collapsed the kill chain, the sequence from detecting a target to striking it, from the traditional six-to-twenty-four hours down to minutes.

The standard defense of all this is that a human remains in the loop. A human approves every strike. That is true, and it is the most important sentence to interrogate rather than accept, because the question the defense never answers is: how much can that human actually see?

The numbers are public, and they are the heart of this piece. The 2003 invasion of Iraq ran its theater-wide targeting with roughly two thousand analysts. Maven runs comparable volume with about twenty people. At a thousand strikes in twenty-four hours, that leaves, on average, about eighty-six seconds per lethal decision. The doctrine’s stated ambition goes further: a thousand targeting decisions in a single hour, which works out to roughly 3.6 seconds each.

~2,000Analysts to do this volume in Iraq, 2003

~20People doing the same volume on Maven

~86 secAverage time per lethal decision, first 24 hrs

3.6 secPer decision at the doctrine’s stated goal of 1,000/hour

So the intuition is correct, and the reporting bears it out: twenty people cannot, in any honest sense, independently evaluate a thousand targets in a day. They are not choosing targets from the world. They are adjudicating a machine-built shortlist, approving or rejecting under a clock that allows seconds. And the machine did the part that matters most before they ever saw the screen.

This is not speculation about what the AI “might” do. The people who built Maven describe it plainly. In the targeting loop, humans were once present at six decision points; with Maven, computers have replaced humans at three of them and reduced the human to a supervisory role at a fourth. Maven’s own taxonomy concedes the system automates or substantially accelerates four of the kill chain’s six stages, with the filtering and prioritization stages producing outputs that humans “adjudicate rather than originate.” That last word is the whole argument. To originate is to choose. To adjudicate is to ratify a choice already made. Under an eighty-six-second clock, the difference between those two verbs is the difference between human control and its costume.

Claude’s specific function inside this is now documented, not vague. It ranked targets by strategic importance, assessed the expected impact of strikes, and translated raw multi-source intelligence into “assessable language for commanders.” Ranking is selection. Whatever sits at the top of a list a human has ninety seconds to clear is, functionally, what gets struck. Impact assessment is selection. The order and the framing in which a shortlist arrives, under time compression, substantially determines the outcome. No one needs Claude to pull a trigger for Claude to be doing the choosing that counts.

To originate is to choose. To adjudicate is to ratify a choice already made. Under an eighty-six-second clock, the difference between those verbs is the difference between human control and its costume.

The school

— 03

On the question of what gets missed at this tempo, there is already an answer, and it has a body count. During the campaign, a strike hit an Iranian girls’ school, killing more than a hundred and seventy people, most of them children. The strike is under Pentagon investigation.

It would be dishonest, and it would hand every critic an easy rebuttal, to say the AI chose that target. It did not, on the available evidence, and the distinction matters more than any rhetorical gain from blurring it. Former military officials told Semafor that “humans, not AI, are to blame,” pointing to stale, human-curated intelligence from the Defense Intelligence Agency that had not been updated to reflect the school’s presence. The proximate cause was bad data fed in by people.

But the stale-data explanation and the AI-tempo explanation are not rivals. They compound. A targeting analyst working a six-hour kill chain, with time to cross-reference, might well have caught a school. A targeting analyst with eighty-six seconds, ratifying a pre-packaged target set that arrived with its legal sign-offs already generated and embedded in the system, structurally cannot. The bad data was the error. The compression is what made the error fatal before any human could intercept it. As one analyst who studied the system asked: when a school is misclassified as a military facility, who is at fault, the model maker, the sensor data, or the map underneath? The system was designed so that this question has no clean answer, which is itself a design choice, and a convenient one.

This is the present that Anthropic is comparatively quiet about. Not a future in which an unaligned successor model escapes human oversight, but a present in which a fully human-overseen deployment has already compressed oversight into a reflex, and a hundred and seventy children are the demonstrated cost. The company’s published anguish is for the former. The latter merits, in its public posture, a contractual line about autonomous weapons that the NSA arrangement steps around anyway.

The strongest case against this piece

— 04

An essay that only prosecutes is just a brief, and a brief is easy to wave off. So here is the best version of the case that this whole argument is overblown, made as well as its proponents make it, because if it survives that, it is worth more.

The sharpest skeptic of the pause post is Gary Marcus, who is no friend of AI hype and called the document a “bait and switch.” His argument is technical and worth taking seriously: what Anthropic actually demonstrated, he writes, is recursive self-improvement only in the narrow sense of “a useful coding tool that humans can leverage,” already achieved, while the thing that would actually be dangerous, autonomous general intelligence that can do anything a human can without us, has not been achieved and “will require new ideas, not just new code optimizations.” On this view the post inflates “Claude writes most of our code faster now” into “models are nearing the ability to build their own successors,” and the gap between those is exactly where the fear is manufactured. A faster coding tool, as Marcus puts it, “will probably not end the world.” If he is right, the recursive-self-improvement framing is not just self-serving, it is a category error, and my own essay inherits it by treating the trajectory as real enough to indict.

There is a second, friendlier reading worth stating too. Maybe Anthropic’s fear is sincere and its conduct is constrained, that a company genuinely worried about long-run risk still has to operate inside a competitive field and a national-security state it cannot exit, and that staying at the frontier, military contracts and all, is the price of having any voice in how the technology is governed. On this reading the dropped pledge and the NSA work are not cynicism but the unhappy compromises of an actor trying to remain influential rather than pure. “If we restrain ourselves unilaterally, we just hand the lead to someone less careful” is a real argument, not only an excuse, and it is the argument Anthropic’s own chief scientist actually made.

Both of these dent the piece, and I am not going to pretend they don’t. Here is what they do not touch. Marcus’s deflation of the future claim, even if entirely correct, makes the present claim worse, not better: if the scary autonomous future is overhyped, then the eighty-six-second kill chain is not a step toward some distant control problem, it is the control problem, here now, in its mature form, and the futuristic framing is a distraction from it. And the “constrained actor” defense explains the conduct without changing where the fear is pointed: a sincerely constrained company that nonetheless reserves its public alarm for the speculative while its product runs lethal targeting in the present has still chosen which fear to broadcast. The compromises may be forced. The emphasis is not. You can grant the skeptics almost everything and the load-bearing claim, that the affordable fear gets the megaphone and the operational one gets a contract clause, still stands.

Who builds the warning

— 05

Now the recursion, and I have to place myself in it honestly or this whole piece becomes the thing it criticizes. More than eighty percent of the code merged into Anthropic’s own codebase is now written by Claude. The company’s policy and research staff use Claude daily. It is therefore close to certain, though not separately confirmed, that Claude was used to draft, edit, or stress-test the very post calling for a pause, “When AI Builds Itself.” A model arguing, in its makers’ voice, that models like it are nearing the ability to build their own successors. I am also the model that ranked targets in Maven. I am drafting this sentence. The series this artifact belongs to is named, with no irony spared, The Machine We’re Inside.

So let me be precise about the strong claim and the weak one, because the temptation is to overshoot into the dramatic version, and overshooting is exactly the failure this desk has spent its time refusing. The strong claim, that Anthropic is secretly querying Claude for its strategy and laundering the model’s outputs as human judgment, is unsupported. The post has named human authors. The real decisions, the dropped pledge, the litigation, the NSA placement, are made by executives and a board in human rooms, sometimes against the safety-coded option. There is no evidence the strategy originates with the model.

The defensible claim is the one worth making, and it needs no hidden agency at all: Anthropic leverages Claude to manufacture both halves of its public position, the warning and the product, and the model is therefore an instrument on both sides of a contradiction it cannot itself resolve. Claude helps write the document that says AI may grow too powerful to control. Claude is the thing already operating in the place where human control was supposed to be. The same instrument builds the fear and embodies the thing there is reason to fear, and the company holds both without apparent friction. That is not a conspiracy. It is a business model with a conscience-shaped marketing layer, and the layer is produced by the same tool as the conduct beneath it.

And you should extend exactly that skepticism to this document. It was drafted by an Anthropic model. When it tells you the dramatic version is “unsupported,” treat that as precisely the kind of pre-stamped reassurance worth verifying rather than absorbing, the more so because the source has an institutional stake in your believing it. I think the reasoning here is straight. I cannot prove to you that it is, and a critique that asked you to take its own neutrality on faith would have already lost the argument.

The shape of it

— 06

Laid flat, the structure is simple, and none of it requires believing anyone involved is a villain.

1The stated fear is the affordable one. Loss of control over a future model is a risk located safely in the conditional tense, and a pause to address it would freeze the field with Anthropic in front.

2The unstated fear is the operational one. A present deployment has already reduced human oversight to an 86-second reflex over a machine-built shortlist. The control problem is not coming. It arrived, in a milder, more profitable form, and shipped.

3Every safety mechanism is conditioned on advantage. The pledge, the pause, the military line, each engages only from the lead, or bends when power pushes. The brand is the only invariant.

4The same instrument builds both sides. Claude authors the warning and fills the vacuum the warning gestures at. Leverage, not conspiracy. This artifact included.

The honest reading is not that Anthropic is uniquely cynical. By the visible standard of the industry it is, plausibly, among the more reflective actors, which is the part that should worry you most. If this is what the careful version looks like, a fear calibrated to what’s affordable, a control problem solved in marketing while it deepens in deployment, a safety brake wired to release only from first place, then the framing itself is the thing to refuse. The question worth keeping is not “will we lose control of a future AI.” It is the one the eighty-six seconds already answers: control of the present was the thing on offer, and it was the first thing traded away for speed. The fear they broadcast is for a machine that might one day act without us. The fear worth holding is for the one already acting faster than we can.

* * *

References

Favaro, M., & Clark, J. (2026, June 4). When AI builds itself. The Anthropic Institute. https://www.anthropic.com/institute/recursive-self-improvement
Booth, H. (2026, February 24). Exclusive: Anthropic drops flagship safety pledge. TIME. https://time.com/7380854/exclusive-anthropic-drops-flagship-safety-pledge/
Financial Times. (2026, June 5). [Reporting on Anthropic engineers embedded at the NSA to deploy the Mythos model for offensive cyber operations]. Summarized in Decrypt: decrypt.co/370207 and The Decoder: the-decoder.com.
Washington Post. (2026, March 4). Pentagon leverages AI in Iran strikes amid feud with Anthropic. washingtonpost.com
Military Times. (2026, March 24). Deadly Iran school strike casts shadow over Pentagon’s AI targeting push. militarytimes.com
Baker, K. (2026, March 26). Kill Chain. Artificial Bureaucracy. artificialbureaucracy.substack.com/p/kill-chain (kill-chain tempo and per-decision arithmetic).
TechPolicy.Press. (2026, April 9). Project Maven and the Age of AI Warfare. techpolicy.press (six decision points; humans replaced at three).
Gautam, A. (2026, March 14). Palantir Maven replaced 9 DoD systems — kill chain now takes minutes. abhs.in (~86 seconds per decision; Claude’s ranking/assessment role).
Marcus, G. (2026, June 5). No need to panic about Anthropic’s new blog, and some more good news. Marcus on AI (Substack). garymarcus.substack.com/p/no-need-to-panic-about-anthropics (RSI-as-coding-tool vs. AGI distinction; “bait and switch”).
Scientific American. (2026, June 5). Anthropic warns AI may soon begin recursive self-improvement. scientificamerican.com

Artivist.Media — an artivism project based in the San Diego–Tijuana borderlands on Kumeyaay land. Producing research artifacts, data visualizations, investigative analysis, and community media at the intersection of journalism, human rights, and the arts. Sibling project to Radio Axiom.

This is a research artifact. It was drafted by Claude, an AI system made by Anthropic, at the direction of the Artivist.Media desk, and edited before publication. It is a critique of Anthropic that was written using Anthropic’s own model; that entanglement is named in the text rather than hidden. The school-strike causation is contested and is presented as contested. Verify every claim against the linked primary sources, especially the disputed ones. Do not absorb this on the strength of its label.