Government Curb-Stomps AI Model
45sThe shocking news of the US government shutting down a just-released AI model creates immediate curiosity and controversy.
▶ Play ClipThree days after Anthropic released Claude Fable, the US government ordered its removal due to national security fears after a user jailbroke the model. The directive restricts access by foreign nationals, including Anthropic's own staff, marking the first time a major AI company has pulled a live model. The video explains the jailbreak technique, the government's response, and the controversy surrounding the decision.
Claude Fable was released, but within 3 days the US government stepped in over national security concerns, forcing users back to Opus 4.8.
Fable is a safety-classified version of Mythos 5—a powerful cyber model locked behind 'Glass Wing' for trusted partners like the US government. Fable reroutes unsafe requests to a weaker model.
User 'Plenty The Liberator' jailbroke Fable by breaking requests into innocent fragments using Unicode characters, roleplay farming, or large context confusion, bypassing safety classifiers.
On June 15, 2026, the US government issued an export control directive signed by Commerce Secretary Howard Lutnick, banning foreign nationals from accessing Fable 5 or Mythos 5, even Anthropic's foreign-born employees.
In response, Anthropic removed both Fable and Mythos for all users, demoting them back to Opus 4.8—the first time a major AI company has taken down a public model due to government order.
Developers are unhappy with Anthropic due to reports of intentional performance degradation on research tasks, while some speculate the move was a publicity stunt to boost pre-IPO valuation and create regulatory moats.
Leaked benchmarks suggest Mistral may have a better model, and new releases from OpenAI and Google are expected, though Anthropic's current models remain powerful.
The government's intervention highlights the tension between AI safety and accessibility, as a powerful model was withdrawn following a jailbreak, leaving users reliant on an older, less capable version while the industry anticipates new competitors.
"The title accurately captures the core event: a jailbreak by one individual led to government action making Fable effectively illegal to use for many."
How long after Fable's release did the US government intervene?
3 days.
What was the name of the raw, unmuzzled model behind Fable?
Mythos 5.
0:48
How does Fable handle unsafe requests?
Safety classifiers reroute the request to Opus 4.8 for a dumber, more wholesome response.
1:15
Who jailbroke Fable?
An anonymous internet user known as Plenty The Liberator.
1:46
What method did the jailbreak use?
Breaking dirty requests into smaller, innocent fragments using weird Unicode characters, roleplay farming, or confusing the model in a large context conversation.
2:18
What did the export control directive say?
No foreign national may access Fable 5 or Mythos 5, not even Anthropic's own foreign-born employees.
2:47
Who signed the export control directive?
Commerce Secretary Howard Lutnick.
2:53
What did Anthropic do in response to the directive?
They pulled both Fable and Mythos for all users, demoting them back to Opus 4.8.
3:20
Mythos 5's raw cybersecurity capabilities
Highlights the immense power of the underlying model and why it required stringent safety measures.
0:48Fable's child-lock mechanism
Explains the key safety architecture dividing the public and private versions.
1:15Jailbreak method: request laundering
Reveals a novel exploit technique that bypasses classifiers by breaking requests into harmless fragments.
2:18Unprecedented employee restriction
The government order also barred Anthropic's own foreign-born employees from using the model, a highly unusual action.
2:47First-ever pull of a public AI model by government order
Sets a historical precedent for government intervention in AI deployment.
3:20[00:00] Over the weekend, something crazy
[00:01] happened. It just 3 days after the
[00:03] release of Claude Fable, the US
[00:05] government stepped in and curb-stomped
[00:07] it in the name of national security. And
[00:09] that's bad news if you just FOMO
[00:11] subscribed to Claude Pro to try out
[00:13] Fable because now you'll see this
[00:14] disappointing message if you try to use
[00:16] it and instead be forced to use the
[00:18] negative IQ Opus 4.8. But it's all for
[00:20] your own good because it only took
[00:22] someone a few hours to jailbreak Fable
[00:24] and turn it into an unstoppable cyber
[00:26] weapon. And that's pretty ironic because
[00:28] here in the land of the free, an
[00:30] American company that will not stop
[00:31] talking about AI safety just got
[00:33] safety'd by its own government. In
[00:35] today's video, we'll find out how and
[00:37] why our dear leaders in government are
[00:39] keeping us safe from the horrors of
[00:41] linear algebra. It is June 15th, 2026
[00:44] and you're watching The Code Report.
[00:45] About 2 months ago on April 7th, we were
[00:48] first introduced to Mythos 5, the raw
[00:50] unmuzzled model with the strongest
[00:52] cybersecurity capabilities of anything
[00:54] out there. But it was locked behind a
[00:56] program called Glass Wing, only
[00:57] available to trusted partners like major
[01:00] corporations and the US government
[01:02] itself. The reason Mythos can't be given
[01:04] to normies though is because it could
[01:05] easily be used as a cyber weapon in the
[01:07] wrong hands. To prevent that, Anthropic
[01:09] created a different product called Fable
[01:11] 5, which is literally the same exact
[01:13] model but with safety classifiers bolted
[01:16] on. That means if you ask it to do bad
[01:18] things like create an NPM package that
[01:20] turns the banking system into a
[01:21] Minecraft server, the Fable's guardrails
[01:24] will reroute your request to Opus 4.8
[01:26] for a dumber, more wholesome response.
[01:29] So basically, Mythos and Fable have the
[01:31] same brain but Fable has a child lock on
[01:33] it. If Fable went public and gained
[01:35] hundreds of billions of users overnight
[01:37] and it was awesome. It was by far the
[01:39] best coding AI model I've ever used and
[01:41] people were building all sorts of crazy
[01:43] apps with it. Life was good for about 3
[01:45] days. Then, of course, an anonymous
[01:47] internet user who goes by Plenty The
[01:49] Liberator defeats the guardrails and
[01:51] jailbreaks it. He's basically the
[01:53] internet's let's see if I can penetrate
[01:55] this thing guy and is famous for
[01:57] breaking other AI systems. And on June
[01:59] 10th, he posted a jailbreak on X
[02:01] claiming he popped Fable's guardrails
[02:03] wide open and got it producing exactly
[02:05] the same stuff the child block was built
[02:07] to block. And that's despite the fact
[02:09] that Anthropic had spent thousands of
[02:10] hours red teaming and trying to break
[02:12] its own guardrails internally. But the
[02:14] jailbreak wasn't some kind of sci-fi
[02:16] exploit. It actually works a lot more
[02:18] like money laundering. If Fable has a
[02:20] safety classifier watching for bad
[02:21] requests, but you can break dirty
[02:23] requests down into smaller
[02:25] innocent-looking fragments by wrapping
[02:27] them in weird Unicode characters, by
[02:29] doing roleplay farming, or by confusing
[02:31] the model in a very large context
[02:33] conversation. Due to national security,
[02:35] I can't be any more specific than that,
[02:37] but this weakness was brought to
[02:38] Anthropic's attention and they were
[02:40] initially asked to take the model down,
[02:42] but they refused. Then on Friday at 5:21
[02:45] p.m. Eastern time, Anthropic gets a
[02:47] letter not from a customer, but from the
[02:49] United States government. This letter
[02:51] was an export control directive signed
[02:53] off by Commerce Secretary Howard
[02:55] Lutnick. And the order was that no
[02:57] foreign national may access Fable 5 or
[02:59] Mythos 5. Not abroad, not in the US, and
[03:02] not even Anthropic's own foreign-born
[03:04] employees are allowed to touch it. That
[03:06] last one is pretty crazy. The government
[03:08] told a company that some of its own
[03:10] staff are no longer allowed to use the
[03:11] product they built. That means guys like
[03:13] Andre Karpathy who just recently got the
[03:16] job at Anthropic, they can't even use
[03:18] Fable. In response to that directive,
[03:20] they decided to hit the big red button
[03:21] and yanked Fable and Mythos for
[03:23] everybody. Now everybody's been quietly
[03:25] demoted back to Opus 4.8. And this is
[03:28] the first time in history a major AI
[03:30] company has pulled a live public model
[03:32] off the shelf because the federal
[03:33] government said so. Many developers out
[03:36] there are not too happy with Anthropic
[03:37] right now because on top of this whole
[03:39] situation, there was already backlash
[03:41] over reports that Anthropic was
[03:43] intentionally degrading Mythos and Fable
[03:45] performance on certain AI research jobs
[03:47] without making it obvious to users. But
[03:50] others out there are speculating that
[03:51] this whole thing was a calculated
[03:53] publicity stunt to continue pumping up
[03:55] Anthropic's pre-IPO numbers while
[03:57] simultaneously building a regulatory
[03:59] moat around it. But I think the only
[04:01] thing that can truly stop Anthropic at
[04:03] this point is a better model from a
[04:04] competitor. A leaked benchmark shows
[04:06] that Mistral might have that model, but
[04:08] we're also awaiting new releases from
[04:10] OpenAI and Google. Most of what we hear
[04:12] about AI is either non-stop hype from
[04:14] Big Tech or AI doomers warning us that
[04:17] the Skynet apocalypse will destroy the
[04:18] human race. But if you want to actually
[04:20] understand AI issues, you should check
[04:22] out blue.impact, the sponsor of today's
[04:25] video. They're a non-profit whose
[04:27] mission is to get more people involved
[04:28] in making AI go better for humanity. The
[04:31] main way they do that is by offering
[04:33] free online courses like their future of
[04:35] AI course, which provides an unbiased
[04:38] introduction to where AI is today and
[04:40] where things could be heading over the
[04:42] next few years. They also have more
[04:44] technical courses on things like AI
[04:46] governance and biosecurity along with
[04:48] personalized career support for people
[04:50] interested in working on AI safety. All
[04:52] of BlueDot's resources are free to use
[04:54] because they're funded by philanthropic
[04:56] donations, and they've helped over 8,000
[04:58] people get jobs at organizations like
[05:01] Google DeepMind, Stanford HAI, and
[05:03] Apollo Research. Try out their future of
[05:06] AI course at the link below. This has
[05:08] been The Code Report. Thanks for
[05:09] watching, and I will see you in the next
[05:11] one.
⚡ Saved you 0h 05m reading this? Transcribe any YouTube video for free — no signup needed.