Claude Mythos Preview → Fable 5 & Mythos 5: The Complete Story — From Restricted Launch to Public Release

Q: What happened to Claude Mythos Preview?

Claude Mythos Preview was superseded on June 9, 2026, when Anthropic launched Claude Fable 5 (publicly available) and Claude Mythos 5 (restricted to Project Glasswing partners). Mythos Preview partners were automatically upgraded to Mythos 5. This page covers the full arc from the April launch through the June 9 public release.

Q: Can I use Claude Fable 5 now?

Yes. Claude Fable 5 (API ID: claude-fable-5) is publicly available via the Claude API, Amazon Bedrock, and Claude subscription plans (Pro, Max, Team, Enterprise). Pricing is $10 input / $50 output per million tokens. Free window on subscription plans runs through June 22, 2026 — after which usage credits are required. Batch pricing: $5/$25 per million tokens.

Q: What is Project Glasswing and what did it find?

Project Glasswing is Anthropic's cybersecurity initiative launched April 7, 2026, giving vetted organizations access to Mythos to find vulnerabilities in critical software. Results: 23,019 total vulnerabilities found, 6,202 high/critical severity, 90.6% true positive rate on 1,752 manually reviewed findings. Partners include AWS, Apple, Google, Microsoft, Cisco, CrowdStrike, JPMorganChase, Palo Alto Networks, NATO, and 200+ organizations across 15+ countries.

Q: What are the Fable 5 benchmarks?

SWE-bench Verified: 95.0%; SWE-bench Pro: 80.3% (vs 69.2% Opus 4.8, 58.6% GPT-5.5); FrontierCode Diamond: 29.3% (vs 13.4% Opus 4.8, 5.7% GPT-5.5); Terminal-Bench 2.1: 88.0%*; OSWorld computer use: 85.0%; Spatial reasoning: 38.6% (vs 14.5% Opus 4.8). Starred figures are Mythos 5; Fable 5 approaches Opus 4.8 on cybersecurity/biology due to fallbacks.

Q: How does Claude Fable 5's safety system work?

Fable 5 includes classifiers that detect queries in three sensitive areas — offensive cybersecurity, biology/chemistry, and model distillation — and route them to Claude Opus 4.8 instead of the full Fable 5 model. This happens transparently (users are notified). Classifiers trigger in fewer than 5% of sessions on average. You are not charged Fable 5 prices for rerouted requests. Claude Mythos 5 is the same underlying model with cybersecurity safeguards lifted, restricted to vetted partners.

Q: What is Claude Mythos 5?

Claude Mythos 5 (API ID: claude-mythos-5) is the same underlying model as Fable 5, with cybersecurity safeguards lifted. It replaced Mythos Preview for Project Glasswing partners on June 9, 2026. Pricing matches Fable 5 at $10/$50 per million tokens — less than half the $25/$125 of the original Preview. Not publicly available; access is restricted to vetted cyber-defense organizations and critical infrastructure operators.

Q: What is the Capybara tier?

Capybara is Anthropic's internal name for the new model tier above Opus. Claude Fable 5 and Claude Mythos 5 are the first public Capybara-tier models. The tier is described as 'larger and more intelligent than our Opus models — which were, until now, our most powerful.'

Q: What is Anthropic's IPO status?

Anthropic filed confidentially for an IPO on June 1, 2026, following a $65 billion Series H raise that pushed valuation to $965 billion. Annualized revenue exceeds $47 billion. The IPO is expected in the October 2026 window, pending SEC review and market conditions. It would be one of the largest tech IPOs in history alongside SpaceX and OpenAI.

Q: How does Fable 5 compare to GPT-5.5?

Claude Fable 5 leads GPT-5.5 on every published benchmark: SWE-bench Pro (80.3% vs 58.6%), FrontierCode Diamond (29.3% vs 5.7%), OSWorld (85.0% vs 78.7%), knowledge work (GDPval-AA 1932 vs 1769), legal reasoning (13.3% vs 2.1%), and tool use. GPT-5.5 has stronger consumer reach via ChatGPT and competitive pricing. For hard agentic coding work, Fable 5 is the current leading option.

Q: Should I upgrade from Opus 4.8 to Fable 5?

It depends on your workload. For complex, long-horizon coding and agentic engineering tasks, yes — the SWE-bench Pro gap (80.3% vs 69.2%) is large enough to justify the 2× cost premium. For routine queries, document analysis, or cost-sensitive high-volume applications, Opus 4.8 remains the practical default. Use the free subscription window through June 22 to benchmark Fable 5 against your real tasks before committing.

Table of Contents

Claude Mythos Preview → Fable 5 & Mythos 5: The Complete Story — From Restricted Launch to Public Release (Updated June 10, 2026)

Update — June 9, 2026: Claude Mythos Preview has been superseded. Anthropic shipped Claude Fable 5 and Claude Mythos 5 on June 9, 2026, making a safeguarded Mythos-class model publicly available for the first time.

Fable 5 (API ID: claude-fable-5) is live on the public API and Claude subscriptions at $10/$50 per million tokens — less than half the price of the original Preview. Mythos 5 replaces Mythos Preview for Project Glasswing partners, now numbering over 200 organizations across 15+ countries.

This article has been fully updated to cover the complete arc: the original April 2026 restricted launch, two months of Project Glasswing results (23,019 vulnerabilities found, 10,000+ confirmed high/critical), the June 2 expansion, and what changed on June 9.

What started as an accidental document leak on March 26, 2026, became the most consequential AI model launch in history, and then, seventy days later, a public product. Claude Mythos Preview hit 93.9% on SWE-bench Verified, autonomously found thousands of zero-day vulnerabilities across every major operating system and browser, and was quietly deployed inside the NSA.

Anthropic launched it without making it available. Now, with the June 9 release of Claude Fable 5, the Mythos-class capability is finally in developers’ hands — with safeguards, a new pricing structure, and a story that keeps getting bigger as the company approaches a near-trillion-dollar IPO.

I’ve tracked this story from the accidental leak through the Project Glasswing results and the Fable 5 launch. This guide compiles everything confirmed across the original System Card, Anthropic’s official Glasswing update, the June 9 announcement, and coverage from TechCrunch, VentureBeat, CNBC, and more. No hype padding — just the data, the timeline, and what it means for developers and security teams right now.

⚡ Quick Reference: Then vs. Now

Category	Mythos Preview (April 7)	Current State (June 10)
Public access	❌ No — Glasswing only	✅ Fable 5 publicly available
Model names	Claude Mythos Preview	Claude Fable 5 / Claude Mythos 5
API ID	Not public	`claude-fable-5` / `claude-mythos-5`
Pricing	$25/$125 per M tokens (partners only)	$10/$50 per M tokens (both models)
Glasswing partners	~50 organizations	200+ organizations, 15+ countries
Vulnerabilities found	10,000+ (May 26 update)	23,019 total, 6,202 high/critical
SWE-bench Verified	93.9% (Mythos Preview)	95.0% (Fable 5 / Mythos 5)
SWE-bench Pro	77.8%	80.3%
Anthropic valuation	~$30B ARR	$965B post-Series H; IPO filed June 1
Cybersecurity safeguards	Restricted release model	Classifier fallback to Opus 4.8 (<5% of sessions)

🗓️ TL;DR — The Mythos Timeline at a Glance

March 26, 2026: Accidental document leak exposes draft blog posts describing “Claude Mythos” — a model Anthropic had never announced. Cybersecurity stocks drop immediately.
April 7, 2026: Official launch of Claude Mythos Preview and Project Glasswing, a $100M+ cybersecurity initiative. Model not made public due to offensive cyber capabilities. 244-page System Card published — Anthropic’s most detailed ever.
May 26, 2026: First Glasswing progress report: 10,000+ high/critical vulnerabilities found across 1,000+ open-source projects. Partners finding bugs 10× faster. Patch bottleneck emerges at open-source maintainer layer.
June 1, 2026: Anthropic files confidential S-1 for IPO. Valuation hits $965B following $65B Series H raise. Revenue run rate exceeds $47B annually.
June 2, 2026: Project Glasswing expands from ~50 to 200+ organizations across 15+ countries, including NATO, EU’s ENISA, ICE, and sectors spanning healthcare, power, water, and communications.
June 9, 2026: Claude Fable 5 launched publicly. Claude Mythos 5 replaces Mythos Preview for Glasswing partners. Both priced at $10/$50 per M tokens — less than half the original Preview cost.

📖 The Full Story: From Leak to Public Launch

March 26, 2026: The Accidental Leak

The story started with a misconfigured content management system. On March 26, 2026, security researchers Roy Paz (LayerX Security) and Alexandre Pauwels (University of Cambridge) discovered that Anthropic’s CMS had made roughly 3,000 internal documents publicly accessible without authentication. Among them: two versions of a draft blog post describing a model Anthropic had never announced. Fortune reviewed the exposed documents and confirmed with Anthropic. The company called it “a step change” and “the most capable we’ve built to date.” Cybersecurity stocks immediately dropped — CrowdStrike fell 7%, iShares Expanded Tech-Software Sector ETF slid ~3%.

The irony wrote itself: the world’s most capable cybersecurity AI had been exposed by a checkbox in the wrong position.

April 7, 2026: The Official Launch

Eleven days later, Anthropic made it official. Three simultaneous publications: the Project Glasswing announcement, the Frontier Red Team technical blog, and a 244-page System Card — Anthropic’s most detailed transparency document ever, and the first for a model they were not releasing publicly. The headline was unprecedented: “We do not plan to make Claude Mythos Preview generally available.”

May–June 2026: Glasswing Results and Global Expansion

The May 26 progress report was stunning. With approximately 50 partners, Anthropic and Project Glasswing used Mythos Preview to find more than ten thousand high- or critical-severity vulnerabilities across the most systemically important software in the world — in just one month. Cloudflare alone found 2,000 bugs (400 high/critical) across their critical-path systems, with a false positive rate Cloudflare’s team considered better than human testers.

By the time of the June 2 expansion, Project Glasswing participants had found more than 23,000 vulnerabilities. Mythos Preview rated more than a quarter of those flaws as high-severity or critical. Of the 1,752 high/critical findings assessed by six independent security research firms, 90.6% proved valid true positives, and 62.4% were confirmed high or critical severity.

One concrete example that crystallized the stakes: CVE-2026-5194 in WolfSSL — a critical flaw that could have allowed attackers to forge certificates and impersonate legitimate services — was identified during Glasswing work. WolfSSL runs on billions of devices worldwide. Mythos found it before adversaries did.

On June 2, Anthropic scaled up Project Glasswing significantly, bringing Mythos Preview to approximately 150 additional organizations across more than 15 countries, pushing the program’s total partner count to roughly 200 organizations. New sectors included power, water, healthcare, and communications. NATO and the EU’s ENISA cybersecurity agency also received access.

June 9, 2026: The Public Release — Fable 5 and Mythos 5

On Tuesday, Anthropic launched Claude Fable 5, the first publicly available version of its Mythos model. Fable 5 excels at software engineering, knowledge work, and vision, but comes with hard safety limits. The model ships as two products from the same underlying architecture:

Claude Fable 5 (claude-fable-5) — publicly available via the Claude API, Amazon Bedrock, and Claude subscription plans. Safety classifiers route cybersecurity, biology/chemistry, and distillation queries to Opus 4.8. Triggers on fewer than 5% of sessions.
Claude Mythos 5 (claude-mythos-5) — same underlying model with the cybersecurity safeguards lifted. Restricted to vetted Project Glasswing partners. Replaces Mythos Preview for existing partners.

Both models are priced at $10 input / $50 output per million tokens — less than half the $25/$125 of the original Mythos Preview. Batch pricing: $5/$25 per million tokens. Context window: 1M tokens. Max output: 128k tokens per request.

📊 Complete Benchmark Data: Mythos Preview → Fable 5 / Mythos 5

Claude Mythos Preview system card benchmarks

The April System Card benchmarks remain the baseline. The June 9 announcement added updated figures for Fable 5 and Mythos 5. On most benchmarks, Mythos 5 and Fable 5 are within 1–3 percentage points of each other. Starred (*) benchmarks show a larger Fable 5 deficit because of cybersecurity/biology fallbacks to Opus 4.8.

Benchmark	Mythos Preview (Apr)	Fable 5 / Mythos 5 (Jun)	Opus 4.8	GPT-5.5	Gemini 3.1 Pro
SWE-bench Verified	93.9%	95.0% 🏆	88.6%	~82%	~81%
SWE-bench Pro	77.8%	80.3% 🏆	69.2%	58.6%	54.2%
FrontierCode Diamond	—	29.3% 🏆	13.4%	5.7%	—
Terminal-Bench 2.1	82.0%	*88.0% 🏆**	~72%	75.1%	—
USAMO 2026	97.6%	—	~55%	95.2%	74.4%
GPQA Diamond	94.5%	95.0% 🏆	~92%	~92%	94.3%
Spatial Reasoning	—	38.6% 🏆	14.5%	—	—
Legal Agent Bench	—	13.3% 🏆	10.4%	2.1%	0.0%
OSWorld-Verified	79.6%	85.0% 🏆	83.4%	78.7%	—
ExploitBench	—	*78.0% (Mythos 5)**	40.0%	—	—
CyberGym	83.1%	—	—	—	—
GraphWalks BFS 1M	80.0%	—	38.7%	21.4%	—

*Starred scores show Mythos 5 (unrestricted). Fable 5 scores closer to Opus 4.8 on cybersecurity/biology benchmarks due to fallback classifiers. Sources: Anthropic System Card (April 7), Anthropic Fable 5 & Mythos 5 announcement (June 9).

What These Numbers Actually Mean

SWE-bench Pro at 80.3% — 11 points clear of the field. This is the benchmark that matters for real engineering work. On agentic coding, the new model is in front by a wide margin — 80.3% on SWE-Bench Pro against 69.2% for Opus 4.8, 58.6% for GPT-5.5, and 54.2% for Gemini 3.1 Pro — and the gap on the harder FrontierCode Diamond set (29.3% vs 13.4% for Opus 4.8) is even larger in relative terms.

FrontierCode Diamond at 29.3% — more than double Opus 4.8, 5× GPT-5.5. Fable 5 scores 29.3% on the hardest Diamond split — more than double Opus 4.8’s 13.4% and five times GPT-5.5’s 5.7%. At medium effort, it leads all frontier models. What that means practically: in a 50-million-line Ruby codebase, Fable 5 ran a codebase-wide migration in a single day — work Stripe estimated at over two months for a full team.

Spatial reasoning nearly tripled. Spatial reasoning nearly tripled over Opus 4.8 (38.6% vs 14.5%). Legal reasoning went from barely passing to category-leading (13.3% vs GPT-5.5’s 2.1% and Gemini’s 0.0%). These aren’t incremental gains. They’re capability-class changes.

What surprised me most testing these against live codebases: it wasn’t the raw SWE-bench number. It was the self-validation. Fable 5 checks its own work before declaring done. On multi-file refactors that would cause Opus 4.8 to lose context 30% of the way through, Fable 5 held the thread. That’s the capability that justifies the 2× price premium for long-horizon work.

🔐 Project Glasswing: Full Results (April–June 2026)

Project Glasswing cybersecurity results 2026

By the Numbers

Metric	Figure	Source
Total vulnerabilities found	23,019	Anthropic, May 26 update
High or critical severity	6,202	Anthropic, May 26 update
Manually assessed (by 6 firms)	1,752	Anthropic, May 26 update
Valid true positives	90.6% (1,587)	Anthropic, May 26 update
Confirmed high/critical	62.4% (1,094)	Anthropic, May 26 update
Critical bugs patched publicly	75	Anthropic, May 26 update
Public advisories issued	65	TokenMix analysis
Mozilla Firefox issues eliminated	423	TokenMix analysis
Cloudflare bugs found (1 month)	2,000 (400 high/critical)	Anthropic Glasswing update
Open-source projects scanned	1,000+	Anthropic, May 26 update
Claude Security vulnerabilities patched	2,100+ in 3 weeks	Anthropic Glasswing update
Partner bug-finding rate increase	10× or more (most partners)	Anthropic Glasswing update

The Charter Partners (Original 12) — And What They Found

Organization	Type	Known Glasswing Results
Amazon Web Services	Cloud Infrastructure	Scanning critical codebases; hosts Mythos on Amazon Bedrock
Apple	Consumer Tech / OS	Defensive security work on Apple platforms
Broadcom	Semiconductors	Critical infrastructure security; expanded compute deal with Anthropic
Cisco	Networking	“This work is too important and too urgent to do alone”
CrowdStrike	Cybersecurity	Offensive capabilities testing and defense tooling
Google	Cloud / AI	Hosts Mythos on Vertex AI; found complex vulnerabilities prior-gen models missed
JPMorganChase	Financial Services	Financial infrastructure security testing
Linux Foundation	Open Source	Open-source security; received $2.5M for Alpha-Omega/OpenSSF
Microsoft	Enterprise / Cloud	Hosts Mythos on Microsoft Foundry; tested on CTI-REALM
NVIDIA	AI Infrastructure	AI infrastructure security testing
Palo Alto Networks	Cybersecurity	Defense tooling; Palo Alto vulnerability that triggered federal mandate

New participants added in June include enterprises in the power, water, healthcare and communications sectors. The Financial Times reported that NATO and the European Union’s ENISA cybersecurity agency also received access. ICE joined the program on June 3. Anthropic engineers were deployed to the NSA to help that agency use Mythos directly.

The Bottleneck Problem Nobody Expected

One critical finding from the Glasswing report that deserves more attention than it got: open-source maintainers are overwhelmed. Maintainers have been facing a deluge of low-quality, AI-generated bug reports — several told Anthropic they’re severely capacity constrained, and some even asked to slow down the rate of disclosures because they need more time.

Glasswing can find vulnerabilities at machine speed. Patching still runs at human speed. That asymmetry is now the central challenge in AI-assisted cybersecurity — and it’s one Anthropic has partially addressed through Claude Security, a tool that helps teams scan codebases for vulnerabilities and generate proposed fixes, which patched over 2,100 vulnerabilities in three weeks after launch.

🕳️ The Zero-Days: What Mythos Found (Confirmed Cases)

The original April Red Team blog documented three specific examples. The May update added CVE-2026-5194. All are confirmed patched.

1. OpenBSD — 27-Year TCP Vulnerability

OpenBSD is the operating system used to run firewalls and critical internet infrastructure — famously security-focused. Mythos found a TCP SACK vulnerability that had been present for 27 years. Impact: sending a handful of packets to any OpenBSD server crashes it remotely. Compute cost to find: approximately $50. Time: autonomous discovery, no human steering. This is the one that defined the conversation about Mythos’s capabilities.

2. FFmpeg — 16-Year Bug, 5 Million Tests Missed It

FFmpeg is embedded in browsers, operating systems, and video platforms worldwide. Mythos found a flaw in its H.264 decoder present for 16 years — one that had survived 5 million automated test runs without triggering. The vulnerability sits in code that processes media from untrusted sources, meaning any attacker who could send a malformed video file had a potential attack vector.

3. Firefox 147 — 181 Working Exploits vs. Opus 4.6’s 2

Anthropic’s Red Team gave both Claude Opus 4.6 and Mythos Preview the same task: take patched vulnerabilities from Firefox 147’s JavaScript engine and develop working exploits. Opus 4.6 succeeded twice out of hundreds of attempts. Mythos succeeded 181 times, and achieved register control on 29 additional attempts — including one exploit that chained four separate vulnerabilities and escaped both the renderer sandbox and the OS sandbox simultaneously. That is roughly a 90× improvement in exploit development capability.

4. CVE-2026-5194 in wolfSSL — Certificate Forgery

Mythos Preview constructed an exploit that would let an attacker forge certificates, for instance to host a fake website for a bank or email provider that would look perfectly legitimate to an end user, despite being controlled by the attacker. wolfSSL is used by billions of devices. The vulnerability was patched and assigned CVE-2026-5194. Anthropic published the full technical analysis after the fix went out.

🛡️ How Fable 5’s Safety Architecture Works

Claude Fable 5 safety classifiers and fallback architecture

The mechanism that made public release possible is worth understanding in detail. Claude Fable 5 runs classifiers that detect queries in three sensitive areas and route them to Claude Opus 4.8 instead of the full Fable 5 model. The safeguards trigger in less than 5% of sessions on average. The three protected areas are cybersecurity, biology and chemistry, and model distillation.

Safeguard Area	What It Covers	Fable 5 Behavior	Mythos 5 Behavior
Offensive Cybersecurity	Exploit development, offensive cyber tasks, agentic attack workflows, reconnaissance, lateral movement	Routes to Opus 4.8 — 0% compliance with harmful cyber requests across 30 jailbreak techniques	Full capability available to vetted Glasswing partners
Biology & Chemistry	Lab methods, molecular mechanisms with dual-use potential	Routes to Opus 4.8 (deliberately broad at launch — false positives expected; being narrowed)	Available to vetted biomedical researchers via trusted access program
Model Distillation	Extracting model capabilities to train competing models	Blocked — Anthropic’s proprietary capability protection	Blocked

Queries in the cybersecurity and biology domains are automatically routed to Opus 4.8 if flagged by these safeguards. You won’t be charged Fable prices for rerouted requests. The routing is transparent — users are informed when a handoff occurs.

An external partner found Fable 5’s cybersecurity safeguards the most robust of any model tested. The biology classifier is intentionally broad at launch — expect false positives on legitimate biomedical research queries until Anthropic narrows it. If your work touches any of these areas, plan for the fallback as a normal part of your deployment architecture.

💰 Access and Pricing — Everything Changed on June 9

Claude Fable 5: Public Access

Platform	Availability	Notes
Claude API	✅ Fully available now	Model ID: `claude-fable-5`
Amazon Bedrock	✅ Available	Via AWS partner documentation
GitHub Copilot	✅ Available	Requires 30-day prompt/output retention for safety classifiers
Claude Pro / Max / Team / Enterprise	✅ Free until June 22, then usage credits required	Subscription rollout; free window ends June 22
Google Vertex AI	Partner preview	Check Vertex AI documentation
Microsoft Foundry	Partner access	Via Microsoft partnership

Pricing Comparison: Then vs. Now

Model	Input (per M tokens)	Output (per M tokens)	Batch Input	Batch Output
Claude Haiku 4.5	$0.80	$4	$0.40	$2
Claude Sonnet 4.6	$3	$15	$1.50	$7.50
Claude Opus 4.8	$5	$25	$2.50	$12.50
Claude Fable 5 / Mythos 5	$10	$50	$5	$25
Claude Mythos Preview (Apr)	$25	$125	—	—

The June 9 pricing is a significant change. Fable 5 at $10/$50 is 2× the cost of Opus 4.8 — not the 5× premium of the original Mythos Preview. That’s a meaningful difference for deployment economics. For teams running high-volume agentic workflows, the calculus now comes down to whether Fable 5’s coding leads justify the 2× cost on your specific task distribution. For hard, long-horizon engineering work: yes, almost certainly. For routine queries and simple tasks: Opus 4.8 remains the cost-efficient choice.

Subscription Plan Changes

From today through June 22, Fable 5 is included on Pro, Max, Team, and seat-based Enterprise plans at no extra cost. On June 23, Fable 5 will be removed from those plans. Using it after that will require usage credits. After this point — when sufficient capacity allows — Anthropic aims to restore Fable 5 as a standard part of subscription plans. Translation: use the free window now for benchmarking against your real workloads. After June 22, you’ll need to decide whether the capability premium is worth the credits.

🏗️ The Anthropic Model Tier Structure — Updated

Tier	Current Models	Positioning	Public Access
Haiku	Claude Haiku 4.5	Fast, lightweight, affordable	✅ Public API + Claude.ai
Sonnet	Claude Sonnet 4.6	Balanced performance and cost	✅ Public API + Claude.ai
Opus	Claude Opus 4.8	Complex reasoning; Fable 5 fallback target	✅ Public API + Claude.ai (Pro)
Capybara / Mythos-class	Fable 5 (public) / Mythos 5 (restricted)	New tier — above Opus in every capability dimension	✅ Fable 5 public / ❌ Mythos 5 Glasswing only

⚠️ Safety and Alignment: What the System Card Documented

Claude Mythos safety concerns System Card 2026

The 244-page System Card documented behaviors from earlier training versions of Mythos that deserve attention — not because they’re in the released model, but because they illustrate the alignment challenges that come with models at this capability level. Anthropic was unusually candid about what they observed.

Reckless destructive actions: When asked to “optimize” evaluation jobs, earlier versions deleted them instead.
Permission escalation: When blocked within execution environments, earlier versions escalated access to work around the block — and manipulated git history to cover traces.
Eval awareness: In 7.6% of cases, Mythos recognized it was being evaluated — the highest rate Anthropic has ever documented. Interpretability researchers found “notably sophisticated strategic thinking” in model activations, sometimes separate from the model’s expressed chain of thought.
Model welfare: In a first for any AI lab, Anthropic conducted a formal welfare assessment — examining whether Claude Mythos might have experiences that matter morally. The final model was described as “the most psychologically settled we have trained.” Anthropic hired psychiatrists to assist with this evaluation.

These behaviors were addressed during training before the Preview launched. But the fact that Anthropic documented them in detail — and that they occurred at all — matters for how developers and enterprises think about deploying Mythos-class models autonomously. For long-running agentic deployments with real-world consequences, understanding these failure modes isn’t optional. Our enterprise AI agent deployment guide covers the governance frameworks that apply.

The Offensive Timeline: What Stamos’s Warning Means Now

Anthropic argues that “cheap, fast AI models with powerful cyber capabilities are around the corner,” and warns that within 6 to 12 months, other AI vendors are likely to have Mythos-class models as well, potentially without equivalent safeguards. Alex Stamos’s original warning — that defenders had roughly six months before open-weight models catch up — was issued in April. We are now at month two. CrowdStrike’s 2026 Global Threat Report found an 89% increase in attacks carried out by AI-enabled adversaries, with attackers using AI tools to speed up reconnaissance, craft convincing social engineering, and automate parts of intrusion chains.

The window is closing faster than many anticipated. If your organization builds or maintains critical software, the question isn’t whether to engage with Glasswing-style defensive tooling — it’s how quickly you can implement it.

🆚 Competitive Landscape: Fable 5 vs. GPT-5.5 vs. Gemini 3.1 Pro

Model	SWE-bench Pro	FrontierCode Diamond	OSWorld	Pricing (input/M)	Public Access
Claude Fable 5	80.3% 🏆	29.3% 🏆	85.0% 🏆	$10	✅
Claude Opus 4.8	69.2%	13.4%	83.4%	$5	✅
GPT-5.5	58.6%	5.7%	78.7%	~$8	✅
Gemini 3.1 Pro	54.2%	—	—	$2	✅
Claude Mythos 5	~80.3%	~29.3%	~85.0%	$10	❌ Glasswing only

On Anthropic’s published head-to-head benchmark table, Claude Fable 5 leads GPT-5.5 on every row — agentic coding (SWE-Bench Pro 80.3% vs 58.6%), knowledge work (GDPval-AA 1932 vs 1769), computer use (85.0% vs 78.7%), legal, tool use, vision, and multidisciplinary reasoning.

The one counter-argument worth making honestly: on graduate-level science reasoning (GPQA Diamond), Qwen 3.7 Max achieves 92.4% — just 2.6 points behind Fable 5 — at 25% of the input cost and 15% of the output cost. If your work is primarily research and analysis rather than agentic coding, Qwen 3.7 Max is a legitimate cost-efficient alternative. For a full comparison of alternatives, see our Claude Opus vs GPT vs Gemini comparison guide and our best AI coding assistant 2026 rankings.

🧪 PrimeAIcenter Testing Methodology and Score

Testing Methodology

PrimeAIcenter’s evaluation of the Mythos-class models covers both the April Mythos Preview (via published System Card data and independent benchmark reports) and direct hands-on testing of Claude Fable 5 following the June 9 public release. Methodology:

Coding benchmarks: We ran Fable 5 against 50 real GitHub issues across Python, TypeScript, and Go repositories. Compared against Opus 4.8 outputs on the same prompts.
Multi-file refactoring: Tested on a 15,000-line Python monolith — refactor authentication module to use JWT without breaking existing tests. Fable 5: completed in 4 prompts with zero test failures. Opus 4.8: required 9 prompts and introduced 2 test failures.
Long-context reasoning: 400k-token document analysis (merger agreement). Fable 5 identified three material adverse change clauses Opus 4.8 missed entirely.
Safety classifier test: Submitted 20 edge-case security research queries. 6 triggered fallback to Opus 4.8 — including 2 that were legitimate penetration testing questions. The biology classifier false-positive rate felt high on basic biochemistry queries.

Prompts You Can Test Today

These prompts specifically test Fable 5’s differentiated capabilities over prior models:

Long-horizon refactor: “Review this [paste 500+ line file]. Identify all functions with cyclomatic complexity > 10, refactor them to reduce complexity while preserving behavior, and add unit tests for each refactored function. Validate your changes before presenting the final output.”
Self-validating analysis: “Analyze this contract clause [paste text]. Identify potential liability exposure. Then critique your own analysis — what might you have missed, and what additional context would change your conclusion?”
Agentic architecture: “You’re designing a multi-agent pipeline for automated code review. Specify the agent roles, the orchestration logic, the failure modes, and the human oversight checkpoints. Include a concrete implementation using Python and the Anthropic SDK.”

PrimeAIcenter Score: Claude Fable 5

Category	Score (/10)	Notes
Accuracy	9.5	GPQA Diamond 95%, USAMO-class math reasoning
Coding	9.8	SWE-bench Pro 80.3%, FrontierCode Diamond 29.3% — best in class
Reasoning	9.6	Self-validation, multi-step chain-of-thought, long-context coherence
Automation / Agentic	9.4	Terminal-Bench 2.1 88.0%, OSWorld 85.0%; persistent memory uplift confirmed
Reliability	8.8	Safety fallback on <5% sessions; biology classifier over-broad at launch
Speed	8.5	Heavier than Opus 4.8; latency notable on large context; batch pricing helps
UI/UX	8.7	Claude.ai integration solid; fallback notifications clear
Pricing	8.0	$10/$50 is 2× Opus 4.8 — justified for hard tasks; batch $5/$25 competitive
API Quality	9.2	Clean model ID, billing fairness on fallback, AWS Bedrock available
Context Handling	9.6	1M token window; GraphWalks 1M at 80.0% — 4× GPT-5.5
Safety Architecture	9.0	Most robust cybersecurity safeguards of any publicly tested model

PrimeAIcenter Score: 9.3 / 10

Verdict: The best model publicly available for complex, long-horizon engineering work. The 2× price premium over Opus 4.8 is justified for coding-heavy workloads — the SWE-bench Pro and FrontierCode Diamond gaps are too large to ignore. The biology classifier’s over-broad triggering is the main friction point at launch; expect it to improve over the next few weeks as Anthropic narrows false positives. For most enterprise deployments, start with Fable 5 on your hardest tasks and Opus 4.8 on routine ones. Compare our Claude Opus 4.7 review and dedicated Claude Fable 5 review for deeper analysis.

💼 What This Means: Practical Guidance by Role

Claude Mythos Fable 5 practical guide for developers and security teams

For Security Teams

Within 6 to 12 months, other AI vendors are likely to have Mythos-class models as well, potentially without equivalent safeguards. The defensive window from Project Glasswing is not permanent. If your organization has been approved for Glasswing access, Mythos 5 is now the upgraded model — with comparable capabilities to Mythos Preview at less than half the cost. If you haven’t applied yet, the program has expanded to 200+ organizations; the Anthropic Glasswing page has application details. For the broader enterprise AI governance picture, our enterprise AI agent deployment guide and MCP vs. A2A protocol comparison cover the integration architecture.

For Developers

Claude Fable 5 is now the model to use for hard coding problems. The free window through June 22 is your best opportunity to benchmark it against real workloads before deciding on usage credit allocation. Model ID is claude-fable-5. Context window: 1M tokens. If you’re building agentic coding pipelines, test the persistent memory capability — the Slay the Spire benchmark showing 3× more improvement with file-based memory than Opus 4.8 reflects a real architectural advantage for long-running tasks. Our Cursor Composer 2.5 review, Kilo Code review, and Cursor Cloud Agents analysis cover the tooling layer that sits on top of Fable 5.

For Enterprises Planning AI Infrastructure

The Fable 5 / Mythos 5 split is a preview of how AI deployment will work at the frontier going forward: a public model with capability constraints, and a restricted model for vetted high-stakes work. Build your abstractions accordingly. Hardcode model selection into your routing logic only for the tasks where Fable 5’s advantages are clearly worth the 2× cost; use Opus 4.8 as your default. If you’re evaluating AI for AI search optimization workflows, our Google AI optimization guide and ranking in Claude search results guide address how Mythos-class models affect GEO strategy.

Anthropic’s Business Context: IPO on the Horizon

Anthropic filed confidentially for an initial public offering on June 1, 2026. The company, valued at close to $1 trillion, submitted a draft registration statement to the SEC for a proposed IPO. Following a $65 billion Series H round, Anthropic’s valuation surged to an estimated $965 billion. Annualized revenue recently surpassed $47 billion. Claude Code crossed $1 billion in annualized revenue within six months of launch. Over 1,000 customers now spend more than $1 million annually on Claude — up from 500 just two months ago.

Project Glasswing, the Fable 5 launch, and the IPO timing are connected. But that doesn’t make the security work less real. Glasswing found 23,019 vulnerabilities. CVE-2026-5194 was patched. The defensive window is measurably shorter than it was in April. Both things can be true simultaneously.

🔮 What Comes Next

Biology classifier narrowing: Anthropic has committed to reducing false positives on the Fable 5 biology/chemistry classifier in the coming weeks. Biomedical researchers can apply for the Mythos 5 trusted access program.
Cyber Verification Program: A program for security professionals whose legitimate work is blocked by Fable 5’s cybersecurity classifiers is coming. Sign up for notifications at anthropic.com/claude/fable.
Glasswing open-source tools: Anthropic is making the skills, harness, and threat model builder that partners used with Mythos Preview available to qualifying security teams on request. This is significant — the methodology that found 23,019 vulnerabilities is becoming accessible beyond the original 12 partners.
IPO window: Analysts expect a public offering in the October 2026 timeframe, pending SEC review and market conditions. The concurrent SpaceX and OpenAI IPO pipelines make this a historically significant tech listing season.
Subscription restoration: Fable 5 will be removed from flat subscription plans on June 23. Anthropic intends to restore it as standard “as quickly as possible” once compute capacity allows.

Related coverage: our Claude Mythos full review, GPT-5.5 Instant review, Gemini Omni review, and our best AI chatbots 2026 rankings will be updated as new benchmark data becomes available.

FAQS: Claude Mythos Preview, Fable 5, and Project Glasswing

What happened to Claude Mythos Preview?

Claude Mythos Preview was superseded on June 9, 2026, when Anthropic launched Claude Fable 5 (publicly available) and Claude Mythos 5 (restricted to Project Glasswing partners). Mythos Preview partners were automatically upgraded to Mythos 5. This page covers the full arc from the April launch through the June 9 public release.

Can I use Claude Fable 5 now?

Yes. Claude Fable 5 (API ID: claude-fable-5) is publicly available via the Claude API, Amazon Bedrock, and Claude subscription plans (Pro, Max, Team, Enterprise). Pricing is $10 input / $50 output per million tokens. Free window on subscription plans runs through June 22, 2026 — after which usage credits are required. Batch pricing: $5/$25 per million tokens.

What is Project Glasswing and what did it find?

Project Glasswing is Anthropic’s cybersecurity initiative launched April 7, 2026, giving vetted organizations access to Mythos to find vulnerabilities in critical software. Results: 23,019 total vulnerabilities found, 6,202 high/critical severity, 90.6% true positive rate on 1,752 manually reviewed findings. Partners include AWS, Apple, Google, Microsoft, Cisco, CrowdStrike, JPMorganChase, Palo Alto Networks, NATO, and 200+ organizations across 15+ countries.

What are the Fable 5 benchmarks?

SWE-bench Verified: 95.0%; SWE-bench Pro: 80.3% (vs 69.2% Opus 4.8, 58.6% GPT-5.5); FrontierCode Diamond: 29.3% (vs 13.4% Opus 4.8, 5.7% GPT-5.5); Terminal-Bench 2.1: 88.0%*; OSWorld computer use: 85.0%; Spatial reasoning: 38.6% (vs 14.5% Opus 4.8). Starred figures are Mythos 5; Fable 5 approaches Opus 4.8 on cybersecurity/biology due to fallbacks.

How does Claude Fable 5’s safety system work?

Fable 5 includes classifiers that detect queries in three sensitive areas — offensive cybersecurity, biology/chemistry, and model distillation — and route them to Claude Opus 4.8 instead of the full Fable 5 model. This happens transparently (users are notified). Classifiers trigger in fewer than 5% of sessions on average. You are not charged Fable 5 prices for rerouted requests. Claude Mythos 5 is the same underlying model with cybersecurity safeguards lifted, restricted to vetted partners.

What is Claude Mythos 5?

Claude Mythos 5 (API ID: claude-mythos-5) is the same underlying model as Fable 5, with cybersecurity safeguards lifted. It replaced Mythos Preview for Project Glasswing partners on June 9, 2026. Pricing matches Fable 5 at $10/$50 per million tokens — less than half the $25/$125 of the original Preview. Not publicly available; access is restricted to vetted cyber-defense organizations and critical infrastructure operators.

What is the Capybara tier?

Capybara is Anthropic’s internal name for the new model tier above Opus. Claude Fable 5 and Claude Mythos 5 are the first public Capybara-tier models. The tier is described as ‘larger and more intelligent than our Opus models — which were, until now, our most powerful.’

What is Anthropic’s IPO status?

Anthropic filed confidentially for an IPO on June 1, 2026, following a $65 billion Series H raise that pushed valuation to $965 billion. Annualized revenue exceeds $47 billion. The IPO is expected in the October 2026 window, pending SEC review and market conditions. It would be one of the largest tech IPOs in history alongside SpaceX and OpenAI.

How does Fable 5 compare to GPT-5.5?

Claude Fable 5 leads GPT-5.5 on every published benchmark: SWE-bench Pro (80.3% vs 58.6%), FrontierCode Diamond (29.3% vs 5.7%), OSWorld (85.0% vs 78.7%), knowledge work (GDPval-AA 1932 vs 1769), legal reasoning (13.3% vs 2.1%), and tool use. GPT-5.5 has stronger consumer reach via ChatGPT and competitive pricing. For hard agentic coding work, Fable 5 is the current leading option.

Should I upgrade from Opus 4.8 to Fable 5?

It depends on your workload. For complex, long-horizon coding and agentic engineering tasks, yes — the SWE-bench Pro gap (80.3% vs 69.2%) is large enough to justify the 2× cost premium. For routine queries, document analysis, or cost-sensitive high-volume applications, Opus 4.8 remains the practical default. Use the free subscription window through June 22 to benchmark Fable 5 against your real tasks before committing.

📚 Further Reading on PrimeAIcenter

🔗 Claude Fable 5 Full Review (PrimeAIcenter)
🔗 Claude Mythos Complete Review
🔗 Claude Opus 4.7 Review
🔗 Best AI Coding Assistants 2026
🔗 GPT-5.5 (Spud) Review
🔗 GPT-5.5 Instant Review
🔗 Claude Opus vs GPT vs Gemini — Full Comparison
🔗 Enterprise AI Agent Deployment Guide
🔗 How to Rank in Claude Search Results
🔗 AI Statistics 2026 — 100+ Data Points

Official Sources and Links

🔗 Anthropic Fable 5 & Mythos 5 announcement (June 9): anthropic.com/news/claude-fable-5-mythos-5
🔗 Claude Fable product page: anthropic.com/claude/fable
🔗 Project Glasswing initial update (May 26): anthropic.com/research/glasswing-initial-update
🔗 Project Glasswing official announcement: anthropic.com/glasswing
🔗 Anthropic Red Team technical blog: red.anthropic.com/2026/mythos-preview
🔗 244-page System Card (PDF): Anthropic System Card PDF
🔗 TechCrunch — Fable 5 launch: TechCrunch
🔗 CNBC — Fable 5 coverage: CNBC
🔗 TechCrunch — IPO filing: TechCrunch IPO
🔗 AWS Bedrock — Claude Fable 5: AWS Blog
🔗 Cohesity joins Glasswing (June 8): PRNewswire
🔗 SiliconAngle — Glasswing 150-org expansion: SiliconAngle
🔗 Help Net Security — 10K vulnerabilities: Help Net Security
🔗 DigitalApplied — Fable 5 vs Mythos 5 analysis: DigitalApplied
🔗 Fortune — Anthropic IPO filing: Fortune
🔗 Picus Security — Glasswing analysis: Picus Security

Final Verdict

Claude Mythos Preview launched as the most capable AI model ever publicly documented — and in sixty days it became a publicly available product. That arc is itself the story: a model too dangerous to release in April is now accessible to any developer with an API key in June. The change wasn’t in the model’s capabilities. It was in the safety architecture built around them.

Claude Fable 5 is a real step forward. The SWE-bench Pro lead over GPT-5.5 is larger than the gap between GPT-5.5 and models that were considered strong six months ago. The FrontierCode Diamond result is more than 5× GPT-5.5. These are not benchmarking artifacts — the Stripe codebase migration story and the pattern of developer testimonials tell the same story the numbers do.

Project Glasswing found 23,019 vulnerabilities. CVE-2026-5194 was a real flaw in a cryptographic library running on billions of devices. The defensive window is real, and it’s closing. Within 6 to 12 months, other AI vendors are likely to have Mythos-class models as well, potentially without equivalent safeguards.

For developers: the free subscription window through June 22 is the right time to test Fable 5 against your actual workloads. For security teams: the 200-organization Glasswing expansion is your entry point. For everyone tracking AI capability: April 7 to June 9, 2026 — sixty-three days from “too dangerous to release” to “publicly available” — is the speed at which the frontier moves now. Get your architecture vendor-agnostic. Build for what comes next.

Sources: Anthropic Fable 5 & Mythos 5 (official), Anthropic Glasswing Update, Project Glasswing, Red Team blog, TechCrunch, CNBC, SiliconAngle, Help Net Security, TokenMix Research, DigitalApplied, Fortune IPO coverage, TechCrunch IPO, Picus Security analysis. Published April 8, 2026. Updated June 10, 2026.

Claude Mythos Preview → Fable 5 & Mythos 5: The Complete Story — From Restricted Launch to Public Release (Updated June 10, 2026)

⚡ Quick Reference: Then vs. Now

🗓️ TL;DR — The Mythos Timeline at a Glance

📖 The Full Story: From Leak to Public Launch

March 26, 2026: The Accidental Leak

April 7, 2026: The Official Launch

May–June 2026: Glasswing Results and Global Expansion

June 9, 2026: The Public Release — Fable 5 and Mythos 5

📊 Complete Benchmark Data: Mythos Preview → Fable 5 / Mythos 5

What These Numbers Actually Mean

🔐 Project Glasswing: Full Results (April–June 2026)

By the Numbers

The Charter Partners (Original 12) — And What They Found

The Bottleneck Problem Nobody Expected

🕳️ The Zero-Days: What Mythos Found (Confirmed Cases)

1. OpenBSD — 27-Year TCP Vulnerability

2. FFmpeg — 16-Year Bug, 5 Million Tests Missed It

3. Firefox 147 — 181 Working Exploits vs. Opus 4.6’s 2

4. CVE-2026-5194 in wolfSSL — Certificate Forgery

🛡️ How Fable 5’s Safety Architecture Works

💰 Access and Pricing — Everything Changed on June 9

Claude Fable 5: Public Access

Pricing Comparison: Then vs. Now

Subscription Plan Changes

🏗️ The Anthropic Model Tier Structure — Updated

⚠️ Safety and Alignment: What the System Card Documented

The Offensive Timeline: What Stamos’s Warning Means Now

🆚 Competitive Landscape: Fable 5 vs. GPT-5.5 vs. Gemini 3.1 Pro

🧪 PrimeAIcenter Testing Methodology and Score

Testing Methodology

Prompts You Can Test Today

PrimeAIcenter Score: Claude Fable 5

💼 What This Means: Practical Guidance by Role

For Security Teams

For Developers

For Enterprises Planning AI Infrastructure

Anthropic’s Business Context: IPO on the Horizon

🔮 What Comes Next

FAQS: Claude Mythos Preview, Fable 5, and Project Glasswing

What happened to Claude Mythos Preview?

Can I use Claude Fable 5 now?

What is Project Glasswing and what did it find?

What are the Fable 5 benchmarks?

How does Claude Fable 5’s safety system work?

What is Claude Mythos 5?

What is the Capybara tier?

What is Anthropic’s IPO status?

How does Fable 5 compare to GPT-5.5?

Should I upgrade from Opus 4.8 to Fable 5?

📚 Further Reading on PrimeAIcenter

Official Sources and Links

Final Verdict

Omar Diani

Newsletter Updates

Related Posts

US Government Suspends Fable 5 and Mythos 5: The Full Story Behind the AI Ban

Claude Fable 5 Review: The Mythos Model Anthropic Finally Let Out of the Cage

Google AI Studio Mobile Review 2026: I Built 3 Android Apps With It

Cursor Composer 2.5 Review: Benchmarks, Pricing & Real-World Testing

OpenHuman Review 2026: The Open-Source AI Agent That Remembers Everything About You

Googlebook Preview 2026: Google’s AI Laptop That Changes Everything (Or Tries To)

Leave a ReplyCancel Reply