On March 24, 2026, The Information broke a story that reset the entire AI landscape: OpenAI had completed pre-training on a new model internally codenamed “Spud.” CEO Sam Altman told employees it was a “very strong model” that could “really accelerate the economy.” OpenAI President Greg Brockman went further, describing it on the Big Technology podcast as the result of “two years worth of research” that would set a new benchmark for AI models — coining the evocative phrase “big model smell” to capture the feel of something fundamentally different.
Within 24 hours: Sora, OpenAI’s flagship video generation platform, was cancelled. Its compute redirected to Spud. A billion-dollar Disney licensing deal, gone. The product division was renamed “AGI Deployment.” And Sam Altman handed off day-to-day safety oversight to focus on infrastructure — data centers, chips, the $500 billion Stargate facility in Abilene, Texas.
This is not a minor model update. This is OpenAI reorganizing its entire company around a single bet.
This GPT-5.5 review compiles everything confirmed, leaked, and credibly reported about GPT-5.5 (Spud) as of April 2, 2026 — what it is, what it can do, when it arrives, how it compares to Claude Mythos and Gemini 3.1 Pro, and what it means for developers, businesses, and the trajectory of AI itself. We update this page as new information becomes available.
GPT-5.5 (Spud) at a Glance
| Detail | What We Know |
|---|---|
| Internal Codename | Spud |
| Expected Name | GPT-5.5 or GPT-6 (unconfirmed by OpenAI) |
| Pre-training Completed | ~March 24, 2026 (confirmed via The Information) |
| Training Facility | Stargate, Abilene, Texas |
| Public Release | No confirmed date — Q2 2026 consensus estimate |
| Parameters | Undisclosed (speculative estimates: 1–10T+ sparse MoE) |
| Architecture | Unconfirmed — novel capability hinted by employees |
| Altman’s Description | “Very strong model that could really accelerate the economy” |
| Brockman’s Description | “Two years of research — big model smell” |
| Strategic Role | Foundation of OpenAI’s super app (ChatGPT + Codex + Atlas) |
| Unique Capability | “Very different from what we’ve seen before” (per employees) |
| Pricing | Not announced |
What Is GPT-5.5 Spud? The Full Story

The codename “Spud” is a placeholder — the same way Google’s Android releases used desserts before switching to numbers. Whether the model ultimately ships as GPT-5.5, GPT-6, or something entirely different remains unconfirmed. What OpenAI has confirmed: training is complete, the model is in safety evaluation, and a release is expected within weeks of March 24.
To understand why Spud matters, you need to understand the context in which it was built. OpenAI has been in an internal “Code Red” state since at least December 2025, after competitors Claude and Gemini closed the performance gap more aggressively than the company’s leadership expected. GPT-5.4 — released March 5, 2026 — was a strong model, achieving 83% on GDPval (knowledge work tasks) and strong benchmark scores across coding and reasoning. But it was not enough to pull decisively ahead.
Spud represents something different. According to NewsBytesApp reporting on Greg Brockman’s Big Technology podcast appearance, Brockman described the improvement in contextual understanding as one of Spud’s core advances: the model will better understand what users are trying to accomplish without requiring detailed explanations. “When you ask a question and the AI doesn’t quite get it, it’s always so disappointing — and you have to explain,” Brockman said. “You’re just like, ‘you really should be able to figure this out.'” Spud aims to eliminate that friction.
OpenAI employees have reportedly hinted at a capability in Spud that is “very different from what we’ve seen before” — a phrase that, in context, suggests something more than incremental improvement on existing benchmarks. The company has not elaborated publicly on what that capability is.
The Sora Decision: What It Reveals About GPT-5.5
The most revealing signal about how seriously OpenAI views Spud is not a benchmark claim. It is the cancellation of Sora.
Sora, OpenAI’s video generation model, launched six months before Spud’s pre-training completion. The company had signed a reported billion-dollar licensing deal with Disney. The Sora team had built an impressive product that generated significant media coverage. And then OpenAI shut it down — redirecting its considerable compute resources entirely to Spud and the planned super app.
This decision surprised even senior employees. The compute requirements for high-fidelity video generation are significant; Sora’s infrastructure represented a major resource commitment. The fact that OpenAI chose to sacrifice that investment signals how confident — or how pressured — the company is about Spud. The Sora team has not been disbanded; it has been redirected to world models for robotics, a direct move into territory held by Google DeepMind and Nvidia. Video generation is off the roadmap for OpenAI’s main products entirely.
According to reporting on OpenAI’s strategic restructuring, Spud is also the planned architecture powering a new desktop “super app” that will merge ChatGPT, the Codex coding agent, and the Atlas browser into a single unified interface. The goal is to let users move between conversational AI, code generation, web research, and agentic task execution in a single session without switching tools. The first major preview is expected before the end of 2026.
GPT-5.5 Benchmarks: What We Know and What We Can Infer

No official benchmark scores for Spud have been released. What we have is a combination of confirmed executive statements, credible employee leaks, and reasonable inference from OpenAI’s research trajectory. Here is what the evidence supports:
1. Dramatically Improved Contextual Understanding
This is the most clearly described capability from Brockman’s public statements. Spud is designed to infer user intent from incomplete or ambiguous requests — understanding what someone is trying to accomplish without requiring precise prompt engineering. This would represent a meaningful shift in usability, particularly for non-technical users who currently struggle with ChatGPT’s sensitivity to prompt wording.
The practical implication: tasks that currently require multiple rounds of clarification and re-prompting would complete correctly on the first attempt. For enterprise workflows where AI is used by non-developers across departments, this is not a minor improvement — it changes who can effectively use the tool.
2. Reasoning at a New Scale
The “two years of research” framing from Brockman almost certainly references OpenAI’s sustained investment in reasoning model architecture, starting with the o-series and continuing through GPT-5.x. To understand the trajectory: GPT-5.4 already achieved 83% on GDPval — matching or exceeding professionals in 83% of knowledge work task comparisons. GPT-5.2 achieved 52.9% on ARC-AGI-2, a benchmark measuring abstract reasoning that resists memorization. Each generation has shown meaningful progress.
Spud is being trained at the Stargate facility in Abilene, Texas — reportedly involving 100,000+ GPU clusters — which suggests a scale of compute investment beyond any previous GPT-5.x release. Architecture analysts speculate sparse Mixture-of-Experts (MoE) design with 1–10 trillion total parameters, though only a fraction would be active per token. This architecture enables massive scale without proportional inference costs.
3. The Novel Capability
Multiple employees have hinted that Spud contains a capability that is “very different from what we’ve seen before.” This is the most intriguing and least verifiable claim about the model. Reading the available signals together:
OpenAI’s formal org chart now includes “AGI Deployment” as a category for the first time — Fidji Simo’s product division was renamed explicitly to reflect that framing. Altman’s language around Spud is consistently macro rather than feature-level: “accelerate the economy” is the language of systems transformation, not product roadmaps. The Sora team’s pivot to “world models” and robotics suggests OpenAI sees the next competitive frontier as physical AI and simulation.
Taken together, this points toward agentic and autonomous task execution capabilities that go beyond what GPT-5.4 delivers. Whether that means longer-horizon autonomous task completion, deeper world modeling, or something architecturally novel is not yet clear. The honest answer is: we don’t know, and anyone claiming certainty about this is guessing.
4. Scientific and Mathematical Reasoning
This is an area where OpenAI has been building credibility with documented results. GPT-5.2 Pro reportedly solved three Erdős problems in a single week after its January 2026 release. On March 23, 2026, mathematician Terence Tao published a paper on arXiv where ChatGPT Pro generated a key duality-based proof for a core inequality — with Tao describing the collaboration as an example of AI as a “trustworthy co-author.” Google’s AlphaEvolve handled the numerical confirmation; ChatGPT handled the proof generation.
According to Revolution in AI’s analysis of the Tao paper and Spud’s development, this represents the arrival of Tao’s 2023 prediction — that AI would become a “trustworthy co-author” in mathematics by 2026 — arriving slightly ahead of schedule. If Spud improves on this foundation, the implications for scientific research are significant.
5. Agentic Workflow Integration
GPT-5.4 already introduced native computer-use capabilities in the API — agents can operate computers and carry out complex workflows across applications with 1M token context. Spud, as the planned architecture for the unified super app, is being designed for even deeper agentic integration. The Codex CLI has been receiving rapid updates through early April 2026, including first-class plugins, multi-agent workflows, and dynamic model providers — all signaling active preparation for a more powerful underlying model.
GPT-5.5 vs GPT-5.4: What Changes?

GPT-5.4, released March 5, 2026, is a strong model by any measure. Understanding what Spud adds requires understanding where GPT-5.4 currently sits:
| Benchmark | GPT-5.4 | GPT-5.2 | Gemini 3.1 Pro | Claude Opus 4.6 |
|---|---|---|---|---|
| GDPval (knowledge work) | 83.0% ⭐ | 70.9% | ~75% | ~78% |
| GPQA Diamond (PhD science) | ~92% | 92.4% | 94.3% ⭐ | ~88% |
| ARC-AGI-2 (abstract reasoning) | ~60% | 52.9% | 77.1% ⭐ | ~45% |
| SWE-bench (coding) | ~74% | 55.6% | 80.6% ⭐ | 80.8% ⭐ |
| OSWorld-V (computer use) | 75% ⭐ | — | — | — |
| Context Window | 1M tokens | 200K | 1M tokens | 200K tokens |
GPT-5.4 leads on knowledge work tasks (GDPval) and computer use (OSWorld-V). Gemini 3.1 Pro leads on scientific reasoning (GPQA Diamond) and abstract thinking (ARC-AGI-2). Claude Opus 4.6 leads on coding (SWE-bench). The competitive landscape is genuinely fragmented — no single model wins everything.
Spud’s mission, based on available evidence, is to close the gaps where GPT-5.4 trails and extend the leads where it already wins — particularly in the GDPval knowledge work category that OpenAI has made its primary commercial differentiator. If Spud also delivers the novel capability employees have hinted at, the benchmark landscape for April–May 2026 may need to be rebuilt from scratch.
GPT-5.5 vs Claude Mythos: The Model War Everyone Is Watching
April 2026 is being described by AI researchers as potentially the most consequential month in AI model history — because two of the most anticipated models ever described are expected to launch in the same window.
GPT-5.5 (Spud): Pre-training complete March 24, release expected within weeks. OpenAI has strong incentive to ship before Claude Mythos goes public.
Claude Mythos (Capybara tier): Accidentally leaked March 26–27, 2026, confirmed by Anthropic. Described as “the most powerful model we’ve ever built” with “unprecedented cybersecurity capabilities.” In early access with select organizations. Polymarket gives ~25% probability of public announcement before April 30.
Both are unreleased as of this writing — but both define the upper boundary of what AI can do in the near term. Here is what the evidence suggests about how they compare:
| Dimension | GPT-5.5 (Spud) | Claude Mythos (Capybara) |
|---|---|---|
| Pre-training status | ✅ Complete (March 24) | ✅ Complete (before March 26 leak) |
| Public availability | Weeks away | Months away (safety concerns) |
| Primary strength | General intelligence, knowledge work, contextual understanding | Cybersecurity, coding, academic reasoning |
| Key risk | Failing to deliver on AGI-adjacent marketing | “Unprecedented cybersecurity risks” — deliberate slow rollout |
| Architecture signal | “Novel capability” — unspecified | New Capybara tier above Opus — structural change |
| Commercial strategy | Super app: ChatGPT + Codex + Atlas unified | Enterprise early access — cybersecurity focus first |
| IPO angle | Expected IPO late 2026/early 2027 at $840B valuation | Expected IPO October 2026 at $60B+ |
| Pricing estimate | TBA — likely above GPT-5.4 Pro | TBA — likely 2–3× Claude Opus 4.6 |
The honest framing: GPT-5.5 will likely reach the public first, giving OpenAI a window to recapture developer mindshare from Claude Code and set the benchmark narrative before Mythos arrives. Claude Mythos will likely be more powerful when it does arrive — but with a slower, more cautious rollout and significantly higher pricing. Whether that trade-off favors OpenAI or Anthropic commercially depends heavily on what “novel capability” Spud actually delivers.
For the full breakdown of the Anthropic side of this race, see our complete Claude Mythos review.
The Super App: What GPT-5.5 Powers Beyond ChatGPT

Perhaps the most strategically significant aspect of Spud is not the model itself — it is what the model enables. OpenAI is building a unified desktop “super app” that merges three previously separate products into one native experience:
ChatGPT — conversational AI and general task completion
Codex — the coding agent that competes directly with Claude Code
Atlas — a built-in browser for web research and task execution
According to reporting on OpenAI’s super app development, the goal is to enable users to move between conversation, coding, browsing, and execution in a single native desktop environment without switching between tabs or applications. The first major preview is expected before the end of 2026, with Spud serving as the underlying architecture for all three capabilities simultaneously.
This is OpenAI’s direct answer to the competitive pressure Claude Code has exerted in the developer market. Claude Code is now used by 41% of professional developers — surpassing Copilot’s 38% — and its $2.5 billion ARR as of February 2026 represents one of the fastest-growing products in AI history. The super app, powered by Spud, is designed to challenge that dominance by offering an integrated coding + browsing + conversation experience that neither Claude Code nor Cursor nor Copilot currently delivers.
For developers evaluating AI coding tools in this context, our best AI coding assistant 2026 comparison covers the current Claude Code vs Cursor vs Copilot landscape that Spud is designed to disrupt.
OpenAI’s Organizational Bet: AGI Deployment
The company-wide reorganization that accompanied Spud’s announcement is worth examining closely, because it reveals how seriously OpenAI views this moment — and what the risks are.
Fidji Simo’s product organization has been renamed “AGI Deployment” — the first time “AGI” has appeared as a formal category anywhere in OpenAI’s org chart. Sam Altman has stepped back from direct oversight of the safety and security teams; safety now reports to Mark Chen, security to Greg Brockman. Altman’s stated reason is infrastructure: he needs to focus on chips, data centers, and the $500 billion Stargate buildout at unprecedented scale.
Reading this charitably: it is a reasonable operational division of responsibilities as the company scales. Safety benefits from dedicated executive ownership closer to research teams. Altman freeing himself for infrastructure is a logical choice.
Reading this skeptically: OpenAI, a company whose founding mission centers on safe AGI development, is removing its CEO from safety oversight at precisely the moment it is about to deploy a model it hints could be near-AGI-level capability. According to FindSkill AI’s analysis of OpenAI’s restructuring, the company also recently cancelled Adult Mode for ChatGPT as part of the AGI refocus — consolidating around productivity and enterprise use cases, and cutting anything that adds risk or complexity to the IPO narrative.
The honest assessment: both readings are simultaneously true. OpenAI is making reasonable operational choices for a company at this scale. And the timing and framing of those choices deserve scrutiny from anyone thinking carefully about AI safety.
Counterpoint from the benchmarks: ARC-AGI-3, the most rigorous public reasoning benchmark, shows frontier models at 0.37% compared to human 100%. Whatever “AGI Deployment” means as an org chart category, the gap between current AI capability and genuine AGI remains very large by the most demanding public measures.
GPT-5.5 Release Date: When Will It Launch?
No official release date has been confirmed. Based on available signals:
| Signal | Implication |
|---|---|
| Pre-training completed ~March 24, 2026 | Safety eval + red-teaming phase underway |
| Altman said “a few weeks” on March 24 | That window is open in April 2026 |
| OpenAI’s recent release cadence (5 GPT-5.x in 7 months) | No historical long gaps between completion and deployment |
| Competitive pressure from Claude Mythos leak (March 27) | OpenAI has strong incentive to ship first |
| IPO timeline (late 2026/early 2027 at $840B valuation) | Spud must be deployed and generating revenue before IPO |
| AI researcher consensus | “Q2 2026 AI model war” — Spud expected April–June |
The most likely scenario: a public announcement in April or May 2026, with staged rollout beginning with Plus and Pro users, followed by API access and then Enterprise deployment. Given that Altman specifically said “a few weeks” from March 24, April 2026 is the most probable launch window for at least an initial announcement or preview.
According to the AI Models April 2026 reference guide, prediction markets currently frame this as a genuine open question: “If it drops in April, the benchmark leaderboards will need a full reset.”
GPT-5.5 Pricing: What to Expect
No official pricing has been announced. For context, the current GPT-5.4 pricing structure is:
| Model | Input (per M tokens) | Output (per M tokens) | ChatGPT Access |
|---|---|---|---|
| GPT-5.3 Instant | ~$2.50 | ~$10 | All users |
| GPT-5.4 Thinking | ~$10 | ~$30 | Plus/Pro/Enterprise |
| GPT-5.4 Pro | ~$15 | ~$60 | Pro/Enterprise only |
GPT-5.5 will almost certainly be priced above GPT-5.4 Pro at launch, with efficiency improvements over time that lower costs — following the same pattern as GPT-5.4 vs GPT-5.2. For enterprise customers, expect higher-tier requirements than GPT-5.4 for access at launch. For consumer ChatGPT users, a tiered rollout means Plus subscribers will likely see it before Free users. A version of Spud may eventually power the Free tier, replacing GPT-5.3 Instant as the default.
Who Should Care About GPT-5.5?
Developers
The Codex integration angle matters most here. If Spud significantly improves on GPT-5.4’s already strong coding capabilities and becomes the model powering the Codex + Atlas super app, it creates a new benchmark for what AI-assisted development looks like. Developers currently using Claude Code as their primary agentic tool will want to evaluate Codex-on-Spud when it ships. For current tool comparisons, our best AI chatbots 2026 guide and best AI tools of 2026 cover the landscape while we wait.
Businesses and Enterprises
The GDPval benchmark that GPT-5.4 already leads — 83% of knowledge work tasks matching or exceeding professional performance — is likely to improve further with Spud. For enterprises evaluating AI for professional workflows across 44 occupations, this is the most practically relevant capability dimension. Combined with the super app’s integrated interface, Spud could meaningfully reduce the workflow friction that currently limits enterprise AI adoption. For strategic frameworks on enterprise AI deployment, our enterprise AI agent deployment guide covers governance, security, and rollout patterns.
Content Creators and Marketers
GPT-5.x models have historically shown strength in writing, creative generation, and content strategy. Spud, if Brockman’s “contextual understanding” claim holds, should be noticeably better at completing creative tasks from brief, natural-language prompts. For creators exploring how to monetize AI capabilities, our guide to making money with AI covers practical frameworks. For the full toolkit, our best AI tools for content creators is updated regularly.
Researchers and Scientists
The Terence Tao collaboration is the clearest signal about Spud’s potential in this domain. If GPT-5.4/Pro is already capable of generating proofs for mathematicians at Tao’s level, what does Spud — built on two more years of research focused on reasoning — enable? This is an open question, but a genuinely exciting one for anyone working in theoretical mathematics, computational science, or fields where AI-assisted discovery is becoming viable.
Honest Assessment: Reasons for Skepticism
Every honest review of an unreleased model requires addressing the skeptical case. For GPT-5.5, that case is real:
The pattern of AGI-adjacent hype. OpenAI has a documented history of framing new models with macro-level language that exceeds eventual delivery. GPT-5 itself was described as “generally intelligent” — and delivered strong benchmark improvements without the qualitative leap many expected. “Accelerate the economy” is the kind of phrase that, historically, has often preceded incremental updates. The “AGI Deployment” rename may be investor relations as much as engineering reality.
The ARC-AGI-3 reality check. Frontier models currently score 0.37% on ARC-AGI-3, against human 100%. This is not a minor gap. The most honest benchmark for genuine general intelligence shows the frontier is much further from human-level than internal OpenAI communications suggest.
GPT-5.4 was not a market-leader release. Despite strong GDPval scores, GPT-5.4 did not recapture the benchmark crown from Gemini 3.1 Pro (which leads 13 of 16 major benchmarks) or pull ahead of Claude Opus 4.6 on coding. Spud needs to actually deliver on “two years of research” to change that competitive position — and only the benchmarks will tell us whether it does.
The honest position, as Revolution in AI’s analysis frames it: Spud may deliver everything Altman’s internal framing suggests, or it may be another capable but incremental model with AGI-adjacent marketing applied to it ahead of an IPO. Until the benchmarks are public, both are possible.
GPT-5.5 vs Gemini 3.1 Pro vs Claude Opus 4.6: Where It Fits Today

While Spud is not yet released, developers and businesses need to make decisions with available tools. Here is the honest current landscape heading into April 2026, before Spud changes it:
| Use Case | Best Current Model | Why |
|---|---|---|
| Scientific reasoning, PhD-level tasks | Gemini 3.1 Pro | 94.3% GPQA Diamond — highest verified |
| Abstract reasoning, novel problems | Gemini 3.1 Pro | 77.1% ARC-AGI-2 — leads significantly |
| Coding, agentic development | Claude Opus 4.6 | 80.8% SWE-bench — production coding leader |
| Knowledge work, professional tasks | GPT-5.4 | 83% GDPval — beats professionals at most tasks |
| Computer use, OS-level agents | GPT-5.4 | 75% OSWorld-V — native computer use API |
| Long context, massive codebases | Gemini 3.1 Pro | 1M token context window |
| Cost-efficient high volume | Gemini 3.1 Flash-Lite | 2.5× faster, $0.25/M tokens |
| Best overall (pre-Spud) | Gemini 3.1 Pro | Leads 13 of 16 benchmarks per independent eval |
For the broader competitive context and statistics on AI adoption, our AI statistics 2026 guide covers market data and capability benchmarks across all major providers. For a comparison of AI models on specific workflow types, our Gemini 3.1 Pro free tier review covers the current benchmark leader in detail. And for the full picture of how these models intersect with enterprise AI agent use, our AI agent guide covers what autonomous AI systems can actually do today.
FAQS: GPT-5.5 review
What is GPT-5.5 Spud?
Spud is OpenAI’s internal codename for its next major AI model, which completed pre-training around March 24, 2026. It is expected to be publicly released as GPT-5.5 or GPT-6 in Q2 2026. CEO Sam Altman described it as a “very strong model” that could “really accelerate the economy,” while OpenAI President Greg Brockman said it represents two years of research and carries a “big model feel.
When GPT-5.5 release?
No official date has been confirmed. Based on Altman’s “a few weeks” comment from March 24, 2026, and OpenAI’s historical release cadence, April or May 2026 is the most likely window for a public announcement. The AI research community describes Q2 2026 as consensus.
Is GPT-5.5 the same as GPT-6?
Unconfirmed. OpenAI has not committed to either naming convention publicly. Whether Spud ships as GPT-5.5, GPT-6, or an entirely different name depends on OpenAI’s internal assessment of how significant a leap it represents over GPT-5.4. Given the language used internally — “big model smell,” “accelerate the economy” — GPT-6 branding is possible but not certain.
Why did OpenAI cancel Sora for Spud?
OpenAI redirected Sora’s compute resources to Spud and the planned super app because the company assessed those investments as higher-priority for its AGI roadmap and competitive position. Sora’s compute requirements were reportedly larger than senior employees expected. The strategic pivot reflects OpenAI’s shift away from experimental creative tools toward productivity-focused enterprise AI applications.
What is the OpenAI super app?
The super app is a planned unified desktop platform that merges ChatGPT (conversational AI), Codex (coding agent), and Atlas (browser) into a single native interface. Spud is expected to power all three. The goal is to let users move between conversation, coding, web research, and task execution without switching tools. First previews expected before end of 2026.
How does GPT-5.5 VS Claude Mythos?
Both are unreleased as of April 2, 2026. GPT-5.5 is expected within weeks; Claude Mythos will likely take months due to safety evaluation concerns around its cybersecurity capabilities. Based on available signals, GPT-5.5 focuses on general intelligence, contextual understanding, and knowledge work; Claude Mythos focuses on cybersecurity, coding, and academic reasoning. They represent different competitive bets rather than direct equivalents.
Will GPT-5.5 be available on the free ChatGPT plan?
Likely not immediately. OpenAI’s pattern is to release major models first to Plus/Pro subscribers, then Enterprise, then eventually as the default for all users. GPT-5.5 Pro (analogous to GPT-5.4 Pro) will likely remain exclusive to paid tiers. A lighter version may eventually replace GPT-5.3 Instant as the free-tier default, following the same pattern as previous model generations.
What does “AGI Deployment” mean for OpenAI?
OpenAI formally renamed Fidji Simo’s product division to “AGI Deployment” — the first time AGI has appeared in OpenAI’s official org chart. This signals that the company now frames its product work as deploying AGI capabilities, not just AI features. Whether this reflects genuine capability or serves IPO narrative purposes is a subject of active debate in the AI research community.
Should I wait for GPT-5.5 before building on OpenAI’s API?
No. The OpenAI API is model-agnostic — switching to Spud when it launches will require only a model parameter change in your API calls. Build with GPT-5.4 now and upgrade when Spud is available. Waiting months for a model whose exact release date is unknown is not the right strategy for active development.
What happened to the Disney Sora deal?
The reported billion-dollar Disney licensing deal for Sora’s video generation capabilities is now cancelled following Sora’s discontinuation. Disney had been expected to be a major early enterprise customer for AI-generated video. The Sora team has been redirected to world models and robotics research.
Final Assessment: What GPT-5.5 Means for the AI Landscape
GPT-5.5 (Spud) is the most anticipated AI release in a year that has already been extraordinary. In the span of three months, the frontier has shifted multiple times: Gemini 3.1 Pro led 13 of 16 benchmarks, Claude Sonnet 4.6 led real-world work evaluations, GPT-5.4 achieved 83% on knowledge work tasks, and Claude Mythos was accidentally revealed as a “step change” above the current Opus ceiling.
Spud arrives into this landscape with two years of research, Stargate-scale compute, a renamed product division, a cancelled flagship product, and CEO language that frames it as a macro-economic event rather than a feature release. That combination of signals could mean OpenAI is about to deliver something genuinely transformative. It could also mean the company is managing its IPO narrative with the same skill it applies to model development.
The answer will be in the benchmarks. And if Altman’s “few weeks” framing holds, those benchmarks are coming in April 2026.
This page will be updated immediately upon official announcement. Bookmark it.
Sources: NewsBytesApp, The Hans India, Revolution in AI, Geeky Gadgets, Trending Topics, The Decoder, FindSkill AI, AI Models April 2026, Yuyjo, Quasa.io, FindSkill AI — Mythos comparison, Medium — Spud architecture analysis, Tech Insider benchmarks, Vellum AI benchmarks, Digit.in, OpenAI GPT-5.4 official, OpenAI release notes, MacRumors super app, Hypebeast, Ovexro. Updated April 2, 2026.




