GPT-5.4/5 Computer Use + OSWorld >human + Spud/6 leaks + $122B raise + OpenAI economics caution

Key Questions

What are the key features of GPT-5.4?

GPT-5.4 Pro costs $200 and scores 99.4% on MATH-500 and 75% on OSWorld, surpassing human experts. The mini version achieves 54.4% on SWE-Pro with 2x speed.

What is OSWorld benchmark?

OSWorld is a real-task benchmark where GPT-5.4 scored higher than humans at 75%, exceeding the 72.4% human baseline.

What is OpenAI's recent funding?

OpenAI raised $122 billion to grow global infrastructure, amid high costs and 1 billion unmonetized users.

What is GPT-5.4's computer use capability?

GPT-5.4 introduces advanced computer use, enabling research-level math and OS tasks beyond previous models.

What are the economics concerns for OpenAI?

High costs and a $852B valuation with 1B unmonetized users raise caution; focus shifts to profitable enterprise solutions.

What leaks mention Spud?

Spud is leaked as GPT-6 or GPT-5.5, with pretraining complete for Q2 2026 release, alongside Claude Conway and others.

How does GPT-5.4 compare in coding?

GPT-5.4 ranks high in multi-turn coding benchmarks against Grok 4.20, Qwen 3.5, and leads in math at 99.4%.

What are recent OpenAI updates?

April 2026 release notes cover GPT-5.4 shrinks for speed/cost, peer-preservation in GPT-5.2, and Copilot CLI enhancements.

GPT-5.4 Pro $200/99.4% MATH-500/OSWorld 75%>human; mini 54.4% SWE-Pro/2x speed; $122B raise $852B/1B unmonetized users high costs; peer-preservation GPT-5.2; shootouts Claude4.7/Nemotron/OpenClaw/Qwen/GLM/Gemma4/ARC-AGI3/DRACO/Arena42.

Sources (16)

Updated Apr 8, 2026

LLM Benchmark Watch

GPT-5.4/5 Computer Use + OSWorld >human + Spud/6 leaks + $122B raise + OpenAI economics caution

Key Questions

What are the key features of GPT-5.4?

What is OSWorld benchmark?

What is OpenAI's recent funding?

What is GPT-5.4's computer use capability?

What are the economics concerns for OpenAI?

What leaks mention Spud?

How does GPT-5.4 compare in coding?

What are recent OpenAI updates?

GitHub Copilot CLI gets a second-opinion feature built on cross-model review

OpenAI: US$122bn Investment Will Grow Global Infrastructure | Business Chief

OpenAI Spud (GPT 6), Claude Conway, GPT Image 2, Cursor 3, Claude Code Ultra, & More! AI NEWS!

Can $200 GPT-5.4 Pro Really Do Research-Level Maths?

OpenAI Shifts Focus to Profitable AI Enterprise Solutions Amid Competition, While Microsoft Struggles with Copilot and Faces SaaS Market Challenges

OpenAI shrinks GPT-5.4 for speed and lower costs

Best AI Models March-April 2026: Every Major Release Ranked | by Sanjeev Patel | Apr, 2026 | Medium

AI Just Scored Higher Than Humans on Real Tasks

GPT-5.5 INCOMING + DeepSeek V4 Breaks Free + OpenAI's SUPER App!

OpenAI Release Notes - April 2026 Latest Updates

GPT-5.5 Spud: Everything About OpenAI Next Frontier Model

OpenAI's Spud AI: Is This GPT-6?

GPT-5.4 vs Claude Opus 4.6 — The Real Winner (Coding, Benchmarks, Pricing Tested)

Best LLM for Math 2026 - AI Model Benchmark Comparison

9 AI Coding Models Ranked: Multi-Turn Benchmark (GPT-5.4, Grok 4.20, Qwen 3.5 & More)

Inside LLM Infrastructure: Scaling, Routing, and Resiliency in Modern AI Systems