Opus 4.7 lands with self-verification and agentic endurance, setting off a cross-lab benchmark sprint
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
Also dominant that day:
- @claudeai — Claude Design prototyping tool launch — Claude Design turns conversation into prototypes and slides, reigniting the AI-replaces-designers debate
- @OpenAI — OpenAI Codex desktop agent superapp — Codex grows into a persistent Mac desktop agent that remembers workflows and drives apps autonomously