Technology

Anthropic Just Took the Lead Back — Claude Opus 4.7 Crosses 87% on SWE-bench, and the Numbers Tell a Cleaner Story Than the Hype

For once the "lead retaken" headline survives the benchmark math. Opus 4.7 jumps SWE-bench Verified from 80.8 to 87.6 percent — ahead of Gemini 3.1 Pro at 80.6 — and pulls clear on the metrics that matter to teams shipping code.

How This Impacts You
Opus 4.7 is now the strongest generally available model for coding, vision, and knowledge work, and its launch is the first time a frontier lab has openly admitted that what it shipped is not the best of what it has built.
FLASHFEED Desk · Updated: 03 Jun 2026, 10:59:32 · 4 min read
The release of Claude Opus 4.7 narrowly retakes the throne for the most powerful generally available large language model, and unlike most "lead retaken" press cycles, the benchmark math actually supports the headline. SWE-bench Verified jumps from 80.8 to 87.6 percent, a nearly seven-point gain that puts it ahead of Gemini 3.1 Pro at 80.6. SWE-bench Pro, the harder multi-language coding test, jumps from 53.4 to 64.3. These are not marginal improvements. They are the difference between a model that handles common engineering tasks and a model that handles the messy ones the previous generation regularly stumbled on.

Compare the curve to its predecessor. Opus 4.6 was already the best general-purpose model for agentic coding work in late 2025; the gap to GPT-5.4 was real but contestable. Opus 4.7 widens that gap on the metrics that matter to teams actually shipping code: SWE-bench, MCP-Atlas at 77.3 percent for multi-tool orchestration, and a vision benchmark that jumps from 57.7 to 79.5 percent for visual navigation without tools. Each of those numbers, taken alone, is a normal generational improvement. Taken together, they describe a model that is meaningfully more useful than what came before.

The most underdiscussed metric is GDPVal-AA, the knowledge-work evaluation. Opus 4.7 leads at an Elo of 1753, with GPT-5.4 at 1674 and Gemini 3.1 Pro at 1314. That spread is not a benchmark artifact; it reflects what real users keep observing in side-by-side comparisons. Where coding benchmarks measure what models can do, GDPVal-AA measures what they actually do for the kind of professional work people pay for. The 79-point Elo gap to GPT-5.4 corresponds to roughly a 60 percent win rate in head-to-head matches. The 439-point gap to Gemini 3.1 Pro is, in this kind of evaluation, a generational distance.
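For readers who want to check the Elo arithmetic, here is a minimal sketch. It assumes GDPVal-AA ratings follow the conventional Elo logistic curve on a 400-point scale, which the published figures do not confirm, and the function name `elo_expected_score` is ours, not part of any benchmark tooling. The "expected score" it returns counts a tie as half a win.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo curve.

    Assumption: the conventional 400-point logistic scale applies.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


# Elo ratings as reported in the article.
ratings = {"Opus 4.7": 1753, "GPT-5.4": 1674, "Gemini 3.1 Pro": 1314}

leader = ratings["Opus 4.7"]
for rival in ("GPT-5.4", "Gemini 3.1 Pro"):
    gap = leader - ratings[rival]
    score = elo_expected_score(leader, ratings[rival])
    print(f"Opus 4.7 vs {rival}: gap {gap} points, expected score {score:.1%}")
```

Under that assumption, the 79-point gap works out to about 61 percent, in line with the "roughly 60 percent" figure, and the 439-point gap to about 93 percent.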
Anthropic also conceded something rare in this release: Opus 4.7 still falls short of its unreleased Mythos preview, which is available only to a handpicked group of customers. That candor is the part of this launch most worth reading. It signals that the public model is no longer the bleeding edge of what a frontier lab has built, and that the next public release will likely close the gap. For developers, builders, and the broader market that depends on the strongest available model, Opus 4.7 is the new floor. The ceiling is now closer than it has ever been.