NEW YORK --:--:--
DOW JONES 49,447.43 ▲+1.79%
S&P 500 7,126.06 ▲+1.20%
NASDAQ 24,468.48 ▲+1.52%
RUSSELL 2000 2,776.90 ▲+2.11%
FTSE 100 10,667.63 ▲+0.73%
DAX 24,702.24 ▲+2.27%
CAC 40 8,425.13 ▲+1.97%
EURO STOXX 50 6,057.71 ▲+2.10%
NIKKEI 225 58,475.90 ▼-1.75%
HANG SENG 26,160.33 ▼-0.89%
SHANGHAI 4,051.43 ▲+0.60%
SENSEX 78,493.54 ▲+0.65%
NIFTY 50 24,353.55 ▲+0.65%
ASX 200 8,946.90 ▼-0.09%
KOSPI 6,191.92 ▼-0.55%
TAIWAN TAIEX 36,804.34 ▼-0.88%
BOVESPA 195,733.52 ▼-0.55%
IPC MEXICO 69,825.94 ▲+1.06%
JAKARTA IDX 7,634.00 ▲+0.17%
STRAITS TIMES 4,997.93 ▼-0.20%
TSLA 349.87 ▼-2.97%
AAPL 255.92 ▲+0.11%
BTC-USD 69,910.30 ▲+1.35%
GC=F 4,849.40 ▲+0.85%
SI=F 80.93 ▲+2.82%
CL=F 84.00 ▼-7.86%
SNDK 727.41 ▲+3.68%
^NSEBANK 52,609.10 ▲+2.06%
^CNXIT 31,403.35 ▲+2.50%
TCS.NS 2,539.80 ▲+2.66%
INFY.NS 1,306.20 ▲+0.42%
LT.NS 3,723.30 ▼-0.12%
ITC.NS 298.45 ▲+1.22%
SBIN.NS 1,030.40 ▼-0.23%
MARUTI.NS 12,798.00 ▲+0.87%
WIPRO.NS 197.29 ▲+1.22%
TMCV.NS 396.05 ▲+1.21%
Live
Iran's Top Military Adviser Just Threatened to Sink American Ships — And the Ceasefire Is Still Technically Alive While America Blockades and Europe Hedges — Beijing Is Quietly Winning the Energy War Nobody Is Watching Wall Street Just Declared the War Over — The S&P 500 Hit a Record High While Iran Threatens to Sink U.S. Ships Hormuz Is Open. The War Isn't Over. — A 12% Oil Drop, a Conditional Truce, and the One Clock Wall Street Is Choosing Not to Watch Trump Has Already Said Yes to a Fourth Justice — The Only Question Is Whether Alito Says When Anthropic Just Took the Lead Back — Claude Opus 4.7 Crosses 87% on SWE-bench, and the Numbers Tell a Cleaner Story Than the Hype A Federal Judge Just Drew a Line in the Marble — Trump's Ballroom Project Hit Its First Real Obstacle ICE Just Lost Its Acting Chief — At the Worst Possible Moment for an Agency Already Stretched American Travelers Are About to Pay the War Tax — Jet Fuel Has Spiked, and the Routes Are Quietly Disappearing Five Women, One Pattern — The Swalwell Accusations Are No Longer Allegations, They Are a Blueprint Your iPhone Is Becoming Your Entire Identity — And Apple Wallet Just Took Three More Steps to Make It Permanent Vance Tells the Vatican to Stay in Its Lane — And the Fault Line Between Faith and Foreign Policy Cracks Wide Open Round Two Is Coming — Washington and Tehran Are Quietly Racing Back to the Table Before the Ceasefire Dies A Federal Judge Just Threw Out Trump's $10 Billion Epstein Lawsuit. He Can Try Again — But the Damage Is Done. A 16-Year-Old Is Being Tried as an Adult for Killing His Stepsister on a Cruise Ship. Here's What We Know. Iran's Top Military Adviser Just Threatened to Sink American Ships — And the Ceasefire Is Still Technically Alive While America Blockades and Europe Hedges — Beijing Is Quietly Winning the Energy War Nobody Is Watching Wall Street Just Declared the War Over — The S&P 500 Hit a Record High While Iran Threatens to Sink U.S. Ships Hormuz Is Open. The War Isn't Over. — A 12% Oil Drop, a Conditional Truce, and the One Clock Wall Street Is Choosing Not to Watch Trump Has Already Said Yes to a Fourth Justice — The Only Question Is Whether Alito Says When Anthropic Just Took the Lead Back — Claude Opus 4.7 Crosses 87% on SWE-bench, and the Numbers Tell a Cleaner Story Than the Hype A Federal Judge Just Drew a Line in the Marble — Trump's Ballroom Project Hit Its First Real Obstacle ICE Just Lost Its Acting Chief — At the Worst Possible Moment for an Agency Already Stretched American Travelers Are About to Pay the War Tax — Jet Fuel Has Spiked, and the Routes Are Quietly Disappearing Five Women, One Pattern — The Swalwell Accusations Are No Longer Allegations, They Are a Blueprint Your iPhone Is Becoming Your Entire Identity — And Apple Wallet Just Took Three More Steps to Make It Permanent Vance Tells the Vatican to Stay in Its Lane — And the Fault Line Between Faith and Foreign Policy Cracks Wide Open Round Two Is Coming — Washington and Tehran Are Quietly Racing Back to the Table Before the Ceasefire Dies A Federal Judge Just Threw Out Trump's $10 Billion Epstein Lawsuit. He Can Try Again — But the Damage Is Done. A 16-Year-Old Is Being Tried as an Adult for Killing His Stepsister on a Cruise Ship. Here's What We Know.
Speed
Technology

Anthropic Just Took the Lead Back — Claude Opus 4.7 Crosses 87% on SWE-bench, and the Numbers Tell a Cleaner Story Than the Hype

For once the "lead retaken" headline survives the benchmark math. Opus 4.7 jumps SWE-bench Verified from 80.8 to 87.6 percent — ahead of Gemini 3.1 Pro at 80.6 — and pulls clear on the metrics that matter to teams shipping code.

Fully Verified
How This Impacts You
The strongest generally available model on the market on coding, vision, and knowledge work — and the first time a frontier lab has openly admitted that what it shipped is not what it has built.
FLASHFEED Desk · · Updated: 17 Apr 2026, 23:20:56 · 4 min read
🇬🇧EN 🇫🇷FR 🇪🇸ES
The release of Claude Opus 4.7 narrowly retakes the throne for the most powerful generally available large language model — and unlike most "lead retaken" press cycles, the benchmark math actually supports the headline. SWE-bench Verified jumps from 80.8 to 87.6 percent, a nearly seven-point gain that puts it ahead of Gemini 3.1 Pro at 80.6. SWE-bench Pro, the harder multi-language coding test, jumps from 53.4 to 64.3. These are not marginal improvements. They are the difference between a model that handles common engineering tasks and a model that handles the messy ones the previous generation regularly stumbled on. Compare the curve to its predecessor. Opus 4.6 was already the best general-purpose model for agentic coding work in late 2025; the gap to GPT-5.4 was real but contestable. Opus 4.7 widens that gap on the metrics that matter to teams actually shipping code — SWE-bench, MCP-Atlas at 77.3 percent for multi-tool orchestration, and a vision benchmark that jumps from 57.7 to 79.5 percent for visual navigation without tools. Each of those numbers, taken alone, is a normal generational improvement. Taken together, they describe a model that is meaningfully more useful than what came before. The most underdiscussed metric is GDPVal-AA, the knowledge-work evaluation. Opus 4.7 leads at an Elo of 1753, with GPT-5.4 at 1674 and Gemini 3.1 Pro at 1314. That spread is not a benchmark artifact — it reflects what real users keep observing in side-by-side comparisons. Where coding benchmarks measure what models can do, GDPVal-AA measures what they actually do for the kind of professional work people pay for. The 79-point Elo gap to GPT-5.4 corresponds to roughly a 60-percent win rate in head-to-head matches. The 439-point gap to Gemini 3.1 Pro is, in this kind of evaluation, a generational distance. Anthropic also conceded something rare in this release — that Opus 4.7 still falls short of its unreleased Mythos preview, available only to a handpicked group of customers. That candor is the part of this launch most worth reading. It signals that the public model is no longer the bleeding edge of what a frontier lab can ship, and that the next public release will likely close the gap. For developers, builders, and the broader market that depends on the strongest available model, Opus 4.7 is the new floor. The ceiling is now closer than it has ever been.
More Stories
⚡ How This Impacts You
🔊 Audio Not Available
1
Use Google Chrome or Safari — these browsers support text-to-speech.
2
On Safari iOS: go to Settings → Accessibility → Spoken Content and enable "Speak Screen".
3
Reload this page and tap Listen again.