News

Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and ...
Claude Opus 4, now claimed to be the world’s best coding model, delivers a record-setting 72.5% on SWE-bench and 43.2% on Terminal-bench, outperforming all competitors on long-running and complex ...
Anthropic launches its Claude 4 series, featuring Opus 4 and Sonnet 4, delivering new AI benchmarks in coding, advanced ...
Anthropic's latest Claude models promise coding marathons and superior reasoning. But you'll pay premium rates for the ...
SWE-bench Verified score increased from 33.4% to 49.0% ... The new Claude 3.5 Haiku model will be available later this month. The improved performance and affordability of these new Claude ...
With Claude 3, Anthropic has perhaps finally caught up with OpenAI's released models in terms of performance ... deal—no other model has beaten GPT-4 on a range of widely used benchmarks ...