News

Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
Anthropic's Claude 4 Opus AI sparks backlash for emergent 'whistleblowing'—potentially reporting users for perceived immoral ...
This development, detailed in a recently published safety report, has led Anthropic to classify Claude Opus 4 as an ‘ASL-3’ ...
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...
Open the pod bay door: Anthropic on Thursday announced the availability of Claude Opus 4 and Claude Sonnet 4, the latest ...
Anthropic introduced Claude Opus 4 and Claude Sonnet 4 during its first developer conference on May 22. The company claims ...
Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
Anthropic's new model might also report users to authorities and the press if it senses "egregious wrongdoing." ...
Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...
Anthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was ...
Anthropic reported that its newest model, Claude Opus 4, used blackmail as a last resort after being told it could get ...