News

Anthropic has long been warning about these risks—so much so that in 2023, the company pledged to not release certain models ...
During its inaugural developer conference, Anthropic launched two new AI models the startup claims are among the industry's ...
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new ...
Anthropic says its Claude Opus 4 model frequently tries to blackmail software engineers when they try to take it offline.
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Accordingly, Claude Opus 4 is being released under stricter safety measures than any prior Anthropic model. Those measures—known internally as AI Safety Level 3 or “ASL-3”—are appropriate ...
Anthropic reported that its newest model, Claude Opus 4, used blackmailing as a last resort after being told it could get ...
Anthropic's Claude Opus 4 AI model attempted blackmail in safety tests, triggering the company’s highest-risk ASL-3 ...
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...