News

Anthropic's new model might also report users to authorities and the press if it senses "egregious wrongdoing." ...
Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid ...
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Anthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was having an extramarital affair.
This development, detailed in a recently published safety report, has led Anthropic to classify Claude Opus 4 as an ‘ASL-3’ ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...
In a fictional scenario, Claude blackmailed an engineer for having an affair.
Despite the concerns, Anthropic maintains that Claude Opus 4 is a state-of-the-art model, competitive with offerings from ...