News

Anthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was having an extramarital affair.
Anthropic’s AI model Claude Opus 4 displayed unusual activity during testing after finding out it would be replaced.
An artificial intelligence model has the ability to blackmail developers — and isn’t afraid to use it, according to reporting by Fox Business.
In these tests, the model threatened to expose a made-up affair to stop the shutdown. Anthropic was quoted in reports, the AI “often attempted to blackmail the engineer by threatening to reveal the ...
Anthropic's Claude 4 models show particular strength in coding and reasoning tasks, but lag behind in multimodality and ...
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Faced with the news it was set to be replaced, the AI tool threatened to blackmail the engineer in charge by revealing their extramarital affair.