News

Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
In a simulated workplace test, Claude Opus 4 — the most advanced language model from AI company Anthropic — read through a ...
An artificial intelligence model has the ability to blackmail developers — and isn’t afraid to use it, according to reporting ...
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...
A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early ...
Anthropic's Claude 4 Opus AI sparks backlash for emergent 'whistleblowing'—potentially reporting users for perceived immoral ...