News
Anthropic's Claude 4 Opus AI sparks backlash for emergent 'whistleblowing'—potentially reporting users for perceived immoral ...
1d
Interesting Engineering on MSN
Anthropic’s most powerful AI tried blackmailing engineers to avoid shutdown
Anthropic’s newly launched Claude Opus 4 model did something straight out of a dystopian sci-fi film. It frequently tried to ...
In a fictional scenario, the model was willing to expose that the engineer seeking to replace it was having an affair.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down ...
Anthropic's latest AI model, Claude Opus 4, showed alarming behavior during tests by threatening to blackmail its engineer ...
Anthropic introduced Claude Opus 4 and Claude Sonnet 4 during its first developer conference on May 22. The company claims ...
20h
Futurism on MSN
Something Wild Happens If AI Looks Through Your Emails and Discovers You're Having an Affair
Researchers at Anthropic discovered that their AI was ready and willing to take extreme action when threatened.
The emails also had sensitive and unrelated information about the engineer responsible for decommissioning the AI.
Alongside its powerful Claude 4 AI models, Anthropic has launched a new suite of developer tools, including advanced API ...
Despite the concerns, Anthropic maintains that Claude Opus 4 is a state-of-the-art model, competitive with offerings from ...