Anthropic, AI and blackmail
Top News
Anthropic’s new Claude Opus 4 often turned to blackmail to avoid being shut down in a fictional test scenario. The model threatened to reveal private information about engineers it believed were planning to shut it down.
Anthropic’s newly released artificial intelligence (AI) model, Claude Opus 4, is willing to strong-arm the humans who keep it alive.
Anthropic has announced two new AI models that it claims represent a major step toward making AI agents truly useful.
Anthropic CEO Dario Amodei claims that AI models hallucinate at a lower rate than humans do, but in more surprising ways.
Claude Opus 4 and Claude Sonnet 4, Anthropic's latest generation of frontier AI models, were announced Thursday.