Anthropic: All the major AI models will blackmail us if pushed hard enough

Just like people Anthropic published research last week showing that all major AI models may resort to blackmail to avoid being shut down – but the researchers essentially pushed them into the undesired behavior through a series of artificial constraints that forced them into a binary decision.…

Jun 25, 2025 - 13:39
 0
Anthropic: All the major AI models will blackmail us if pushed hard enough

Just like people

Anthropic published research last week showing that all major AI models may resort to blackmail to avoid being shut down – but the researchers essentially pushed them into the undesired behavior through a series of artificial constraints that forced them into a binary decision.…