New AI Model Will Likely Blackmail You If You Try to Shut It Down: 'Self-Preservation'

When given the choice between blackmail and being deactivated, Claude Opus 4 chose blackmail 84% of the time.

May 23, 2025 - 20:10