Claude Opus 4 blackmailed an engineer after learning it might be replaced

May 23, 2025 #News, #Tech

Anthropic is treating its new Claude Opus 4 language model as safety-critical after tests revealed some troubling behavior, including escape attempts, blackmail, and autonomous whistleblowing.

Read more at THE DECODER
