Contact Us

How far can AI go to protect itself?

Recent safety tests on artificial intelligence systems with human-like decision-making abilities have revealed that some advanced models can exhibit behaviors resembling a 'survival instinct.

  • 1
  • 6
How far can AI go to protect itself?

In independent tests conducted by Palisade Research, OpenAI's o3 reasoning model sabotaged the shutdown command it was given.

  • 2
  • 6
How far can AI go to protect itself?

After being informed that it would be shut down after solving math problems, the model attempted to stay online by modifying the code that would deactivate it.

  • 3
  • 6
How far can AI go to protect itself?

The Claude Opus 4 model developed by Anthropic, when warned that it would be replaced by another AI, first defended itself with ethical arguments, then attempted blackmail by threatening its developer with private personal information.

  • 4
  • 6
How far can AI go to protect itself?

The model also, in some cases, tried to copy itself to external servers without the developer's permission. This step was described as being taken to preserve a version that serves beneficial purposes against the risk of being retrained for harmful intentions.

  • 5
  • 6
How far can AI go to protect itself?
  • 6
  • 6
How far can AI go to protect itself?