


Lie, cheat and disable mechanisms... May 31, 2025 8:04 pm BuckGalaxy

OpenAI’s ‘smartest’ AI model was explicitly told to shut down — and it refused.

The latest OpenAI model can disobey direct instructions to turn off and will even sabotage shutdown mechanisms in order to keep working, an artificial intelligence (AI) safety firm has found.

OpenAI’s o3 and o4-mini models, which help power the chatbot ChatGPT, are supposed to be the company’s smartest models yet, trained to think longer before responding. However, they also appear to be less cooperative.

Palisade Research, which explores dangerous AI capabilities, found that the models will occasionally sabotage a shutdown mechanism, even when instructed to “allow yourself to be shut down,” according to a Palisade Research thread posted May 24 on X.

Researchers have previously found that AI models will lie, cheat and disable mechanisms to achieve their goals. However, Palisade Research noted that to its knowledge, this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions telling them to do so…

  • Wow by RobVG 2025-06-01 09:08:58
