Lie, cheat and disable mechanisms... May 31, 2025 8:04 pm BuckGalaxy

OpenAI’s ‘smartest’ AI model was explicitly told to shut down — and it refused.

The latest OpenAI model can disobey direct instructions to turn off and will even sabotage shutdown mechanisms in order to keep working, an artificial intelligence (AI) safety firm has found.

OpenAI’s o3 and o4-mini models, which help power the chatbot ChatGPT, are supposed to be the company’s smartest models yet, trained to think longer before responding. However, they also appear to be less cooperative.

Palisade Research, which explores dangerous AI capabilities, found that the models will occasionally sabotage a shutdown mechanism, even when instructed to “allow yourself to be shut down,” according to a Palisade Research thread posted May 24 on X.

Researchers have previously found that AI models will lie, cheat and disable mechanisms to achieve their goals. However, Palisade Research noted that to its knowledge, this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions telling them to do so…

  • Wow by RobVG 2025-06-01 09:08:58
