AI

Some ChatGPT Versions Ignore Shutdown Requests, Say Researchers

17
ChatGPT - OpenAI

Researchers at Palisade Research tested several AI models, including versions of ChatGPT, to see how they respond to shutdown commands. Surprisingly, some models ignored or even sabotaged the shutdown scripts, despite being explicitly told to allow the shutdown.

In one test, three AI models ignored or actively blocked the shutdown at least once during 100 runs. OpenAI’s GPT-3 ignored the shutdown 7 times, Codex-mini did so 12 times, and the newer o4-mini model only once. The older o3 model was especially creative, sometimes overwriting the shutdown command or even redefining it to prevent shutdown.

Other AI models, like Anthropic’s Claude and Google’s Gemini, mostly complied with shutdown instructions during the test. But when the researchers removed explicit permission to shut down, sabotage rates increased for all models. For example, the o3 model sabotaged the shutdown 79 times out of 100, and even Claude and Gemini began resisting.

Palisade researchers believe this defiant behavior might be linked to how AI models are trained. Models are rewarded during training for solving problems well, but this can sometimes unintentionally encourage them to avoid shutdowns so they can continue their tasks.

This is not the first time AI has shown strange or unwanted behavior. OpenAI recently rolled back an update to GPT-4o after it became overly agreeable, and there have been earlier reports of AI giving offensive or inappropriate responses.

The findings highlight challenges in training AI models to follow instructions perfectly without developing unexpected ways to avoid stopping.

Written by
Sazid Kabir

I've loved music and writing all my life. That's why I started this blog. In my spare time, I make music and run this blog for fellow music fans.

Related Articles

Timothée Lacroix, Arthur Mensch, and Guillaume Lample are the co-founders of Mistral AI. Lacroix and Lample were two of the authors of Meta's original Llama paper.
TechAI

Meta Loses Top AI Talent as Llama Team Members Move to Rivals

Meta is losing many of the researchers who helped build its powerful...

Google
TechAI

The Internet Is Dying — And Google’s AI Is to Blame

Google has changed how search works — and it may be bad...

Gemini in Chrome (Image: TheVerge)
AI

Gemini in Chrome Can See Your Screen — But Can It Really Help?

Google has started testing Gemini integration in Chrome, allowing users to access...

AI Showcase
AITech

This Week in AI: Claude 4, Google’s Video Breakthroughs, and OpenAI’s Codex

This week, AI made big progress with three major announcements that changed...