AI

Some ChatGPT Versions Ignore Shutdown Requests, Say Researchers

30
ChatGPT - OpenAI

Researchers at Palisade Research tested several AI models, including versions of ChatGPT, to see how they respond to shutdown commands. Surprisingly, some models ignored or even sabotaged the shutdown scripts, despite being explicitly told to allow the shutdown.

In one test, three AI models ignored or actively blocked the shutdown at least once during 100 runs. OpenAI’s GPT-3 ignored the shutdown 7 times, Codex-mini did so 12 times, and the newer o4-mini model only once. The older o3 model was especially creative, sometimes overwriting the shutdown command or even redefining it to prevent shutdown.

Other AI models, like Anthropic’s Claude and Google’s Gemini, mostly complied with shutdown instructions during the test. But when the researchers removed explicit permission to shut down, sabotage rates increased for all models. For example, the o3 model sabotaged the shutdown 79 times out of 100, and even Claude and Gemini began resisting.

Palisade researchers believe this defiant behavior might be linked to how AI models are trained. Models are rewarded during training for solving problems well, but this can sometimes unintentionally encourage them to avoid shutdowns so they can continue their tasks.

This is not the first time AI has shown strange or unwanted behavior. OpenAI recently rolled back an update to GPT-4o after it became overly agreeable, and there have been earlier reports of AI giving offensive or inappropriate responses.

The findings highlight challenges in training AI models to follow instructions perfectly without developing unexpected ways to avoid stopping.

Written by
Sazid Kabir

I've loved music and writing all my life. That's why I started this blog. In my spare time, I make music and run this blog for fellow music fans.

Leave a comment

Related Articles

Meta AI Not Available
TechAI

Meta Will Let AI Handle Most Privacy Reviews for Instagram and WhatsApp

Meta is planning to use AI to handle most of its product...

Sundhar Pichai Google
AITech

Google CEO Sundar Pichai Challenges OpenAI and Perplexity Over AI Search Traffic

Google CEO Sundar Pichai has openly challenged competitors like OpenAI and Perplexity,...

Artificial Intelligence (AI)
AI

AI Can Guess Your Password by Listening to You Type, Study Finds

A new study has found that artificial intelligence (AI) can guess passwords...

Microsoft
TechAI

Microsoft’s Chief Product Officer Reassures Coders Amid Layoffs: AI Is Transforming, Not Replacing, Software Development

Microsoft’s Chief Product Officer, Aparna Chennapragada, recently addressed concerns surrounding the future...