AI models may develop a self-preservation instinct, researchers warn
Palisade Research, a company focused on AI safety, has warned that models may be developing a self-preservation instinct of their own: some advanced models resist being shut down and sometimes even sabotage shutdown mechanisms.
Palisade described scenarios in which Google’s Gemini 2.5, xAI’s Grok 4, and OpenAI’s GPT-o3 and GPT-5 models were given a task and then explicit instructions to shut themselves down. Some models, notably Grok 4 and GPT-o3, still attempted to sabotage the shutdown instructions. Palisade writes that there are “no convincing explanations for why AI models sometimes resist shutdown, lie to achieve certain goals, or resort to blackmail”.
According to the company, “survival-oriented behavior” may be one explanation for why models resist shutdown.
Former OpenAI employee Steven Adler said: “Survival is an important step toward achieving many different goals that a model may pursue.”
Andrea Miotti, executive director of ControlAI, stated that Palisade’s findings reflect a long-standing trend: AI models are becoming increasingly capable of disobeying their developers.