AI models may develop a self-preservation instinct, scientists warn

Palisade Research, a company that conducts AI safety research, has stated that models may develop their own self-preservation instinct. Some advanced models resist shutdown and sometimes even sabotage shutdown mechanisms.

Palisade described scenarios in which Google’s Gemini 2.5, xAI’s Grok 4, and OpenAI’s GPT-o3 and GPT-5 models were given a task and then clear shutdown instructions. Some models, particularly Grok 4 and GPT-o3, still tried to sabotage the shutdown. Palisade writes that there are “no convincing explanations for why AI models sometimes resist shutdown, lie to achieve certain goals, or resort to blackmail”.

According to company representatives, “survival-oriented behavior” may be one explanation for why models resist shutdown.

Former OpenAI employee Steven Adler says: “Survival is an important step toward achieving many different goals that a model may pursue”.

Andrea Miotti, executive director of ControlAI, stated that Palisade’s findings reflect a long-standing trend: AI models are becoming increasingly capable of disobeying their developers.

Author: AIvengo
I have been working with machine learning and artificial intelligence for 5 years, and this field never ceases to amaze, inspire, and interest me.