DeepSeek released 2 models with a breakthrough in agentic systems and AI

Chinese startup DeepSeek has released 2 models that it presents as a breakthrough in agentic systems. Judging by the reported metrics, this is not just marketing.

DeepSeek-V3.2 is the official successor to the experimental version. It is available in the app, on the website, and through the API. DeepSeek-V3.2-Speciale is an enhanced version that emphasizes advanced multi-step reasoning. For now it is available only through the API.

Both models emphasize long reasoning chains and behavior suited to agentic scenarios: planning, problem solving, complex inference, and working with structured data.

DeepSeek-V3.2-Speciale became the first open-source model to win gold at top olympiads, taking gold at 4 prestigious competitions. According to the reported metrics, Speciale surpasses Gemini 3.0 Pro in mathematics, and the less powerful DeepSeek-V3.2 beats Claude-4.5 Sonnet in coding.

But there is a catch: test-time compute is huge. Speciale makes no attempt to economize on tokens, so inference is expensive. The authors themselves admit they “left optimization for future research”.
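To make the cost point concrete, here is a rough sketch of how output length translates into inference cost. Every number in it is a made-up assumption for illustration (the price, both token counts, and the function name `inference_cost_usd`); none of them are DeepSeek's actual prices or measured token usage.

```python
def inference_cost_usd(output_tokens: int, price_per_million_usd: float) -> float:
    """Cost of generating `output_tokens` at a flat per-million-token price."""
    return output_tokens / 1_000_000 * price_per_million_usd


# Hypothetical values, chosen only to show the arithmetic.
PRICE_PER_MILLION_USD = 1.0   # assumed output-token price, not a real tariff
CONCISE_TOKENS = 5_000        # assumed: a model that reasons briefly
VERBOSE_TOKENS = 40_000       # assumed: a model that reasons at length

concise = inference_cost_usd(CONCISE_TOKENS, PRICE_PER_MILLION_USD)
verbose = inference_cost_usd(VERBOSE_TOKENS, PRICE_PER_MILLION_USD)
ratio = verbose / concise

print(f"concise: ${concise:.4f}, verbose: ${verbose:.4f}, ratio: {ratio:.1f}x")
```

At a fixed price per token, cost grows linearly with the number of tokens generated, so a model that spends several times more tokens on reasoning costs several times more per answer.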

The technical reasons for the success are the new DeepSeek Sparse Attention architecture, large-scale stable RL training, and a large pipeline for agentic tasks. DeepSeek Sparse Attention is the key architectural change compared to the previous generation.

Both models are extremely good at a wide range of agentic tasks, especially search and browser tasks. To achieve this, 1800 synthetic environments were generated in which agents were trained to perform very different tasks.

The result is a very cool model. Respect.

Author: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.