AIvengo > Reviews > DeepSeek R1 surpassed Qwen 3 and reduced gap with Gemini 2.5 Pro

DeepSeek R1 surpassed Qwen 3 and reduced gap with Gemini 2.5 Pro

Data on DeepSeek R1, which received a serious update, has arrived. And the results are impressive. The model now confidently surpasses its competitor Qwen 3 with 235 billion parameters. Although it still lags behind flagships like Gemini 2.5 Pro and O3, the gap has significantly narrowed. The main improvement is related to increased reasoning depth – now the model uses an average of 23,000 tokens to solve tasks, while the previous version was limited to 12,000. This ability for deeper analysis brought impressive results. For example, in the AIME test, accuracy grew from 70% to 87.5%. Besides impressive successes in benchmarks, the new version began hallucinating much less and significantly improved its capabilities in frontend development. Although it still has to grow to Claude’s level in this sphere.

I think within the next year we will see a new wave of large language model integration into knowledge distillation systems. Where giant models will act as “teachers” for compact versions. This will lead to rapid breakthrough in small model efficiency and their implementation in mobile devices.

Autor: AIvengo

For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Scientists became more afraid of AI hallucinations

The more scientists work with artificial intelligence, the less they trust it. Academic publisher Wiley released a preliminary report for 2025 on technology's impact on science, and the conclusions there are paradoxical. Researchers began treating neural networks with greater skepticism than a year ago, when the technology was obviously less developed.

New model from DeepSeek recognizes documents cheaply and efficiently

DeepSeek rolled out a new model for document recognition. And you know what? It doesn't just read text from pages - it understands structure. And does this cheaply and efficiently, which is rare in the AI world.

OpenAI officially denied GPT-6 release by end of year

At OpenAI they decided to cool public expectations and confessed: GPT-6 won't happen this year. But don't rush to be upset - this doesn't mean the company is sitting idle.

Father of reinforcement learning predicted end of large language models era

Richard Sutton - this is one of the fathers of reinforcement learning and Turing Award laureate. So he stated that the era of large language models is coming to an end. Next, in his opinion, comes the era of experience. And here's why he thinks so.

Artificial intelligence detects ADHD without questionnaires and doctors

Imagine you could diagnose ADHD simply by how your brain processes flickering letters on a screen. No questionnaires, no months of waiting for an appointment with a specialist. AI looks at your visual rhythms and gives a verdict with 92% accuracy. Sounds like science fiction? But this is already reality.