Post Thumbnail

Qwen 3 surpassed Claude 4 Opus and DeepSeek V3 in tests

The Qwen team released an update to their flagship model Qwen 3. The results are excellent. The new version surpasses such powerful models as Claude 4 Opus, Kimi K2 and DeepSeek V3 in many key indicators.

Particularly important is the developers’ strategic decision to abandon the hybrid approach. Now instead of combining Instruct and Reasoning modes in one model, they release separate specialized versions. Today the Instruct model is presented. And the version for deep reasoning is already in active development.

The foundation of the new architecture is Mixture of Experts technology. Where from 235 billion parameters only 22 billion are actively used. This makes the model significantly lighter for computations, which is critically important for real application.

The developers claim they significantly improved the model’s basic knowledge coverage, its logical reasoning capabilities and long context processing up to 256,000 tokens. The model now follows user preferences much better.

In the future, the team plans to distill technologies into younger versions, which will make Qwen 3’s power accessible not only to owners of top graphics cards.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
$200 USB cable transforms into autonomous AI hacker

Researchers from Palisade Research created a new cybersecurity threat. A modified USB cable that becomes a conduit for autonomous AI into computer systems. The $200 device contains a programmable microchip that loads a digital agent directly onto the target machine.

xAI lays off 500 annotators for Grok's expert specialization

A strategic pivot from xAI is emerging. The company is radically changing its approach to training its Grok language model! Elon Musk's team fired 500 universal annotators in one day. Instead, it's increasing the number of specialized AI tutors by 10 times.

Gemini content review time reduced from 30 to 15 minutes

Alarming signals from Google's internal kitchen were published by The Guardian. Content evaluators for the Gemini model shared interesting information about declining review standards. Employees of contractor GlobalLogic, responsible for assessing quality and safety of AI responses before release, are sounding alarms.

Golden chassis and contextual understanding in Tesla's new generation

Tesla introduced a new humanoid robot Optimus with integrated Grok from xAI. Salesforce CEO Marc Benioff personally tested the prototype, asking it to bring a soda. The robot demonstrated meaningful contextual understanding and dialogue capability. Although several clarifying commands were needed.

Microsoft diversifies partnerships: Claude Sonnet 4 in Office

Microsoft made a strategic decision to diversify its AI partnerships. The company signed an agreement with Anthropic, creator of the Claude model. To implement their technologies in Office applications.