Post Thumbnail

OpenAI released o3 Pro — analytical AI that surpassed Claude and Gemini

OpenAI company officially released a fundamentally new artificial intelligence model o3 Pro. Its uniqueness is that it’s not a conversationalist, but a powerful analytical tool. Imagine: you upload context, formulate a task and get a detailed, thoroughly developed report. One of the testers shared an amazing experience. After uploading the history of their startup’s meetings, they received such accurate and well-founded recommendations that they completely reconsidered the company’s development strategy.

The model demonstrates impressive context understanding, accurately identifies suitable tools and moments for their application. However, without sufficient input data volume, it tends to over-analyze even the simplest tasks. A kind of artificial intelligence “perfectionism”.

In comparative tests, o3 Pro surpasses competitors Claude Opus and Gemini 2.5 Pro. In AIME benchmarks, the model exceeded Gemini 2.5 Pro, and in the GPQA Diamond test for PhD-level science knowledge, it outperformed the recently released Claude 4 Opus.

It seems OpenAI is making a strategic bet on developing reasoning capabilities. Training models not just to use available tools, but also to understand optimal scenarios for their application.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Latest News

Samsung seeks replacement for Google Gemini for Galaxy S26

Samsung Electronics, one of the leading mobile device manufacturers, is actively seeking alternatives to Google Gemini for its future Galaxy S26 lineup. The company is conducting negotiations with OpenAI and Perplexity, striving to expand the artificial intelligence ecosystem in its devices.

How language models transfer knowledge through random numbers

Have you ever wondered if numbers can store knowledge? Scientists discovered an amazing phenomenon. Language models can transfer their behavioral traits through sequences of digits that look like random noise.

Alibaba introduced Quark AI smart glasses with Snapdragon AR1 chip

Chinese tech giant Alibaba introduced its first model of Quark AI smart glasses at the World Conference on Artificial Intelligence in Shanghai.

Why advanced AI models confuse themselves during long reasoning

You give a complex task to a smart person and expect that the longer they think, the more accurate the answer will be. Logical, right? That's exactly how we're used to thinking about artificial intelligence work too. But new research from Anthropic shows that reality is much more interesting.

Z.AI introduced GLM-4.5 with 355 billion parameters and open source

Meet the new technological heavyweight! Z.AI company introduced the open language model GLM-4.5, which is ready to challenge Western giants not only with capabilities but also with accessibility.