
Google Gemini 2.5 Pro surpassed OpenAI o3 and leads on LMArena

Google has updated Gemini 2.5 Pro with strong benchmark gains, and the model now surpasses OpenAI's current o3 version. Like 2.5 Flash, it is a hybrid model that lets you set a budget for its thinking process or turn thinking off completely. The update is already available and, according to first impressions, works better than earlier versions. It even listens when you ask it not to clutter code with comments.

On LMArena, the new version gained 24 Elo points over the previous one and now leads in all categories, ahead of o3 and Claude Opus 4. Its benchmark results improved noticeably, and it became stronger at coding, logic and exact-science tasks.
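To put a 24-point gain in perspective, here is a minimal sketch that converts a rating gap into an expected head-to-head win rate. It assumes the standard Elo logistic curve with a 400-point scale; the function name `elo_expected_score` is made up for this example, and LMArena's own scoring may differ in detail.

```python
def elo_expected_score(rating_gap: float) -> float:
    """Expected score of the higher-rated side for a given rating gap.

    Assumption: standard Elo logistic curve with a 400-point scale.
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / 400.0))


if __name__ == "__main__":
    # A 24-point gap, as reported for the updated model versus its predecessor.
    print(f"{elo_expected_score(24):.3f}")
```

Under this assumption a 24-point lead corresponds to winning a little over 53% of direct comparisons, which is a modest but measurable edge.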

The results are impressive: 82.2% on programming tasks, 86.4% on natural science questions and 21.6% on Humanity’s Last Exam, a test of reasoning and knowledge.

The developers also took feedback on the previous version into account and improved the style and structure of responses, so the model can now be more creative. They also added thinking budgets for greater cost control. Unfortunately, image generation has still not been added to Gemini Pro.
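As a rough illustration of what a thinking budget looks like in practice, the sketch below builds a request body with a token cap on thinking. The field names `generationConfig`, `thinkingConfig` and `thinkingBudget` reflect my understanding of the Gemini REST API and should be checked against the official documentation. The helper `build_request` is hypothetical, and nothing is sent over the network.

```python
import json


def build_request(prompt: str, thinking_budget: int) -> dict:
    """Build a generateContent-style request body with a thinking budget.

    Assumption: the API reads the budget (in tokens) from
    generationConfig.thinkingConfig.thinkingBudget.
    """
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "thinkingConfig": {"thinkingBudget": thinking_budget},
        },
    }


if __name__ == "__main__":
    # A smaller budget trades reasoning depth for lower cost.
    body = build_request("Summarize this changelog.", thinking_budget=1024)
    print(json.dumps(body, indent=2))
```

Lowering the budget for simple prompts and raising it for hard ones is the kind of cost control the update is aimed at.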

Author: AIvengo
I have been working with machine learning and artificial intelligence for 5 years, and this field never ceases to amaze, inspire and interest me.