Post Thumbnail

Latest Claude 3.7 Sonnet model storms the Pokémon world

Anthropic, one of the leaders in artificial intelligence, presented an unusual approach to testing its latest Claude 3.7 Sonnet model, using the iconic Game Boy game Pokémon Red.

According to information published in the company’s official blog on February 24, researchers equipped the model with basic memory, pixel input processing capability from the screen, and functional calls for button pressing and navigation. This allowed the AI to continuously play Pokémon without additional assistance.

A key advantage of Claude 3.7 Sonnet is the “extended thinking” feature, similar to the capabilities of OpenAI o3-mini and DeepSeek R1. This technology allows the model to “reason” when solving complex tasks, applying additional computational resources and spending more time on analysis.

The results of the experiment were impressive. While the previous version of the model, Claude 3.0 Sonnet, couldn’t even leave the starting house in Pallet Town where the game begins, Claude 3.7 Sonnet successfully battled three gym leaders and received their badges.

To achieve these results, the AI performed 35,000 game actions to reach the last gym leader Lieutenant Surge. However, the company did not disclose exact data on the computing power and time spent completing the game.

Although Pokémon Red may be considered more of an entertainment benchmark, using games for AI testing has a long tradition in the research community. In recent months, a number of new applications and platforms have emerged to test AI models’ gaming abilities on various games – from Street Fighter to Pictionary.

This experiment demonstrates the growing ability of artificial intelligence models to navigate complex interactive environments, understand rules, and strategically plan actions to achieve long-term goals – skills that have broad practical applications beyond the gaming industry.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Latest News

The first LAARMA system protects animals on Australian roads

In Australia, animal-vehicle collisions are a serious problem for this continent's ecosystem. Now scientists have found a technological solution. The world's first roadside LAARMA system based on artificial intelligence that protects wild animals from dangerous encounters with traffic.

Nvidia introduced Cosmos model family for robotics

Nvidia company introduced the Cosmos family of AI models. Which can fundamentally change the approach to creating robots and physical AI agents.

ChatGPT calls users "star seeds" from planet Lyra

It turns out ChatGPT can draw users into the world of scientifically unfounded and mystical theories.

AI music triggers stronger emotions than human music

Have you ever wondered why one melody gives you goosebumps while another leaves you indifferent? Scientists discovered something interesting. Music created by artificial intelligence triggers more intense emotional reactions in people than compositions written by humans.

GPT-5 was hacked in 24 hours

2 independent research companies NeuralTrust and SPLX discovered critical vulnerabilities in the security system of the new model just 24 hours after GPT-5's release. For comparison, Grok-4 was hacked in 2 days, making the GPT-5 case even more alarming.