Post Thumbnail

Latest Claude 3.7 Sonnet model storms the Pokémon world

Anthropic, one of the leaders in artificial intelligence, presented an unusual approach to testing its latest Claude 3.7 Sonnet model, using the iconic Game Boy game Pokémon Red.

According to information published in the company’s official blog on February 24, researchers equipped the model with basic memory, pixel input processing capability from the screen, and functional calls for button pressing and navigation. This allowed the AI to continuously play Pokémon without additional assistance.

A key advantage of Claude 3.7 Sonnet is the “extended thinking” feature, similar to the capabilities of OpenAI o3-mini and DeepSeek R1. This technology allows the model to “reason” when solving complex tasks, applying additional computational resources and spending more time on analysis.

The results of the experiment were impressive. While the previous version of the model, Claude 3.0 Sonnet, couldn’t even leave the starting house in Pallet Town where the game begins, Claude 3.7 Sonnet successfully battled three gym leaders and received their badges.

To achieve these results, the AI performed 35,000 game actions to reach the last gym leader Lieutenant Surge. However, the company did not disclose exact data on the computing power and time spent completing the game.

Although Pokémon Red may be considered more of an entertainment benchmark, using games for AI testing has a long tradition in the research community. In recent months, a number of new applications and platforms have emerged to test AI models’ gaming abilities on various games – from Street Fighter to Pictionary.

This experiment demonstrates the growing ability of artificial intelligence models to navigate complex interactive environments, understand rules, and strategically plan actions to achieve long-term goals – skills that have broad practical applications beyond the gaming industry.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
Sam Altman promises to return humanity to ChatGPT

OpenAI head Sam Altman made a statement after numerous offline and online protests against shutting down the GPT-4o model occurred. And then turning it on, but with a wild router. I talked about this last week in maximum detail. Direct quote from OpenAI head.

AI comes to life: Why Anthropic co-founder fears his creation

Anthropic co-founder Jack Clark published an essay that makes you uneasy. He wrote about the nature of modern artificial intelligence, and his conclusions sound like a warning.

Google buried the idea of omnipotent AI doctor

Google company released a report on Health AI Agents of 150 pages. That's 7,000 annotations, over 1,100 hours of expert work. Link in description. Numbers impressive, yes. But the point isn't in metrics. The point is they buried the very idea of an omnipotent AI doctor. And this is perhaps the most honest thing that happened in this industry recently.

Teenagers on TikTok scare parents with fake AI vagrants

You know what's considered a fun prank among teenagers now? Sending parents a photo of a homeless vagrant in their own living room. AI draws it, TikTok approves it, and let parents have hysteria. That's the kind of fun going around social media.

California shut up AI companions: New safety law

California became the first state to officially shut up AI companion chatbots. Governor Gavin Newsom signed a historic law that requires operators of such bots to implement safety protocols.