That a model of AI is stuck playing something as basic as ‘Pokémon’ seems worrying. It is not at all

by usatoday24 April 2, 2025, 10:42 pm

It was 2013 and almost no one had heard of Deepminda small artificial intelligence startup. His researchers came up to make their AI system learn to play (already win) video games, and They trained her with some titles of the old Atari console.

Among them was ‘Breakout’ (in Spain it appeared as ‘Arkanoid’), and A video of the time It shows how after 10 minutes playing the machine did not know just anything. After two hours of play, yes, I already played as an expert.

https://www.youtube.com/watch?v=v1eynij0rnk

But at four o’clock something amazing spent: The machine discovered a “trick” To maximize the effort: it made the ball end up creating “a tunnel” and then cast the ball through that tunnel so that it would not stop bouncing and ending almost the entire level effortlessly.

Since then using video games to train AI models or to check if they are able to adapt to them and complete them is common in the industry. It is precisely what Anthropic tried when a few weeks ago Claude 3.7 launched.

I have used Claude 3.7 for hours. It is the closest to a human brain that I have felt with an AI

This hybrid model of AI has proven to be a notable advance in areas such as programming and reasoning, but in Anthropic they wanted to test it with a singular test: To play the ‘Pokémon’ video game.

The AI is stuck

In this experiment those responsible for Anthropic wanted to evaluate whether the AI systems “can face challenges with increasingly complex competences, not only through training, but of generalized reasoning.”

Claude’s previous versions had a bad time even trying to start playing from the video game’s beginning screen, but Claude 3.7 Sonnet’s “expanded thinking” allows the new model «Plan in advanceremember their objectives and adapt when the initial strategies fail »in a way that their predecessors did not do.

For those responsible for Anthropic these improvements will end up helping to solve real world problems. It is something we are also seeing With the benchmark arc -agi 2which is precisely aimed at measuring the ability of the Ias to do things that are easy for us (controlling a video game, solving a visual puzzle) but these models are especially difficult.

Source: Anthropic.

The advance of Anthropic here is remarkable, but is far from being able to be considered a success. In fact and how they comment In Ars Technicathousands of spectators have proven On the Twitch Channel created by Anthropic how Claude stayed totally stuck in Mount Sléniteone of the video game sections.

In that channel you can also see how Claude is still trying to solve the problem and advance. “Think” and “reason” and even shows what “thinking” and “reasoning”, but the model still does not overcome that video game.

And despite everything, this is a great achievement of AI

Taking into account that the video game is oriented to children, it seems easy to despise the achievement of Anthropic, but these advances must be valued very positively. To start, Claude 3.7 model used to play was not “pressed” to play the video game: I had to learn about the march and adapt to the game.

The AI in video games will arrive sooner. According to a developer, 'GTA VI' will mark a before and after

Here also Claude “sees” the screen and what happens to react based on that analysis. And the problem is that The ‘Pokémon’ graphics are very basic and pixelatedwhich raises an even greater challenge for the Anthropic model: with better graphics it would probably behave much better, explained one of those responsible for the experiment.

Even so, Claude behaves especially well in the parts of the game in which text is shown, something that allows this model to better recognize what he needs to do in that phase of the video game.

But if there is a serious problem, that is also that of memorization. Claude has trouble remembering everything you have learned: It has a limited “memory” Of 200,000 tokens and when they exhaust Claude, they resort to summaries and condense the information, which can lead to eliminate small details that are important to advance in the game.

Be that as it may, the achievement of Anthropic remains remarkable, and points to a future in which these models can play autonomously and do so exceptionally to all kinds of games. As Deepmind already did it with that simplistic version of the ‘Arkanoid’, but in a big way.

In Xataka | The latest Google is an AI that plays video games. THE KEY: DOES IT UNDERSTANDING NATURAL LANGUAGE

basic Model playing Pokémon stuck worrying

What do you think?

0 Points

Upvote Downvote

That a model of AI is stuck playing something as basic as ‘Pokémon’ seems worrying. It is not at all

The AI is stuck

And despite everything, this is a great achievement of AI

What do you think?

The Xiaomi electric SUV is already priced in China. Yu7 is a direct dart to Tesla Model and

The atmosphere has been stuck in a “extreme heat generator” on Spain. That means something: a hard summer

22% of renewable plants did not meet the basic tension control criteria during the blackout. And the regulations already demanded it

Samsung is playing his future with the Galaxy S26 processor. Exynos 2600 is in critical phase

Ireland tested a basic income of 1,300 euros for 2,000 artists. It has gone so well that now they do not want me to end

The AI race seemed to be matching. Openai has just hit the table with a model that points very high

Spain wants to regulate the legal resale of tickets. The risk: Let the "BOLI BIC A 300 euros"

This xiaomi tablet is perfect to take it in your next vacation

cars and trains

How to claim the money from your Renfe, Ouigo or Iryo bills if there are delays and cancellations, and how much corresponds to you in each case

In the twentieth century the pipelines were the key to the world. In the 21st century are the electrical networks and a country is winning them: China

Some employees sued their company for cutting the salary. The supreme has responded that being unpunctual is not a job

Leave a ReplyCancel reply

Aemet’s big question is if 2025 will definitely end the drought

We do not need robots that look like us. We need robots to do things for us

We increasingly understand the relationship between intestinal flora and sleep quality

The day that United Kingdom invaded Tenerife without knowing what was inside

A few months ago Ryanair raised her salary to her employees in Spain. Now he is claiming that they return it

Spacex has always been 10 years ahead of the competition. The problem is that in China that law no longer applies

The great drama of private employment in Spain: almost everyone would sacrifice him for being an official

2024 YR4 is not going to kill us, but could collide with the moon

The AI ​​is stuck

And despite everything, this is a great achievement of AI

What do you think?

Leave a ReplyCancel reply

Log In

Sign In

Forgot password?

Your password reset link appears to be invalid or expired.

Log in

Privacy Policy

Add to Collection

No Collections

Hey Friend!Before You Go…

The AI is stuck

Hey Friend!
Before You Go…