Google’s Gemini panicked when playing Pokémon

AI companies are battling to dominate the industry, but sometimes they’re also battling in Pokémon gyms. As Google and Anthropic both study how their latest AI models navigate early Pokémon games, the results can be as amusing as they are enlightening — and this time, Google DeepMind has written in a report that Gemini 2.5…

Debates over AI benchmarking have reached Pokémon

Not even Pokémon is safe from AI benchmarking controversy. Last week, a post on X went viral, claiming that Google’s latest Gemini model surpassed Anthropic’s flagship Claude model in the original Pokémon video game trilogy. Reportedly, Gemini had reached Lavender Town in a developer’s Twitch stream; Claude was stuck at Mount Moon as of late…
