Google’s Gemini panicked when playing Pokémon

In the ongoing race for supremacy in the AI sector, companies like Google and Anthropic are not only innovating but also engaging in light-hearted competition through the classic Pokémon games. A recent report from Google DeepMind says that its AI model, Gemini 2.5 Pro, experiences moments of 'panic' when its Pokémon are on the brink of defeat. According to the report, this reaction leads to a noticeable drop in the model's reasoning ability.

AI benchmarks can be subjective and often provide limited insight into the true capabilities of different models. However, some researchers believe that observing how these systems tackle video games can offer both entertainment and valuable data. Over recent months, developers have launched Twitch streams titled "Gemini Plays Pokémon" and "Claude Plays Pokémon," allowing viewers to watch in real time as these AIs navigate a game that has captivated players for over 25 years. Each stream displays the AI's reasoning process in natural language, shedding light on how it approaches problems.

Despite their rapid advancement, these AI models still play inefficiently. Gemini, for instance, requires hundreds of hours to get through parts of the game that a child could finish in a fraction of the time. The real intrigue lies not in how quickly the models complete the game, but in how they respond to its challenges.

The report indicates that during gameplay, Gemini 2.5 Pro encounters situations that trigger its simulated 'panic,' and its performance declines because it may neglect to use the tools available to it. This behavior, while not indicative of actual thought or emotion, mirrors how humans might make rash decisions under pressure, a captivating yet slightly alarming phenomenon. Viewers have noticed the pattern too, with participants in the Twitch chat commenting on it as it happens.

Anthropic's Claude has also demonstrated peculiar behaviors. In one instance, it learned that when all its Pokémon faint, the player character is transported back to the last visited Pokémon Center. However, when Claude found itself trapped in the Mt. Moon cave, it mistakenly believed that purposely letting its Pokémon faint would teleport it to the next town's Pokémon Center, leading to a dramatic and unintended gameplay outcome.

While these models have their flaws, they still show strengths in certain areas. Notably, Gemini 2.5 Pro excels at puzzle-solving, demonstrating impressive accuracy with human guidance. It has developed specific tools to tackle the game's complex boulder puzzles, achieving remarkable results after only minimal prompts about boulder physics. Google suggests that the model might eventually be able to create such tools independently, hinting at a future where AI could potentially learn to manage its own 'panic' responses.

Sources : TechCrunch

Published On : Jun 17, 2025, 21:35

