Analysis finds Google AI Overviews is wrong 10 percent of the time

Searching for information on Google now often puts users in front of AI Overviews, an AI-driven feature powered by the Gemini model. Since its debut in 2024, AI Overviews has faced criticism over its inconsistent accuracy, although it shows signs of improvement. A recent analysis by the New York Times quantifies its performance: the AI provides accurate information about 90 percent of the time, which means roughly one answer in ten is wrong. At Google's scale, that error rate translates to hundreds of thousands of inaccurate responses every minute.

The analysis was conducted in collaboration with Oumi, a startup that specializes in AI model development. Oumi used AI tools to evaluate AI Overviews against SimpleQA, a widely recognized test designed to gauge the factual accuracy of generative models like Gemini. SimpleQA, released by OpenAI in 2024, comprises over 4,000 verifiable questions that can be posed to AI systems.

Oumi began its testing last year, when Gemini 2.5 was the latest model; at that point AI Overviews answered 85 percent of the questions correctly. After the Gemini 3 update, it answered 91 percent correctly. Even so, applying the remaining error rate across all Google searches implies that tens of millions of incorrect answers could be issued daily.

The report highlights specific instances where AI Overviews faltered. When asked for the date Bob Marley's former residence became a museum, AI Overviews cited three sources, two of which did not mention the date. The third, Wikipedia, listed conflicting years, and the AI picked the wrong one. In another case, when asked about Yo-Yo Ma's induction into the classical music hall of fame, AI Overviews misstated whether the hall exists, even though it cited the relevant organization's own website.
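The extrapolation from benchmark accuracy to a daily count of wrong answers is simple arithmetic, and a short sketch can make it concrete. The accuracy figures below (85 and 91 percent) come from the report; the daily answer volume is a purely hypothetical placeholder, since the article gives no such figure, and the function names are illustrative only.

```python
# Minimal sketch: scale a benchmark error rate to an assumed answer volume.
# The accuracy figures are from the report; the volume is NOT. It is a
# hypothetical number chosen only to show how the arithmetic works.

MINUTES_PER_DAY = 24 * 60


def error_rate(accuracy_percent: float) -> float:
    """Convert an accuracy percentage into an error fraction (0.0 to 1.0)."""
    return (100.0 - accuracy_percent) / 100.0


def errors_per_day(accuracy_percent: float, answers_per_day: float) -> float:
    """Expected number of wrong answers per day at the given volume."""
    return error_rate(accuracy_percent) * answers_per_day


if __name__ == "__main__":
    # Hypothetical volume of AI Overviews answers per day (an assumption).
    hypothetical_answers_per_day = 500_000_000

    for label, accuracy in [("Gemini 2.5", 85.0), ("Gemini 3", 91.0)]:
        daily = errors_per_day(accuracy, hypothetical_answers_per_day)
        per_minute = daily / MINUTES_PER_DAY
        print(f"{label}: ~{daily:,.0f} wrong per day, ~{per_minute:,.0f} per minute")
```

The result depends entirely on the assumed volume, and on the further assumption that accuracy on SimpleQA questions carries over to everyday searches, so the output is an illustration of scale rather than a measurement.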

Source: Ars Technica

Published On : Apr 07, 2026, 16:55
