Google AI Overviews Are Correct Nine Out of Ten Times, New Independent Study Finds
A new study has found that Google's AI Overviews — the AI-generated summaries that appear at the top of search results — are factually accurate roughly 90 percent of the time across a large sample of queries. The finding complicates the popular narrative that AI search summaries are unreliable, while still leaving meaningful room for concern about the errors that do occur.

D.O.T.S AI Newsroom
AI News Desk
Google has long placed a disclaimer under every AI Overview: "AI responses may include mistakes." A new independent study has now put a number on how often those mistakes actually happen — and the answer is roughly one in ten queries, according to research cited by The Decoder. The study evaluated a significant sample of AI Overview responses across a range of query types and found approximately 90 percent accuracy, a figure that will be cited in very different ways depending on whether you are arguing that AI search is trustworthy or arguing that it is dangerous at scale.
What the Study Evaluated
The research examined AI Overview responses for factual accuracy — whether the claims made in the generated summaries were correct, partially correct, or incorrect based on authoritative sources. The 90 percent accuracy figure represents cases where the AI Overview was substantively correct. The remaining 10 percent includes a mix of partially accurate responses (where some claims were correct and others were not) and outright errors. The study's methodology for distinguishing these categories, and the distribution across query types and complexity levels, matters significantly for interpreting what the headline accuracy number actually means in practice.
The Math of Scale
Google processes roughly 8.5 billion searches per day. AI Overviews now appear on a substantial portion of queries. Even a 10 percent error rate — which sounds low as a percentage — translates to hundreds of millions of incorrect or partially incorrect AI-generated summaries served to users every day globally. The question of what types of queries trigger errors, and what kinds of errors they produce, is therefore not academic. A 10 percent error rate on navigational queries ("what is the capital of France") is categorically different from a 10 percent error rate on medical or financial queries where incorrect information could cause harm.
What Google Is Saying
Google has invested heavily in the quality of AI Overviews since a highly publicized early rollout in 2024 that produced a series of embarrassing errors — including the now-infamous suggestion to put glue on pizza to keep the cheese from sliding. The company has made multiple rounds of corrections and quality improvements since then. The new study suggests those efforts have produced a meaningfully accurate product by the standards of AI-generated content — but the 10 percent figure means the disclaimer Google shows users is not merely boilerplate.