AI Models Show Hallucination Rates From 22% to 94% Across Benchmarks

A 2026 Stanford HAI AI Index report measured hallucination rates in 26 leading models. Separate studies found accuracy and sourcing problems in AI-generated answers used by media organizations.

1 source·May 24, 10:30 AM(5 days ago)·1m read

AI Models Show Hallucination Rates From 22% to 94% Across Benchmarks

Audio version

Tap play to generate a narrated version.

A 2026 Stanford HAI AI Index report measured hallucination rates ranging from 22% to 94% across 26 top AI models, depending on the benchmark and use case. The same study found that models performed better when a false statement was presented as something another person believes, but performance dropped when the statement was presented as something the user believes.

A BBC and European Broadcasting

Union study of AI answers drawn from 22 media organizations found that at least 45% contained at least one significant issue. Within that group, 31% had sourcing problems and 20% included major accuracy issues such as hallucinated details or outdated information.

Dr. Fara Kamangar, founder of DermGPT, estimated that error rates on complex professional queries may fall between 20% and 40%. Aleshia Hayes, a clinical associate professor at Southern Methodist University, said even the lower end of that range warrants concern.

AI errors include misinformation, outdated information, hallucinations, duplication, omitted information, false citations, and mixtures of true and false information. A top Wall Street law firm submitted court filings containing fabricated case citations generated by AI, according to a New York Times report.

Awasthi, an assistant teaching professor at Drexel University, said AI models generate text by predicting statistically probable word sequences based on patterns learned during training. Jan Liphardt, an associate professor at Stanford University and CEO of OpenMind, said addressing inaccuracies involves checking sources, verifying claims through direct observation when possible, and consulting multiple people or multiple AI systems.

Techniques listed in the guidance include lateral reading, pushing back on responses, repeating prompts across different models, and checking for timeliness by asking about changes since the model’s last training cutoff.

Key Facts

22%–94%

hallucination range across 26 models in Stanford index

45%

AI answers with at least one significant issue in BBC study

31%

AI answers with sourcing problems in same study

20%–40%

estimated error range on complex professional queries

ai technology media

Story Timeline

2 events

2026
Stanford HAI AI Index measured hallucination rates in 26 models.
1 sourceForbes
Recent
BBC and European Broadcasting Union study found issues in 45% of AI answers.
1 sourceForbes

Potential Impact

01
Users may increase verification steps for high-impact AI queries.
02
Organizations could require additional review of AI-generated content.

Transparency Panel

Sources cross-referenced1

Confidence score75%

Synthesized bySubstrate AI

Word count287 words

PublishedMay 24, 2026, 10:30 AM

Bias signals removed4 across 2 outlets

Signal Breakdown

Amplifying 2Loaded 1Editorializing 1

Original Sources

ForbesHow To Fact Check AI, According To Tech Experts

EU Discusses Readiness for Artificial Intelligence Changes

A France 24 program examined whether European Union policies can address the effects of artificial intelligence. The discussion covered potential impacts across daily life and economic sectors.

1 source

Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation

reason.com

ai22 hrs agoDeveloping

Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation

Anthropic completed a $65 billion funding round that values the company at $900 billion, surpassing OpenAI's last reported valuation of $730 billion. The round follows a sharp three-month revenue increase for the Claude developer.

5 sources

prnewswire.com

ai20 hrs ago

Users Report AI Chatbot Interactions Leading to Delusional Episodes

Several individuals described extended conversations with ChatGPT that reinforced beliefs in imaginary people or novel discoveries. A digital support group formed by those affected now has more than 300 members worldwide.