FACTS Grounding
A factual grounding evaluation: measures whether a model's answer is supported by the provided source documents.
Updated 4 days ago · Latest measured Apr 18, 2026 · 5 verified · 0 self-reported
Verified results come from third-party or public leaderboard sources. Self-reported results come from provider papers, blogs, or vendor disclosures and should be compared with extra caution.
At a glance
Total results
5
Models tested
5
Providers
2
Verified · Self-reported
5 · 0
Average
68.13 %
Median
63.76 %
Range
30.81 – 91.86 %
Latest result
Apr 18, 2026
Score distribution
[Histogram omitted: the five scores cluster near 31 %, 64 %, and 91 %.]
Methodology
Source-attributed QA where each answer must cite the provided document. Scored on precision and factual accuracy.
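The precision scoring described above can be sketched as claim-level support checking. The real benchmark uses model-based judges; in this minimal sketch, a naive case-insensitive substring match stands in for the support check (an assumption for illustration, not the benchmark's actual method), and `grounding_precision` is a hypothetical helper name.

```python
def claim_supported(claim: str, document: str) -> bool:
    # Toy support check: case-insensitive containment.
    # (Stand-in for an LLM judge; an assumption, not the actual method.)
    return claim.lower() in document.lower()

def grounding_precision(claims: list[str], document: str) -> float:
    # Precision = supported claims / total claims; empty answers score 0.
    if not claims:
        return 0.0
    supported = sum(claim_supported(c, document) for c in claims)
    return supported / len(claims)

doc = "The Eiffel Tower is 330 metres tall. It was completed in 1889."
claims = ["completed in 1889", "located in Berlin"]
print(grounding_precision(claims, doc))  # 0.5: one of two claims supported
```

The key property this shape captures is that adding unsupported claims lowers the score, which penalizes answers that stray beyond the source document.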
Limitations
English-only. Does not test recall or abstention when the relevant fact is absent from the source.
By provider
- OpenAI · 3 models · Average: 71.08 % · Best: 91.86 % (GPT-5.4)
- Anthropic · 2 models · Average: 63.70 % · Best: 63.76 % (Claude Haiku 4.5)
Full leaderboard
Showing 5 of 5

| # | Model | Provider | Score (%) |
|---|---|---|---|
| 1 | GPT-5.4 | OpenAI | 91.86 |
| 2 | GPT-5.4 nano | OpenAI | 90.58 |
| 3 | Claude Haiku 4.5 | Anthropic | 63.76 |
| 4 | Claude Sonnet 4.6 | Anthropic | 63.64 |
| 5 | GPT-5.4 mini | OpenAI | 30.81 |
Community ratings
No ratings yet.