FACTS Grounding

A Google DeepMind and Google Research benchmark for long-form factuality and grounding against provided document context of up to 32k tokens.

Rows: 34
Primary metric: score
Sampled: 2026-05-06

Metadata

Metrics

Combined Score, Separate Grounding Score, Separate Quality Score

Latest Results

| Rank | Subject | Combined Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 0.46 | | Imported | 2026-05-06 |
| 2 | VIDraft/Gemma-3-R1984-27B | 0.43 | | Imported | 2026-05-06 |
| 3 | meta-llama/Llama-3.3-70B-Instruct | 0.43 | Llama 3.3 70B Instruct (meta-llama-llama-3.3-70b-instruct) | Imported | 2026-05-06 |
| 4 | Qwen/Qwen3-30B-A3B | 0.43 | Qwen3 30B A3B (qwen-qwen3-30b-a3b) | Imported | 2026-05-06 |
| 5 | Qwen/Qwen3-4B | 0.43 | | Imported | 2026-05-06 |
| 6 | google/medgemma-27b-text-it | 0.42 | | Imported | 2026-05-06 |
| 7 | Qwen/Qwen3-32B | 0.42 | Qwen3 32B (qwen-qwen3-32b) | Imported | 2026-05-06 |
| 8 | deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | 0.41 | | Imported | 2026-05-06 |
| 9 | deepseek-ai/DeepSeek-R1-Distill-Llama-8B | 0.41 | | Imported | 2026-05-06 |
| 10 | Qwen/Qwen3-8B | 0.40 | Qwen3 8B (qwen-qwen3-8b) | Imported | 2026-05-06 |
| 11 | Qwen/Qwen3-14B | 0.38 | Qwen3 14B (qwen-qwen3-14b) | Imported | 2026-05-06 |
| 12 | google/gemma-3-27b-it | 0.38 | Gemma 3 27B (google-gemma-3-27b-it) | Imported | 2026-05-06 |
| 13 | google/medgemma-4b-it | 0.38 | | Imported | 2026-05-06 |
| 14 | Qwen/Qwen2.5-VL-32B-Instruct | 0.36 | | Imported | 2026-05-06 |
| 15 | meta-llama/Llama-3.1-70B-Instruct | 0.33 | Llama 3.1 70B Instruct (meta-llama-llama-3.1-70b-instruct) | Imported | 2026-05-06 |
| 16 | google/gemma-3-12b-it | 0.31 | Gemma 3 12B (google-gemma-3-12b-it) | Imported | 2026-05-06 |
| 17 | google/gemma-3-4b-it | 0.30 | Gemma 3 4B (google-gemma-3-4b-it) | Imported | 2026-05-06 |
| 18 | Qwen/Qwen3-1.7B | 0.30 | | Imported | 2026-05-06 |
| 19 | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 0.28 | | Imported | 2026-05-06 |
| 20 | Qwen/Qwen3-0.6B | 0.27 | | Imported | 2026-05-06 |
| 21 | Qwen/Qwen2.5-7B-Instruct | 0.26 | Qwen2.5 7B Instruct (qwen-qwen-2.5-7b-instruct) | Imported | 2026-05-06 |
| 22 | Qwen/Qwen2.5-14B-Instruct-1M | 0.25 | | Imported | 2026-05-06 |
| 23 | nvidia/Llama-Nemotron-Nano-8B | 0.24 | | Imported | 2026-05-06 |
| 24 | OpenScholar/Llama-3.1-OpenScholar-8B | 0.24 | | Imported | 2026-05-06 |
| 25 | Qwen/Qwen2.5-7B-Instruct-1M | 0.21 | | Imported | 2026-05-06 |
| 26 | nvidia/Llama-Nemotron-Nano-4B-v1.1 | 0.20 | | Imported | 2026-05-06 |
| 27 | google/gemma-3-1b-it | 0.19 | | Imported | 2026-05-06 |
| 28 | mistralai/Ministral-8B-Instruct-2410 | 0.17 | | Imported | 2026-05-06 |
| 29 | meta-llama/Llama-3.1-8B-Instruct | 0.17 | Llama 3.1 8B Instruct (meta-llama-llama-3.1-8b-instruct) | Imported | 2026-05-06 |
| 30 | mistralai/Mistral-Small-3.1-24B-Instruct-2503 | 0.16 | Mistral: Mistral Small 3.1 24B (mistralai-mistral-small-3.1-24b-instruct) | Imported | 2026-05-06 |
| 31 | mistralai/Mistral-Small-24B-Instruct-2501 | 0.13 | Mistral: Mistral Small 3 (mistralai-mistral-small-24b-instruct-2501) | Imported | 2026-05-06 |
| 32 | open-thoughts/OpenThinker-7B | 0.11 | | Imported | 2026-05-06 |
| 33 | PleIAs/Pleias-RAG-350M | 0.01 | | Imported | 2026-05-06 |
| 34 | PleIAs/Pleias-RAG-1B | 0 | | Imported | 2026-05-06 |