FACTS Grounding
A Google DeepMind and Google Research benchmark for long-form factuality and grounding against provided document context of up to 32k tokens.
34 rows
Primary metric: score
Sampled: 2026-05-06
Metadata
Metrics: Combined Score, Separate Grounding Score, Separate Quality Score
| Rank | Subject | Combined Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 0.46 | — | Imported | 2026-05-06 |
| 2 | VIDraft/Gemma-3-R1984-27B | 0.43 | — | Imported | 2026-05-06 |
| 3 | meta-llama/Llama-3.3-70B-Instruct | 0.43 | Llama 3.3 70B Instruct (`meta-llama-llama-3.3-70b-instruct`) | Imported | 2026-05-06 |
| 4 | Qwen/Qwen3-30B-A3B | 0.43 | Qwen3 30B A3B (`qwen-qwen3-30b-a3b`) | Imported | 2026-05-06 |
| 5 | Qwen/Qwen3-4B | 0.43 | — | Imported | 2026-05-06 |
| 6 | google/medgemma-27b-text-it | 0.42 | — | Imported | 2026-05-06 |
| 7 | Qwen/Qwen3-32B | 0.42 | Qwen3 32B (`qwen-qwen3-32b`) | Imported | 2026-05-06 |
| 8 | deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | 0.41 | — | Imported | 2026-05-06 |
| 9 | deepseek-ai/DeepSeek-R1-Distill-Llama-8B | 0.41 | — | Imported | 2026-05-06 |
| 10 | Qwen/Qwen3-8B | 0.40 | Qwen3 8B (`qwen-qwen3-8b`) | Imported | 2026-05-06 |
| 11 | Qwen/Qwen3-14B | 0.38 | Qwen3 14B (`qwen-qwen3-14b`) | Imported | 2026-05-06 |
| 12 | google/gemma-3-27b-it | 0.38 | Gemma 3 27B (`google-gemma-3-27b-it`) | Imported | 2026-05-06 |
| 13 | google/medgemma-4b-it | 0.38 | — | Imported | 2026-05-06 |
| 14 | Qwen/Qwen2.5-VL-32B-Instruct | 0.36 | — | Imported | 2026-05-06 |
| 15 | meta-llama/Llama-3.1-70B-Instruct | 0.33 | Llama 3.1 70B Instruct (`meta-llama-llama-3.1-70b-instruct`) | Imported | 2026-05-06 |
| 16 | google/gemma-3-12b-it | 0.31 | Gemma 3 12B (`google-gemma-3-12b-it`) | Imported | 2026-05-06 |
| 17 | google/gemma-3-4b-it | 0.30 | Gemma 3 4B (`google-gemma-3-4b-it`) | Imported | 2026-05-06 |
| 18 | Qwen/Qwen3-1.7B | 0.30 | — | Imported | 2026-05-06 |
| 19 | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 0.28 | — | Imported | 2026-05-06 |
| 20 | Qwen/Qwen3-0.6B | 0.27 | — | Imported | 2026-05-06 |
| 21 | Qwen/Qwen2.5-7B-Instruct | 0.26 | Qwen2.5 7B Instruct (`qwen-qwen-2.5-7b-instruct`) | Imported | 2026-05-06 |
| 22 | Qwen/Qwen2.5-14B-Instruct-1M | 0.25 | — | Imported | 2026-05-06 |
| 23 | nvidia/Llama-Nemotron-Nano-8B | 0.24 | — | Imported | 2026-05-06 |
| 24 | OpenScholar/Llama-3.1-OpenScholar-8B | 0.24 | — | Imported | 2026-05-06 |
| 25 | Qwen/Qwen2.5-7B-Instruct-1M | 0.21 | — | Imported | 2026-05-06 |
| 26 | nvidia/Llama-Nemotron-Nano-4B-v1.1 | 0.20 | — | Imported | 2026-05-06 |
| 27 | google/gemma-3-1b-it | 0.19 | — | Imported | 2026-05-06 |
| 28 | mistralai/Ministral-8B-Instruct-2410 | 0.17 | — | Imported | 2026-05-06 |
| 29 | meta-llama/Llama-3.1-8B-Instruct | 0.17 | Llama 3.1 8B Instruct (`meta-llama-llama-3.1-8b-instruct`) | Imported | 2026-05-06 |
| 30 | mistralai/Mistral-Small-3.1-24B-Instruct-2503 | 0.16 | Mistral: Mistral Small 3.1 24B (`mistralai-mistral-small-3.1-24b-instruct`) | Imported | 2026-05-06 |
| 31 | mistralai/Mistral-Small-24B-Instruct-2501 | 0.13 | Mistral: Mistral Small 3 (`mistralai-mistral-small-24b-instruct-2501`) | Imported | 2026-05-06 |
| 32 | open-thoughts/OpenThinker-7B | 0.11 | — | Imported | 2026-05-06 |
| 33 | PleIAs/Pleias-RAG-350M | 0.01 | — | Imported | 2026-05-06 |
| 34 | PleIAs/Pleias-RAG-1B | 0.00 | — | Imported | 2026-05-06 |