MRCR v2

MRCR v2 (Multi-Round Coreference Resolution, version 2) is the second version of MRCR, a synthetic long-context reasoning task. It extends the original MRCR framework with revised evaluation criteria and added complexity, and it tests whether a model can sustain attention and reasoning across extended contexts.

Rows: 5
Primary metric: score
Sampled: 2026-05-06

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank  Subject                Score  Model Match                                            Provenance     Sampled
1     Gemma 4 31B            0.66   Gemma 4 31B (google-gemma-4-31b-it)                    Self-reported  2026-05-06
2     Gemma 4 26B-A4B        0.44   Gemma 4 26B A4B (google-gemma-4-26b-a4b-it)            Self-reported  2026-05-06
3     Gemma 4 E4B            0.25   -                                                      Self-reported  2026-05-06
4     Gemma 4 E2B            0.19   -                                                      Self-reported  2026-05-06
5     Gemini 2.5 Flash-Lite  0.17   Gemini 2.5 Flash Lite (google-gemini-2.5-flash-lite)   Self-reported  2026-05-06
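
For readers who want to work with these results programmatically, here is a minimal sketch of the table as plain Python data. The `Row` class and the `rank` and `unmatched` helpers are hypothetical names introduced only for illustration and are not part of any published MRCR v2 tooling. The scores and model identifiers are copied from the rows above. The sketch assumes that rank is simply descending order of score, which is consistent with the listed ranks but is not stated on the page.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Row:
    """One leaderboard entry (hypothetical structure, for illustration only)."""
    subject: str
    score: float
    model_id: Optional[str]  # matched model identifier; None when the table lists none


# Values copied from the Latest Results table; all rows are self-reported.
ROWS = [
    Row("Gemma 4 31B", 0.66, "google-gemma-4-31b-it"),
    Row("Gemma 4 26B-A4B", 0.44, "google-gemma-4-26b-a4b-it"),
    Row("Gemma 4 E4B", 0.25, None),
    Row("Gemma 4 E2B", 0.19, None),
    Row("Gemini 2.5 Flash-Lite", 0.17, "google-gemini-2.5-flash-lite"),
]


def rank(rows: List[Row]) -> List[Tuple[int, Row]]:
    """Pair each row with a 1-based rank, assuming rank = descending score."""
    ordered = sorted(rows, key=lambda r: r.score, reverse=True)
    return list(enumerate(ordered, start=1))


def unmatched(rows: List[Row]) -> List[str]:
    """Return the subjects that have no matched model identifier."""
    return [r.subject for r in rows if r.model_id is None]


if __name__ == "__main__":
    for position, row in rank(ROWS):
        print(position, row.subject, row.score, row.model_id or "-")
```

Sorting by score reproduces the order shown in the table, and `unmatched(ROWS)` picks out the two entries without a Model Match value.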