OCR Bench Britannica
Pairwise VLM-as-judge OCR benchmark ranking OCR models on Encyclopaedia Britannica document images with Bradley-Terry ELO scores.
6rows
eloprimary metric
2026-05-06sampled
Metadata
Metrics
ELO, Win%, Wins, Losses (lower is better), Ties, ELO Low, ELO High
| Rank | Subject | ELO | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | rednote-hilab/dots.mocr | 1714 | — | Imported | 2026-05-06 |
| 2 | lightonai/LightOnOCR-2-1B | 1600 | — | Imported | 2026-05-06 |
| 3 | FireRedTeam/FireRed-OCR | 1589 | — | Imported | 2026-05-06 |
| 4 | deepseek-ai/DeepSeek-OCR | 1434 | — | Imported | 2026-05-06 |
| 5 | baidu/Qianfan-OCR | 1426 | — | Imported | 2026-05-06 |
| 6 | rednote-hilab/dots.ocr | 1236 | — | Imported | 2026-05-06 |
No matching rows.