WMT23
The Eighth Conference on Machine Translation (WMT23) benchmark evaluates machine translation systems across 8 language pairs (14 translation directions), covering general, biomedical, literary, and low-resource translation tasks. It features specialized shared tasks for quality estimation, metrics evaluation, sign language translation, and discourse-level literary translation, with professional human assessment.
Rows: 4
Primary metric: Score
Sampled: 2026-05-06
Metadata
Metrics: Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Gemini 1.5 Pro | 0.75 | — | Self-reported | 2026-05-06 |
| 2 | Gemini 1.5 Flash | 0.74 | — | Self-reported | 2026-05-06 |
| 3 | Gemini 1.5 Flash 8B | 0.73 | — | Self-reported | 2026-05-06 |
| 4 | Gemini 1.0 Pro | 0.72 | — | Imported | 2026-05-06 |