WMT23

The Eighth Conference on Machine Translation (WMT23) benchmark evaluating machine translation systems across 8 language pairs (14 translation directions) including general, biomedical, literary, and low-resource language translation tasks. Features specialized shared tasks for quality estimation, metrics evaluation, sign language translation, and discourse-level literary translation with professional human assessment.

4rows
scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank Subject Score Model Match Provenance Sampled
1 Gemini 1.5 Pro 0.75 Self-reported 2026-05-06
2 Gemini 1.5 Flash 0.74 Self-reported 2026-05-06
3 Gemini 1.5 Flash 8B 0.73 Self-reported 2026-05-06
4 Gemini 1.0 Pro 0.72 Imported 2026-05-06