MEGA MLQA
MLQA as part of the MEGA (Multilingual Evaluation of Generative AI) benchmark suite. A multi-way aligned extractive QA evaluation benchmark for cross-lingual question answering across 7 languages (English, Arabic, German, Spanish, Hindi, Vietnamese, and Simplified Chinese) with over 12K QA instances in English and 5K in each other language.
2rows
scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Phi-3.5-MoE-instruct | 0.65 | — | Self-reported | 2026-05-06 |
| 2 | Phi-3.5-mini-instruct | 0.62 | — | Self-reported | 2026-05-06 |
No matching rows.