AIME 2024

AIME 2024: Measures mathematical reasoning and competition-style problem solving on problems from the 2024 American Invitational Mathematics Examination (AIME).

Rows: 5
Primary metric: score
Sampled: 2026-05-27

Metadata

Metrics

Score

Latest Results

Rows are imported from the public Frontier Benchmarks AI AIME-2024 static HTML table.

Rank  Subject               Score  Model Match                              Provenance  Sampled
1     Grok 4                94.0   GROK Grok 4 (x-ai-grok-4)                Imported    2026-05-27
2     Magistral Medium 1.2  91.8   -                                        Imported    2026-05-27
3     DeepSeek R1 0528      91.4   R1 0528 (deepseek-deepseek-r1-0528)      Imported    2026-05-27
4     Mistral Large 3       53.3   -                                        Imported    2026-05-27
5     Reka Flash 3          51.0   REKA Reka Flash 3 (rekaai-reka-flash-3)  Imported    2026-05-27
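
The import step mentioned above (reading rows from a static HTML table and ranking them by the primary metric) might look roughly like the sketch below. It is not the site's actual importer: the markup in `SAMPLE_HTML`, the class name `TableRowParser`, and the function `import_rows` are all hypothetical, and only the subject names and scores are taken from the table above. It uses Python's standard-library `html.parser` and assumes a simple table with one header row and no nested tables.

```python
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (for example whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical markup; only the cell values come from the table above.
SAMPLE_HTML = """
<table>
  <tr><th>Subject</th><th>Score</th></tr>
  <tr><td>Mistral Large 3</td><td>53.3</td></tr>
  <tr><td>Grok 4</td><td>94.0</td></tr>
  <tr><td>DeepSeek R1 0528</td><td>91.4</td></tr>
</table>
"""


def import_rows(html):
    """Parse the table, convert scores to floats, and rank by score (descending)."""
    parser = TableRowParser()
    parser.feed(html)
    parser.close()
    header, *body = parser.rows
    records = [dict(zip(header, cells)) for cells in body]
    for record in records:
        record["Score"] = float(record["Score"])
    records.sort(key=lambda r: r["Score"], reverse=True)
    return [{"Rank": rank, **record} for rank, record in enumerate(records, start=1)]


ranked = import_rows(SAMPLE_HTML)
for row in ranked:
    print(row["Rank"], row["Subject"], row["Score"])
```

Ranking is recomputed from the scores rather than read from the source, so the sketch does not depend on the order in which rows appear in the HTML.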