Mistral: Devstral Small 1.1
Mistral / Mistral AI
19scores
11benchmarks
$0.1 / $0.3 per 1M tokenscost in/out
Metadata
Mistral Open source
Aliases: devstral-small, devstral-small-2507, mistralai-devstral-small, mistralai-devstral-small-2507, mistralai/devstral-small, mistralai/devstral-small-2507
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| Tau2-Bench Telecom | Agentic | 206 | 38% | 2026-05-11 |
| Tau2-Bench Telecom | Agentic | 250 | 28.4% | 2026-05-11 |
| Terminal-Bench Hard | Agentic | 251 | 6.1% | 2026-05-11 |
| Terminal-Bench Hard | Agentic | 252 | 6.1% | 2026-05-11 |
| SciCode | Coding | 327 | 24.5% | 2026-05-11 |
| SciCode | Coding | 329 | 24.3% | 2026-05-11 |
| Artificial Analysis Intelligence Index | Intelligence | 274 | 18.03 | 2026-05-11 |
| Artificial Analysis Intelligence Index | Intelligence | 315 | 15.21 | 2026-05-11 |
| Humanity's Last Exam | Intelligence | 410 | 4% | 2026-05-11 |
| Humanity's Last Exam | Intelligence | 439 | 3.7% | 2026-05-11 |
| MMLU-Pro | Intelligence | 261 | 63.2% | 2026-05-11 |
| MMLU-Pro | Intelligence | 263 | 62.2% | 2026-05-11 |
| AIME 2025 | Math | 187 | 29.3% | 2026-05-11 |
| Design Arena | Multimodal | 121 | 865 | 2026-05-06 |
| Artificial Analysis Openness Index | Openness | 79 | 44.44 | 2026-05-11 |
| GPQA Diamond | Reasoning | 373 | 43.4% | 2026-05-11 |
| GPQA Diamond | Reasoning | 389 | 41.4% | 2026-05-11 |
| CritPt | Science | 175 | 0% | 2026-05-11 |
| CritPt | Science | 176 | 0% | 2026-05-11 |
No matching rows.