GPT-5.1-Codex
Codex / OpenAI
18scores
18benchmarks
$1.25 / $10 per 1M tokenscost in/out
Metadata
Codex Closed/API
Aliases: gpt-5.1-codex, gpt-5.1-codex-20251113, openai-gpt-5.1-codex, openai-gpt-5.1-codex-20251113, openai/gpt-5.1-codex, openai/gpt-5.1-codex-20251113
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| APEX-Agents | Agentic | 14 | 34.90 | 2026-05-06 |
| Gert Labs Rankings | Agentic | 34 | 0.47 | 2026-05-11 |
| Tau2-Bench Telecom | Agentic | 100 | 83% | 2026-05-11 |
| Terminal-Bench Hard | Agentic | 62 | 34.8% | 2026-05-11 |
| ALE-Bench | Coding | 12 | 1244.92 | 2026-05-06 |
| Arena AI Code | Coding | 53 | 1329 | 2026-05-06 |
| IOI | Coding | 44 | 3.666% | 2026-05-26 |
| LiveCodeBench | Coding | 15 | 85.55% | 2026-05-28 |
| SciCode | Coding | 95 | 40.2% | 2026-05-11 |
| Vibe Code Bench v1.1 | Coding | 33 | 13.115% | 2026-05-28 |
| Artificial Analysis Intelligence Index | Intelligence | 52 | 43.11 | 2026-05-11 |
| Humanity's Last Exam | Intelligence | 54 | 23.4% | 2026-05-11 |
| LiveBench | Intelligence | 31 | 69.31 | 2026-05-05 |
| MMLU-Pro | Intelligence | 23 | 86% | 2026-05-11 |
| AIME 2025 | Math | 8 | 95.7% | 2026-05-11 |
| Design Arena | Multimodal | 57 | 1209 | 2026-05-06 |
| GPQA Diamond | Reasoning | 44 | 86% | 2026-05-11 |
| CritPt | Science | 31 | 5.7% | 2026-05-11 |
No matching rows.