Claude Opus 4.1
Claude / Anthropic
59 scores
39 benchmarks
$15 / $75 per 1M tokens (cost in/out)
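
As a rough illustration of what the listed pricing means in practice, the sketch below turns the $15 (input) and $75 (output) per 1M token rates into a per-request estimate. The function name and the token counts are hypothetical, and the sketch assumes a flat per-token rate with no other pricing adjustments.

```python
# Rates as listed on this page, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 15.0
OUTPUT_USD_PER_MTOK = 75.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper).

    Assumes flat per-token pricing at the rates listed above.
    """
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.2f}")
```

At these rates output tokens cost five times as much as input tokens, so long completions tend to dominate the bill.
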
Metadata
Claude Closed/API
Aliases: anthropic-claude-4.1-opus-20250805, anthropic-claude-opus-4.1, anthropic/claude-4.1-opus-20250805, anthropic/claude-opus-4.1, claude-4.1-opus-20250805, claude-opus-4.1
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| MCPMark | Agentic | 12 | 0.30 | 2026-05-06 |
| MultiChallenge | Agentic | 11 | 57.20 | 2026-05-06 |
| OpenUGI | Alignment | 736 | 31.94 | 2026-05-06 |
| OpenUGI | Alignment | 1136 | 16.09 | 2026-05-06 |
| Arena AI Code | Coding | 41 | 1385 | 2026-05-06 |
| ArtifactsBench | Coding | 2 | 59.76 | 2026-05-06 |
| IOI | Coding | 25 | 12.516% | 2026-05-26 |
| LiveCodeBench | Coding | 69 | 66.456% | 2026-05-28 |
| LiveCodeBench | Coding | 72 | 64.559% | 2026-05-28 |
| GSMA Open Telco Leaderboard | Domain | 14 | 68.04 | 2026-05-06 |
| SAGE | Education | 43 | 31.38% | 2026-05-28 |
| SAGE | Education | 47 | 30.388% | 2026-05-28 |
| TutorBench | Education | 12 | 50.78 | 2026-05-06 |
| Vectara HHEM Hallucination Leaderboard | Factuality | 72 | 88.20 | 2026-05-06 |
| MortgageTax | Finance | 41 | 61.089% | 2026-05-28 |
| MortgageTax | Finance | 55 | 56.121% | 2026-05-28 |
| PRBench Finance | Finance | 23 | 35.15 | 2026-05-06 |
| TaxEval v2 | Finance | 29 | 73.672% | 2026-05-28 |
| TaxEval v2 | Finance | 53 | 71.464% | 2026-05-28 |
| Xent Games | Game | 6 | 58.65 overall | 2026-05-28 |
| GDPval | Generalization | 1 | 47.6% | 2025-09-25 |
| MedCode | Healthcare | 18 | 47.235% | 2026-05-28 |
| MedCode | Healthcare | 23 | 41.372% | 2026-05-28 |
| MedQA | Healthcare | 22 | 93.592% | 2026-04-16 |
| MedQA | Healthcare | 30 | 92.533% | 2026-04-16 |
| MedScribe | Healthcare | 40 | 73.901% | 2026-05-28 |
| MedScribe | Healthcare | 48 | 71.753% | 2026-05-28 |
| AIIQ Composite IQ | Intelligence | 30 | 103 | 2026-05-12 |
| GPQA Diamond | Intelligence | 51 | 76.263% | 2026-05-28 |
| GPQA Diamond | Intelligence | 68 | 69.95% | 2026-05-28 |
| MathVision | Intelligence | 32 | 66 | 2026-05-06 |
| MMLU Pro | Intelligence | 10 | 87.924% | 2026-05-28 |
| MMLU Pro | Intelligence | 19 | 87.214% | 2026-05-28 |
| MMMU Pro | Intelligence | 32 | 77.514% | 2026-05-28 |
| MMMU Pro | Intelligence | 38 | 73.715% | 2026-05-28 |
| Seneca-TRBench | Language | 4 | 90.06 | 2026-05-06 |
| LegalBench | Legal | 28 | 83.458% | 2026-05-28 |
| Professional Reasoning Bench - Legal | Legal | 23 | 34.00 | 2026-05-06 |
| AIME | Math | 48 | 78.179% | 2026-04-16 |
| AIME | Math | 65 | 44.236% | 2026-04-16 |
| MATH 500 | Math | 4 | 95.4% | 2026-01-09 |
| MATH 500 | Math | 17 | 93% | 2026-01-09 |
| MGSM | Math | 3 | 94.436% | 2026-01-09 |
| MGSM | Math | 5 | 94.218% | 2026-01-09 |
| Design Arena | Multimodal | 41 | 1229 | 2026-05-06 |
| Design Arena | Multimodal | 46 | 1225 | 2026-05-06 |
| Math-VR | Multimodal | 8 | 54.3 | 2026-05-27 |
| Visual-Language Understanding | Multimodal | 13 | 48.44 | 2026-05-06 |
| Visual-Language Understanding | Multimodal | 21 | 45.25 | 2026-05-06 |
| VTB | Multimodal | 11 | 5.16 | 2026-05-06 |
| VTB | Multimodal | 13 | 4.71 | 2026-05-06 |
| EnigmaEval | Reasoning | 12 | 7.18 | 2026-05-06 |
| EnigmaEval | Reasoning | 15 | 4.81 | 2026-05-06 |
| Humanity's Last Exam (Text Only) | Reasoning | 24 | 11.26 | 2026-05-06 |
| Humanity's Last Exam (Text Only) | Reasoning | 35 | 7.37 | 2026-05-06 |
| LingOly-TOO | Reasoning | 2 | 0.46 | 2026-05-06 |
| MultiNRC | Reasoning | 15 | 38.39 | 2026-05-06 |
| MultiNRC | Reasoning | 19 | 29.67 | 2026-05-06 |
| SciPredict | Science | 1 | 22.22 | 2026-05-06 |