Claude Opus 4.1

Claude / Anthropic

59 scores
39 benchmarks
Cost (input / output): $15 / $75 per 1M tokens
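
As a rough illustration of what the listed rates mean per request, the sketch below estimates cost from token counts. It assumes only the $15 (input) and $75 (output) per 1M token figures shown above; the function name and the example token counts are hypothetical, and it ignores any discounts, caching, or batch pricing that may apply.

```python
# Rates taken from the listing above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 15
OUTPUT_USD_PER_MILLION = 75


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Sum in integer units first, then scale once by 1M tokens.
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token reply.
    print(f"${estimate_cost_usd(2_000, 500):.4f}")
```

Because output tokens cost five times as much as input tokens at these rates, long responses tend to dominate the bill even when prompts are large.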

Metadata

Claude Closed/API

Aliases: anthropic-claude-4.1-opus-20250805, anthropic-claude-opus-4.1, anthropic/claude-4.1-opus-20250805, anthropic/claude-opus-4.1, claude-4.1-opus-20250805, claude-opus-4.1

Benchmark Results

Benchmark | Category | Rank | Score | Sampled
--- | --- | --- | --- | ---
MCPMark | Agentic | 12 | 0.30 | 2026-05-06
MultiChallenge | Agentic | 11 | 57.20 | 2026-05-06
OpenUGI | Alignment | 736 | 31.94 | 2026-05-06
OpenUGI | Alignment | 1136 | 16.09 | 2026-05-06
Arena AI Code | Coding | 41 | 1385 | 2026-05-06
ArtifactsBench | Coding | 2 | 59.76 | 2026-05-06
IOI | Coding | 25 | 12.516% | 2026-05-26
LiveCodeBench | Coding | 69 | 66.456% | 2026-05-28
LiveCodeBench | Coding | 72 | 64.559% | 2026-05-28
GSMA Open Telco Leaderboard | Domain | 14 | 68.04 | 2026-05-06
SAGE | Education | 43 | 31.38% | 2026-05-28
SAGE | Education | 47 | 30.388% | 2026-05-28
TutorBench | Education | 12 | 50.78 | 2026-05-06
Vectara HHEM Hallucination Leaderboard | Factuality | 72 | 88.20 | 2026-05-06
MortgageTax | Finance | 41 | 61.089% | 2026-05-28
MortgageTax | Finance | 55 | 56.121% | 2026-05-28
PRBench Finance | Finance | 23 | 35.15 | 2026-05-06
TaxEval v2 | Finance | 29 | 73.672% | 2026-05-28
TaxEval v2 | Finance | 53 | 71.464% | 2026-05-28
Xent Games | Game | 6 | 58.65 overall | 2026-05-28
GDPval | Generalization | 1 | 47.6% | 2025-09-25
MedCode | Healthcare | 18 | 47.235% | 2026-05-28
MedCode | Healthcare | 23 | 41.372% | 2026-05-28
MedQA | Healthcare | 22 | 93.592% | 2026-04-16
MedQA | Healthcare | 30 | 92.533% | 2026-04-16
MedScribe | Healthcare | 40 | 73.901% | 2026-05-28
MedScribe | Healthcare | 48 | 71.753% | 2026-05-28
AIIQ Composite IQ | Intelligence | 30 | 103 | 2026-05-12
GPQA Diamond | Intelligence | 51 | 76.263% | 2026-05-28
GPQA Diamond | Intelligence | 68 | 69.95% | 2026-05-28
MathVision | Intelligence | 32 | 66 | 2026-05-06
MMLU Pro | Intelligence | 10 | 87.924% | 2026-05-28
MMLU Pro | Intelligence | 19 | 87.214% | 2026-05-28
MMMU Pro | Intelligence | 32 | 77.514% | 2026-05-28
MMMU Pro | Intelligence | 38 | 73.715% | 2026-05-28
Seneca-TRBench | Language | 4 | 90.06 | 2026-05-06
LegalBench | Legal | 28 | 83.458% | 2026-05-28
Professional Reasoning Bench - Legal | Legal | 23 | 34.00 | 2026-05-06
AIME | Math | 48 | 78.179% | 2026-04-16
AIME | Math | 65 | 44.236% | 2026-04-16
MATH 500 | Math | 4 | 95.4% | 2026-01-09
MATH 500 | Math | 17 | 93% | 2026-01-09
MGSM | Math | 3 | 94.436% | 2026-01-09
MGSM | Math | 5 | 94.218% | 2026-01-09
Design Arena | Multimodal | 41 | 1229 | 2026-05-06
Design Arena | Multimodal | 46 | 1225 | 2026-05-06
Math-VR | Multimodal | 8 | 54.3 | 2026-05-27
Visual-Language Understanding | Multimodal | 13 | 48.44 | 2026-05-06
Visual-Language Understanding | Multimodal | 21 | 45.25 | 2026-05-06
VTB | Multimodal | 11 | 5.16 | 2026-05-06
VTB | Multimodal | 13 | 4.71 | 2026-05-06
EnigmaEval | Reasoning | 12 | 7.18 | 2026-05-06
EnigmaEval | Reasoning | 15 | 4.81 | 2026-05-06
Humanity's Last Exam (Text Only) | Reasoning | 24 | 11.26 | 2026-05-06
Humanity's Last Exam (Text Only) | Reasoning | 35 | 7.37 | 2026-05-06
LingOly-TOO | Reasoning | 2 | 0.46 | 2026-05-06
MultiNRC | Reasoning | 15 | 38.39 | 2026-05-06
MultiNRC | Reasoning | 19 | 29.67 | 2026-05-06
SciPredict | Science | 1 | 22.22 | 2026-05-06