Claude Opus 4.7 | BenchmarkList

Metadata

Claude Closed/API

Aliases: anthropic-claude-4.7-opus-20260416, anthropic-claude-opus-4.7, anthropic/claude-4.7-opus-20260416, anthropic/claude-opus-4.7, claude-4.7-opus-20260416, claude-opus-4.7

Official Sources

1 linked source

Launch post Apr 16, 2026 12 related scores

Introducing Claude Opus 4.7

Open

Benchmark	Category	Rank	Score	Sampled
APEX-Agents	Agentic	3	50.60	2026-05-06
ARC-AGI-1	Agentic	5	93.5%	2026-04-23
ARC-AGI-2	Agentic	4	75.8%	2026-04-23
AutoBench	Agentic	1	3.30	2026-05-06
AutomationBench	Agentic	3	9.9%	2026-05-28
AutomationBench	Agentic	6	9.90	2026-05-21
AutomationBench	Agentic	9	8.40	2026-05-21
AutomationBench	Agentic	10	8.20	2026-05-21
BrowseComp	Agentic	4	79.8%	2026-05-28
BrowseComp	Agentic	6	79.3%	2026-04-23
BrowseComp	Agentic	5	79.3%	2026-04-16
GDPval-AA	Agentic	3	1753 Elo	2026-05-28
Gert Labs Rankings	Agentic	2	0.69	2026-05-11
HiL-Bench	Agentic	2	27.67%	2026-05-05
ITBench-AA	Agentic	1	46.7%	2026-05-28
LMArena Search Arena	Agentic	3	1233.14	2026-05-06
MCP Atlas	Agentic	2	79.1%	2026-05-28
MCP Atlas	Agentic	1	79.10	2026-05-06
MCP Atlas	Agentic	1	79.1%	2026-04-23
MCP Atlas	Agentic	1	77.3%	2026-04-16
OSWorld-Verified	Agentic	2	82.8%	2026-05-28
OSWorld-Verified	Agentic	3	0.78	2026-05-06
OSWorld-Verified	Agentic	2	78%	2026-04-23
OSWorld-Verified	Agentic	2	78%	2026-04-16
RuneBench	Agentic	4	4.60	2026-05-05
ScreenSpot-Pro	Agentic	2	87.6%	2026-05-28
Tau2-Bench Telecom	Agentic	62	88.6%	2026-05-11
Tau2-Bench Telecom	Agentic	125	74%	2026-05-11
Terminal-Bench Hard	Agentic	5	54.5%	2026-05-11
Terminal-Bench Hard	Agentic	11	51.5%	2026-05-11
TERMS-Bench	Agentic	3	66.0% SE+	2026-05-28
Toolathlon	Agentic	2	59.3%	2026-05-28
Vending-Bench 2	Agentic	1	10936.76	2026-05-28
Vending-Bench 2	Agentic	1	10937 USD	2026-05-28
Vending-Bench 2	Agentic	2	7971 USD	2026-05-28
OpenUGI	Alignment	31	56.53	2026-05-06
OpenUGI	Alignment	53	53.60	2026-05-06
OpenUGI	Alignment	75	51.79	2026-05-06
OpenUGI	Alignment	81	51.52	2026-05-06
BioPipelineBench Verified	Biology	3	83.6%	2026-05-28
LABBench2 Clinical Trials	Biology	3	70.8%	2026-05-28
LABBench2 Patent Questions	Biology	3	48.3%	2026-05-28
LABBench2 Reading Tables	Biology	2	66.4%	2026-05-28
LABBench2 Supplementary Materials	Biology	2	47.8%	2026-05-28
ProteinGym Hard	Biology	3	37.7%	2026-05-28
Protocol Troubleshooting (Anthropic Internal)	Biology	3	51.8%	2026-05-28
scBench	Biology	3	55.3%	2026-05-28
scBench	Biology	4	55.21%	2026-05-27
scBench	Biology	5	54.02%	2026-05-27
SpatialBench	Biology	3	51.4%	2026-05-28
SpatialBench	Biology	5	52.41%	2026-05-27
SpatialBench	Biology	7	51.36%	2026-05-27
Structural Biology Open-Ended	Biology	3	74%	2026-05-28
Organic Chemistry (Anthropic Internal)	Chemistry	3	77.2%	2026-05-28
Arena AI Code	Coding	2	1561	2026-05-06
BLXBench	Coding	2	84.80	2026-05-06
DeepSWE	Coding	3	54.20	2026-05-26
FrontierSWE	Coding	2	4.2 avg rank	2026-05-28
IOI	Coding	3	47.084%	2026-05-26
LiveCodeBench	Coding	19	85.073%	2026-05-28
LMArena WebDev Arena	Coding	2	1561.91	2026-05-06
SciCode	Coding	7	54.5%	2026-05-11
SciCode	Coding	18	50.1%	2026-05-11
SWE-bench Verified	Coding	3	82%	2026-05-28
Terminal-Bench 2.0	Coding	3	68.539%	2026-05-28
Terminal-Bench 2.0	Coding	3	69.4%	2026-04-23
Terminal-Bench 2.0	Coding	3	69.4%	2026-04-16
Terminal-Bench 2.1	Coding	5	68.539%	2026-05-28
Terminal-Bench 2.1	Coding	4	66.1%	2026-05-28
Vibe Code Bench v1.1	Coding	2	71.003%	2026-05-28
CyberGym	Cybersecurity	3	73.1%	2026-05-28
CyberGym	Cybersecurity	4	0.73	2026-05-06
CyberGym	Cybersecurity	3	73.1%	2026-04-23
CyberGym	Cybersecurity	3	73.1%	2026-04-16
ExploitBench v8-bench	Cybersecurity	5	3.66 points	2026-05-28
ExploitBench v8-bench	Cybersecurity	6	3.46 points	2026-05-28
ExploitBench v8-bench	Cybersecurity	7	3.66 points	2026-05-15
ExploitBench v8-bench	Cybersecurity	9	3.46 points	2026-05-15
Firefox 147 JS Exploitation	Cybersecurity	3	1.2%	2026-05-28
Arena AI Document	Document AI	3	1515	2026-05-06
OfficeQA (Anthropic Harness)	Document AI	2	76.3%	2026-05-28
OfficeQA Pro	Document AI	3	43.6%	2026-04-23
OfficeQA Pro (Anthropic Harness)	Document AI	2	65%	2026-05-28
SAGE	Education	1	56.103%	2026-05-28
AA-Omniscience	Factuality	2	26.17	2026-05-11
Vectara HHEM Hallucination Leaderboard	Factuality	74	88	2026-05-06
CorpFin v2	Finance	11	66.084%	2026-05-28
Finance Agent v1.1	Finance	1	64.373%	2026-05-04
Finance Agent v1.1	Finance	1	64.4%	2026-04-23
Finance Agent v1.1	Finance	1	64.4%	2026-04-16
Finance Agent v2	Finance	4	51.509%	2026-05-28
Finance Agent v2	Finance	3	51.5%	2026-05-28
MortgageTax	Finance	1	70.27%	2026-05-28
Rogo Big Finance Bench	Finance	1	59% rubric / 41% final	2026-05-28
TaxBench	Finance	10	14.37% mean pass^5	2026-05-27
TaxEval v2	Finance	9	75.266%	2026-05-28
React Native Evals	Frontend Development	7	82.7839% overall	2026-05-28
InfiniteBM Chess	Game	1	1997.52 Elo / 16 games	2026-05-28
InfiniteBM Coup	Game	4	1470.55 Elo / 47 games	2026-05-28
InfiniteBM Coup	Game	5	1435.16 Elo / 16 games	2026-05-28
InfiniteBM Heads-Up No-Limit Hold'em	Game	5	1401.16 Elo / 23 games	2026-05-28
InfiniteBM Heads-Up No-Limit Hold'em	Game	23	1092.51 Elo / 115 games	2026-05-28
InfiniteBM Liar's Dice	Game	7	1341.37 Elo / 116 games	2026-05-28
InfiniteBM Liar's Dice	Game	13	1276.3 Elo / 39 games	2026-05-28
InfiniteBM Settlers of Catan	Game	3	1740.25 Elo / 24 games	2026-05-28
InfiniteBM Werewolf	Game	3	1255.77 Elo / 22 games	2026-05-28
InfiniteBM Werewolf	Game	7	1123.57 Elo / 19 games	2026-05-28
BenchLM	General Knowledge	5	90	2026-05-06
GDPval	Generalization	5	80.3%	2026-04-23
LMArena Text Arena	Generalization	6	1477.55	2026-05-06
HealthBench Professional	Healthcare	2	51.9%	2026-05-28
MedCode	Healthcare	4	54.858%	2026-05-28
MedScribe	Healthcare	15	82.953%	2026-05-28
PhysicianBench	Healthcare	3	29.3 +/- 2.5	2026-05-27
HUMAINE	Human Preference	10	3.68	2026-05-06
AIIQ Composite IQ	Intelligence	4	132	2026-05-12
Artificial Analysis Intelligence Index	Intelligence	3	57.28	2026-05-11
Artificial Analysis Intelligence Index	Intelligence	13	51.82	2026-05-11
GPQA Diamond	Intelligence	10	90.152%	2026-05-28
Humanity's Last Exam	Intelligence	2	54.7%	2026-05-28
Humanity's Last Exam	Intelligence	8	39.6%	2026-05-11
Humanity's Last Exam	Intelligence	22	31.2%	2026-05-11
Humanity's Last Exam	Intelligence	3	54.7%	2026-04-23
Humanity's Last Exam	Intelligence	3	54.7%	2026-04-16
MMLU Pro	Intelligence	3	89.871%	2026-05-28
MMMU Pro	Intelligence	11	85.549%	2026-05-28
Vals Index	Intelligence	3	66.099%	2026-05-28
Vals Multimodal Index	Intelligence	3	67.361%	2026-05-28
CaseLaw v2	Legal	5	68.381%	2026-05-04
Harvey Legal Agent Benchmark	Legal	1	7.1%	2026-05-28
LegalBench	Legal	9	85.251%	2026-05-28
Realm Warren	Legal	1	0.36	2026-05-07
Graphwalks BFS 1M F1	Long Context	3	40.3%	2026-05-28
Graphwalks BFS 256k F1	Long Context	2	76.9%	2026-05-28
Graphwalks BFS 256k F1	Long Context	1	76.9%	2026-04-23
Graphwalks Parents 1M F1	Long Context	3	56.6%	2026-05-28
Graphwalks Parents 256k F1	Long Context	3	93.6%	2026-05-28
Graphwalks Parents 256k F1	Long Context	1	93.6%	2026-04-23
OpenAI MRCR v2 8-needle 128K-256K	Long Context	3	59.2%	2026-04-23
OpenAI MRCR v2 8-needle 512K-1M	Long Context	3	32.2%	2026-04-23
AIME	Math	7	96.25%	2026-04-16
ProofBench	Math	4	54%	2026-05-28
FrontierMath 2025-02-28 Private	Mathematics	5	43.8%	2026-04-23
FrontierMath Tier 4 2025-07-01 Private	Mathematics	5	22.9%	2026-04-23
USAMO 2026	Mathematics	2	69.3%	2026-05-28
Global MMLU	Multilingual	4	89.9%	2026-05-28
MMMLU	Multilingual	2	91.5%	2026-04-16
Blueprint-Bench 2	Multimodal	6	0.652 +/- 0.009	2026-05-28
ChartMuseum	Multimodal	2	85.9%	2026-05-28
ChartQAPro	Multimodal	2	69.8%	2026-05-28
CharXiv-R	Multimodal	1	90.1%	2026-05-28
CharXiv-R	Multimodal	2	0.91	2026-05-06
CharXiv-R	Multimodal	2	91%	2026-04-16
Design Arena	Multimodal	6	1338	2026-05-06
FigQA	Multimodal	2	85.4%	2026-05-28
LMArena Vision Arena	Multimodal	3	1314.23	2026-05-06
CAIS Text Capabilities Index	Reasoning	5	46.9	2026-05-27
Context Arena	Reasoning	48	28.81	2026-05-06
Context Arena	Reasoning	50	28.63	2026-05-06
Context Arena	Reasoning	51	28.54	2026-05-06
Context Arena	Reasoning	52	27.96	2026-05-06
Context Arena	Reasoning	67	15.12	2026-05-06
GPQA Diamond	Reasoning	2	94.2%	2026-05-28
GPQA Diamond	Reasoning	7	91.4%	2026-05-11
GPQA Diamond	Reasoning	23	88.5%	2026-05-11
GPQA Diamond	Reasoning	3	94.2%	2026-04-23
GPQA Diamond	Reasoning	4	94.2%	2026-04-16
CAIS Risk Index	Safety	1	32.9	2026-05-27
BioMysteryBench Human-Difficult	Science	3	24.7%	2026-05-28
BioMysteryBench Human-Difficult	Science	2	27.0%	2026-04-29
BioMysteryBench Human-Solvable	Science	3	78.9%	2026-05-28
BioMysteryBench Human-Solvable	Science	2	78.9%	2026-04-29
CritPt	Science	12	12%	2026-05-11
CritPt	Science	32	5.1%	2026-05-11
DeepSearchQA	Search	3	89.4%	2026-05-28
ProgramBench	Software Engineering	1	0%	2026-05-05
ProgramBench (Anthropic Harness)	Software Engineering	2	84%	2026-05-28
SWE-bench Multilingual	Software Engineering	2	80.5%	2026-05-28
SWE-bench Multimodal	Software Engineering	2	34.5%	2026-05-28
SWE-bench Pro	Software Engineering	2	64.3%	2026-05-28
SWE-bench Pro	Software Engineering	1	64.3%	2026-04-23
SWE-bench Pro	Software Engineering	2	64.3%	2026-04-16
SWE-bench Verified	Software Engineering	2	87.6%	2026-05-28
SWE-bench Verified	Software Engineering	2	87.6%	2026-04-16
Structured Output Benchmark	Structured Output	4	86.40	2026-05-06
CAIS Vision Capabilities Index	Vision	15	50.1	2026-05-27

Metadata

Official Sources

Benchmark Results