ScreenSpot-Pro

GUI grounding benchmark for professional high-resolution computer-use settings across development, creative, CAD, scientific, office, and operating-system applications.

27rows
scoreprimary metric
2026-05-28sampled

Metadata

Metrics

Grounding score

Showing 2 latest source slices.

Latest Results

Provider-published system-card benchmark scores parsed from Anthropic's Claude Opus 4.8 capability evaluation tables. Rows are marked self-reported and should be interpreted as source claims unless independently reproduced.

Rank Subject Grounding score Model Match Provenance Sampled
1 Claude Opus 4.8 87.9% Claude Opus 4.8
anthropic-claude-opus-4.8
Self-reported 2026-05-28
2 Claude Opus 4.7 87.6% Claude Opus 4.7
anthropic-claude-opus-4.7
Self-reported 2026-05-28
1 Hcompany/Holo2-235B-A22B 78.50 Imported 2026-05-06
2 Hcompany/Holo2-30B-A3B 75.20 Imported 2026-05-06
3 Hcompany/Holo2-8B 71.40 Imported 2026-05-06
4 Qwen/Qwen3.5-122B-A10B 70.40 Qwen3.5-122B-A10B
qwen-qwen3.5-122b-a10b
Imported 2026-05-06
5 Qwen/Qwen3.5-27B 70.30 Qwen3.5-27B
qwen-qwen3.5-27b
Imported 2026-05-06
6 inclusionAI/UI-Venus-1.5-30B-A3B 69.60 Imported 2026-05-06
7 Hcompany/Holo2-4B 68.60 Imported 2026-05-06
8 Qwen/Qwen3.5-35B-A3B 68.60 Qwen3.5-35B-A3B
qwen-qwen3.5-35b-a3b
Imported 2026-05-06
9 inclusionAI/UI-Venus-1.5-8B 68.40 Imported 2026-05-06
10 Qwen/Qwen3.5-397B-A17B 65.60 Qwen3.5 397B A17B
qwen-qwen3.5-397b-a17b
Imported 2026-05-06
11 Qwen/Qwen3.5-9B 65.20 Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-06
12 Salesforce/GTA1-32B 63.60 Imported 2026-05-06
13 Hcompany/Holo1.5-72B 63.30 Imported 2026-05-06
14 inclusionAI/UI-Venus-Ground-72B 61.90 Imported 2026-05-06
15 Qwen/Qwen3.5-4B 60.30 Imported 2026-05-06
16 Hcompany/Holo1.5-7B 57.90 Imported 2026-05-06
17 inclusionAI/UI-Venus-1.5-2B 57.70 Imported 2026-05-06
18 Qwen/Qwen3.5-2B 54.50 Imported 2026-05-06
19 Qwen/Qwen2.5-VL-72B-Instruct 53.30 Qwen2.5 VL 72B Instruct
qwen-qwen2.5-vl-72b-instruct
Imported 2026-05-06
20 moonshotai/Kimi-VL-A3B-Thinking-2506 51 Imported 2026-05-06
21 inclusionAI/UI-Venus-Ground-7B 50.80 Imported 2026-05-06
22 Qwen/Qwen2.5-VL-32B-Instruct 48 Imported 2026-05-06
23 Qwen/Qwen3.5-0.8B 46.50 Imported 2026-05-06
24 KDEGroup/UI-AGILE-3B 45 Imported 2026-05-06
25 microsoft/GUI-Actor-7B-Qwen2.5-VL 44.60 Imported 2026-05-06