MM-Mind2Web
A multimodal web navigation benchmark comprising 2,000 open-ended tasks spanning 137 websites across 31 domains. Each task includes HTML documents paired with webpage screenshots, action sequences, and complex web interactions.
3rows
scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Nova Pro | 0.64 | Nova Pro 1.0 amazon-nova-pro-v1 | Self-reported | 2026-05-06 |
| 2 | Nova Lite | 0.61 | Nova Lite 1.0 amazon-nova-lite-v1 | Self-reported | 2026-05-06 |
| 3 | Qwen3-Coder 480B A35B Instruct | 0.56 | Qwen3 Coder 480B A35B qwen-qwen3-coder | Self-reported | 2026-05-06 |
No matching rows.