OMLAB Open Agent Multimodal Leaderboard
Open Agent Leaderboard multimodal track comparing visual-agent configurations by score, pass rate, and token usage.
Rows: 9
Primary metric: score
Sampled: 2026-05-06
Metadata
Metrics
Score, Pass Rate, Total Input Tokens (lower is better), Total Output Tokens (lower is better), All Tokens (lower is better)
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | ZoomEye + Qwen2.5-VL-72B-Instruct | 51.56 | — | Imported | 2026-05-06 |
| 2 | ZoomEye + Qwen2.5-VL-7B-Instruct | 48.06 | — | Imported | 2026-05-06 |
| 3 | IO + Qwen2.5-VL-72B-Instruct | 44.47 | — | Imported | 2026-05-06 |
| 4 | ZoomEye + InternVL2.5-8B | 43.42 | — | Imported | 2026-05-06 |
| 5 | IO + InternVL2.5-8B | 42.95 | — | Imported | 2026-05-06 |
| 6 | IO + Qwen2.5-VL-7B-Instruct | 42.86 | — | Imported | 2026-05-06 |
| 7 | ZoomEye + Llava-v1.5-7B | 31.60 | — | Imported | 2026-05-06 |
| 8 | IO + Llava-v1.5-7B | 24.79 | — | Imported | 2026-05-06 |
| 9 | V* + seal_vqa & seal_vsm | 15.14 | — | Imported | 2026-05-06 |
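As a reading aid, the per-model gap between the ZoomEye and IO rows can be computed directly from the scores in the table. The sketch below is a minimal illustration, not part of the leaderboard itself: the function name `gain_over_reference` is hypothetical, and treating IO as the reference configuration is an assumption, since the page does not define what IO denotes.

```python
# Scores copied from the leaderboard table, keyed by (agent, model).
scores = {
    ("ZoomEye", "Qwen2.5-VL-72B-Instruct"): 51.56,
    ("ZoomEye", "Qwen2.5-VL-7B-Instruct"): 48.06,
    ("IO", "Qwen2.5-VL-72B-Instruct"): 44.47,
    ("ZoomEye", "InternVL2.5-8B"): 43.42,
    ("IO", "InternVL2.5-8B"): 42.95,
    ("IO", "Qwen2.5-VL-7B-Instruct"): 42.86,
    ("ZoomEye", "Llava-v1.5-7B"): 31.60,
    ("IO", "Llava-v1.5-7B"): 24.79,
    ("V*", "seal_vqa & seal_vsm"): 15.14,
}


def gain_over_reference(scores, agent="ZoomEye", reference="IO"):
    """Return {model: agent score minus reference score}.

    Only models that appear with both the agent and the reference are
    included. Using IO as the reference is an assumption for illustration.
    """
    gains = {}
    for (name, model), score in scores.items():
        if name == agent and (reference, model) in scores:
            gains[model] = round(score - scores[(reference, model)], 2)
    return gains


gains = gain_over_reference(scores)
# Print the largest gap first.
for model, delta in sorted(gains.items(), key=lambda kv: -kv[1]):
    print(f"{model}: {delta:+.2f}")
```

The V* row has no IO counterpart, so it is left out of the comparison. The differences are in the same units as the Score column and say nothing about pass rate or token usage, which this table does not show.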