MMBench
MMBench: Evaluates multimodal understanding across image, text, chart, diagram, or cross-modal reasoning tasks.
7rows
scoreprimary metric
2026-05-27sampled
Metadata
Metrics
MMBench-EN
| Rank | Subject | MMBench-EN | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Qwen3.5-9B | 90.1 | Qwen3.5-9B qwen-qwen3.5-9b | Imported | 2026-05-27 |
| 2 | Qwen3.5-4B | 89.4 | — | Imported | 2026-05-27 |
| 3 | InternVL3-78B | 89 | — | Imported | 2026-05-27 |
| 4 | Qwen3-VL-30B-A3B | 88.9 | — | Imported | 2026-05-27 |
| 5 | Kimi-VL-A3B-Thinking | 84.4 | — | Imported | 2026-05-27 |
| 6 | Gemini-2.5-FL-Lite | 82.7 | — | Imported | 2026-05-27 |
| 7 | GPT-5-Nano | 80.3 | GPT-5 Nano openai-gpt-5-nano | Imported | 2026-05-27 |
No matching rows.