MMBench

MMBench evaluates multimodal understanding across image, text, chart, diagram, and cross-modal reasoning tasks.

Rows: 7
Primary metric: score
Sampled: 2026-05-27

Metadata

Metrics

MMBench-EN

Latest Results

Rows are imported from direct MMBench-EN metric values in the public ALL-Bench JSON leaderboard.

Rank  Subject               MMBench-EN  Model Match                     Provenance  Sampled
1     Qwen3.5-9B            90.1        Qwen3.5-9B (qwen-qwen3.5-9b)    Imported    2026-05-27
2     Qwen3.5-4B            89.4        -                               Imported    2026-05-27
3     InternVL3-78B         89          -                               Imported    2026-05-27
4     Qwen3-VL-30B-A3B      88.9        -                               Imported    2026-05-27
5     Kimi-VL-A3B-Thinking  84.4        -                               Imported    2026-05-27
6     Gemini-2.5-FL-Lite    82.7        -                               Imported    2026-05-27
7     GPT-5-Nano            80.3        GPT-5 Nano (openai-gpt-5-nano)  Imported    2026-05-27
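
As a rough illustration of the import step described above, the sketch below ranks entries from a JSON leaderboard by a single metric. The ALL-Bench JSON schema is not shown on this page, so the field names `model` and `mmbench_en`, the sample payload, and the `rank_rows` helper are all assumptions made for the example, not the actual import code.

```python
import json

# Hypothetical payload: the real ALL-Bench schema is not documented here,
# so the "model" and "mmbench_en" field names are assumptions.
SAMPLE = """
[
  {"model": "GPT-5-Nano", "mmbench_en": 80.3},
  {"model": "Qwen3.5-9B", "mmbench_en": 90.1},
  {"model": "InternVL3-78B", "mmbench_en": 89},
  {"model": "hypothetical-unscored-model", "mmbench_en": null}
]
"""


def rank_rows(payload, metric="mmbench_en"):
    """Return (rank, model, score) tuples, highest score first.

    Entries whose metric is missing or non-numeric are skipped, mirroring
    the idea of importing only direct metric values.
    """
    entries = json.loads(payload)
    scored = [e for e in entries if isinstance(e.get(metric), (int, float))]
    scored.sort(key=lambda e: e[metric], reverse=True)
    return [(i, e["model"], e[metric]) for i, e in enumerate(scored, start=1)]


ranked = rank_rows(SAMPLE)
for rank, model, score in ranked:
    print(rank, model, score)
```

Skipping entries without a numeric value, rather than treating them as zero, keeps unscored models out of the ranking instead of placing them last.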