RealWorldQA
RealWorldQA: Evaluates multimodal understanding across image, text, chart, diagram, or cross-modal reasoning tasks.
7rows
scoreprimary metric
2026-05-27sampled
Metadata
Metrics
RealWorldQA
| Rank | Subject | RealWorldQA | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Qwen3.5-9B | 80.3 | Qwen3.5-9B qwen-qwen3.5-9b | Imported | 2026-05-27 |
| 2 | Qwen3.5-4B | 79.5 | — | Imported | 2026-05-27 |
| 3 | InternVL3-78B | 78 | — | Imported | 2026-05-27 |
| 4 | Qwen3-VL-30B-A3B | 77.4 | — | Imported | 2026-05-27 |
| 5 | Gemini-2.5-FL-Lite | 72.2 | — | Imported | 2026-05-27 |
| 6 | GPT-5-Nano | 71.8 | GPT-5 Nano openai-gpt-5-nano | Imported | 2026-05-27 |
| 7 | Kimi-VL-A3B-Thinking | 70 | — | Imported | 2026-05-27 |
No matching rows.