RealWorldQA

RealWorldQA evaluates multimodal understanding across image, text, chart, diagram, and cross-modal reasoning tasks.

Rows: 7
Primary metric: score
Sampled: 2026-05-27

Metadata

Metrics

RealWorldQA

Latest Results

Rows are imported from direct RealWorldQA metric values in the public ALL-Bench JSON leaderboard.

Rank  Subject                RealWorldQA  Model Match                     Provenance  Sampled
1     Qwen3.5-9B             80.3         Qwen3.5-9B (qwen-qwen3.5-9b)    Imported    2026-05-27
2     Qwen3.5-4B             79.5         -                               Imported    2026-05-27
3     InternVL3-78B          78           -                               Imported    2026-05-27
4     Qwen3-VL-30B-A3B       77.4         -                               Imported    2026-05-27
5     Gemini-2.5-FL-Lite     72.2         -                               Imported    2026-05-27
6     GPT-5-Nano             71.8         GPT-5 Nano (openai-gpt-5-nano)  Imported    2026-05-27
7     Kimi-VL-A3B-Thinking   70           -                               Imported    2026-05-27
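
The import described above (taking direct RealWorldQA values from a JSON leaderboard and ranking them) could be sketched roughly as follows. This is a minimal illustration, not the site's actual importer: the JSON layout (`name`, `metrics`), the function `import_metric_rows`, and the `placeholder-no-score` entry are all assumptions made for the example, and the real ALL-Bench schema may differ. Only the three scores shown are taken from the table.

```python
import json

# Hypothetical JSON layout; the real ALL-Bench schema may differ.
# "placeholder-no-score" is an invented entry with no direct value.
SAMPLE_JSON = """
[
  {"name": "GPT-5-Nano", "metrics": {"RealWorldQA": 71.8}},
  {"name": "Qwen3.5-9B", "metrics": {"RealWorldQA": 80.3}},
  {"name": "placeholder-no-score", "metrics": {}},
  {"name": "InternVL3-78B", "metrics": {"RealWorldQA": 78}}
]
"""


def import_metric_rows(raw_json, metric, sampled):
    """Keep entries that report `metric` directly, then rank by score."""
    rows = []
    for entry in json.loads(raw_json):
        value = entry.get("metrics", {}).get(metric)
        # Skip entries without a numeric value for this metric.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            continue
        rows.append(
            {
                "subject": entry["name"],
                "score": float(value),
                "provenance": "Imported",
                "sampled": sampled,
            }
        )
    # Higher score ranks first.
    rows.sort(key=lambda row: row["score"], reverse=True)
    for rank, row in enumerate(rows, start=1):
        row["rank"] = rank
    return rows


rows = import_metric_rows(SAMPLE_JSON, "RealWorldQA", "2026-05-27")
for row in rows:
    print(row["rank"], row["subject"], row["score"], row["sampled"])
```

Entries with no direct value for the metric are dropped rather than given a default score, which matches the statement that rows come from direct metric values only.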