ALL Bench Multimodal

ALL Bench Multimodal aggregates cross-verified AI model scores across LLM, VLM, agent, image generation, video generation, and music generation categories in one unified benchmark file.

96rows
category_scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Category Score

Latest Results

Rows are parsed from the FINAL-Bench ALL Bench Leaderboard JSON. Rankings are within each published modality; generated-media letter grades are mapped to ordinal grade scores for display while preserving raw grades.

Rank Subject Category Score Model Match Provenance Sampled
1 Gemini 3.1 Pro 63.96 Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-06
2 Claude Opus 4.6 63.16 Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-06
3 GPT-5.2 62.59 GPT-5.2
openai-gpt-5.2
Imported 2026-05-06
4 Kimi K2.5 57.79 KIMI MoonshotAI: Kimi K2.5
moonshotai-kimi-k2.5
Imported 2026-05-06
5 Gemini 3 Flash 51.49 Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-06
6 GLM-5 44.06 GLM GLM 5
z-ai-glm-5
Imported 2026-05-06
7 Qwen3.5-397B 42.25 Imported 2026-05-06
8 Grok 4 Heavy 41.69 Imported 2026-05-06
9 MiniMax-M2.5 41.68 MiniMax M2.5
minimax-minimax-m2.5
Imported 2026-05-06
10 Qwen3.5-122B 38.89 Imported 2026-05-06
11 DeepSeek V3.2 38.42 DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-06
12 GPT-5.3 Codex 37.34 GPT-5.3-Codex
openai-gpt-5.3-codex
Imported 2026-05-06
13 DeepSeek R1 35.21 R1
deepseek-r1
Imported 2026-05-06
14 Llama 4 Maverick 35.19 Llama 4 Maverick
meta-llama-4-maverick
Imported 2026-05-06
15 Qwen3.5-9B 34.19 Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-06
16 Claude Sonnet 4.6 32.53 Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-06
17 Claude Sonnet 4.5 30.89 Claude Sonnet 4.5
anthropic-claude-sonnet-4.5
Imported 2026-05-06
18 GPT-OSS-120B 30.67 gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-06
19 DeepSeek R2 30.63 Imported 2026-05-06
20 Claude Haiku 4.5 30.14 Claude Haiku 4.5
anthropic-claude-haiku-4.5
Imported 2026-05-06
21 Qwen3-Next-80B 29.90 Imported 2026-05-06
22 Gemini 3 Pro 29.76 Gemini 3
google-gemini-3
Imported 2026-05-06
23 Mistral Large 3 28.64 Imported 2026-05-06
24 Qwen3.5-4B 28.49 Imported 2026-05-06
25 Llama 4 Scout 27.51 Llama 4 Scout
meta-llama-llama-4-scout
Imported 2026-05-06
26 Qwen3.5-27B 25.86 Qwen3.5-27B
qwen-qwen3.5-27b
Imported 2026-05-06
27 Solar Open 100B 25 Imported 2026-05-06
28 K-EXAONE 24.26 Imported 2026-05-06
29 GPT-OSS-20B 23.61 gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-06
30 GPT-5.1 21.43 GPT-5.1
openai-gpt-5.1
Imported 2026-05-06
31 Step-3.5-Flash 20.61 S Step 3.5 Flash
stepfun-step-3.5-flash
Imported 2026-05-06
32 Qwen3.5-35B 18.71 Imported 2026-05-06
33 GPT-5.4 18.39 GPT-5.4
openai-gpt-5.4
Imported 2026-05-06
34 Grok 4.1 Fast 17.52 GROK Grok 4.1 Fast
x-ai-grok-4.1-fast
Imported 2026-05-06
35 Nanbeige4.1-3B 15.32 Imported 2026-05-06
36 Phi-4 15.21 Phi 4
microsoft-phi-4
Imported 2026-05-06
37 Qwen3.5-Flash 5 Qwen3.5-Flash
qwen-qwen3.5-flash-02-23
Imported 2026-05-06
38 A.X K1 4.13 Imported 2026-05-06
39 Motif AI 3.67 Imported 2026-05-06
40 Gemini 2.5 FL-Lite 0 Imported 2026-05-06
41 GPT-5-Nano 0 GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-06
42 Mi:dm 2.0 Base 0 Imported 2026-05-06
1 InternVL3-78B 55.82 Imported 2026-05-06
2 Kimi-VL-A3B-Thinking 41.52 Imported 2026-05-06
3 Qwen2.5-VL-72B 21.58 Qwen2.5 VL 72B Instruct
qwen-qwen2.5-vl-72b-instruct
Imported 2026-05-06
4 Gemini 3 Flash 16.76 Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-06
5 Gemini 3 Pro 16.75 Gemini 3
google-gemini-3
Imported 2026-05-06
6 GPT-5.2 8.67 GPT-5.2
openai-gpt-5.2
Imported 2026-05-06
7 Claude Opus 4.6 8.51 Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-06
8 GPT-5 (original) 8.42 GPT-5
openai-gpt-5
Imported 2026-05-06
9 Gemini 3.1 Pro 8.20 Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-06
10 InternVL3.5-241B 7.77 Imported 2026-05-06
11 Grok 4 Heavy 7.65 Imported 2026-05-06
1 Qwen3.5-9B 69.30 Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-06
2 Qwen3.5-4B 66.48 Imported 2026-05-06
3 Qwen3-VL-30B-A3B 62.70 Imported 2026-05-06
4 Gemini-2.5-FL-Lite 52.14 Imported 2026-05-06
5 GPT-5-Nano 50.88 GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-06
1 Claude Opus 4.6 52.61 Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-06
2 Gemini 3.1 Pro 37.66 Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-06
3 GPT-5.2 32.88 GPT-5.2
openai-gpt-5.2
Imported 2026-05-06
4 GPT-5.4 30.09 GPT-5.4
openai-gpt-5.4
Imported 2026-05-06
5 Qwen3.5-9B 25.48 Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-06
6 Qwen3.5-4B 23.50 Imported 2026-05-06
7 Claude Sonnet 4.6 17.93 Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-06
8 GPT-5.3 Codex 16.79 GPT-5.3-Codex
openai-gpt-5.3-codex
Imported 2026-05-06
9 Gemini 3 Flash 8.04 Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-06
10 MiniMax-M2.5 5.28 MiniMax M2.5
minimax-minimax-m2.5
Imported 2026-05-06
1 Imagen 4 91 Imported 2026-05-06
2 Flux 2 Pro 89 Imported 2026-05-06
3 GPT Image 1.5 89 Imported 2026-05-06
4 DALL-E 3.5 85 Imported 2026-05-06
5 Flux 2 Dev 85 Imported 2026-05-06
6 Ideogram 3.0 85 Imported 2026-05-06
7 Nano Banana 2 85 Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google-gemini-3.1-flash-image-preview
Imported 2026-05-06
8 Midjourney v7 83 Imported 2026-05-06
9 Seedream 4.5 83 Imported 2026-05-06
10 SD 3.5 81 Imported 2026-05-06
1 Sora 2 92.50 Imported 2026-05-06
2 Veo 3.1 92.50 Imported 2026-05-06
3 Runway Gen-4.5 90 Imported 2026-05-06
4 Seedance 2.0 90 Imported 2026-05-06
5 Kling 3.0 85 Imported 2026-05-06
6 Wan 2.6 82.50 Imported 2026-05-06
7 LTX-2 80 Imported 2026-05-06
8 Luma Ray3 80 Imported 2026-05-06
9 HaiLuo AI 77.50 Imported 2026-05-06
10 Pika 2.5 77.50 Imported 2026-05-06
1 Suno v4.5 95 Imported 2026-05-06
2 Udio v2 90 Imported 2026-05-06
3 Gemini Music 85 Imported 2026-05-06
4 Loudme 82.50 Imported 2026-05-06
5 MusicGen Large 80 Imported 2026-05-06
6 Riffusion v2 77.50 Imported 2026-05-06
7 Stable Audio 2.0 77.50 Imported 2026-05-06
8 JASCO 72.50 Imported 2026-05-06