ALL Bench Multimodal
ALL Bench Multimodal aggregates cross-verified AI model scores across LLM, VLM, agent, image generation, video generation, and music generation categories in one unified benchmark file.
96rows
category_scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Category Score
| Rank | Subject | Category Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Gemini 3.1 Pro | 63.96 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 2 | Claude Opus 4.6 | 63.16 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 3 | GPT-5.2 | 62.59 | GPT-5.2 openai-gpt-5.2 | Imported | 2026-05-06 |
| 4 | Kimi K2.5 | 57.79 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 5 | Gemini 3 Flash | 51.49 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 6 | GLM-5 | 44.06 | GLM 5 z-ai-glm-5 | Imported | 2026-05-06 |
| 7 | Qwen3.5-397B | 42.25 | — | Imported | 2026-05-06 |
| 8 | Grok 4 Heavy | 41.69 | — | Imported | 2026-05-06 |
| 9 | MiniMax-M2.5 | 41.68 | MiniMax M2.5 minimax-minimax-m2.5 | Imported | 2026-05-06 |
| 10 | Qwen3.5-122B | 38.89 | — | Imported | 2026-05-06 |
| 11 | DeepSeek V3.2 | 38.42 | DeepSeek V3.2 deepseek-deepseek-v3.2 | Imported | 2026-05-06 |
| 12 | GPT-5.3 Codex | 37.34 | GPT-5.3-Codex openai-gpt-5.3-codex | Imported | 2026-05-06 |
| 13 | DeepSeek R1 | 35.21 | R1 deepseek-r1 | Imported | 2026-05-06 |
| 14 | Llama 4 Maverick | 35.19 | Llama 4 Maverick meta-llama-4-maverick | Imported | 2026-05-06 |
| 15 | Qwen3.5-9B | 34.19 | Qwen3.5-9B qwen-qwen3.5-9b | Imported | 2026-05-06 |
| 16 | Claude Sonnet 4.6 | 32.53 | Claude Sonnet 4.6 anthropic-claude-sonnet-4.6 | Imported | 2026-05-06 |
| 17 | Claude Sonnet 4.5 | 30.89 | Claude Sonnet 4.5 anthropic-claude-sonnet-4.5 | Imported | 2026-05-06 |
| 18 | GPT-OSS-120B | 30.67 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-06 |
| 19 | DeepSeek R2 | 30.63 | — | Imported | 2026-05-06 |
| 20 | Claude Haiku 4.5 | 30.14 | Claude Haiku 4.5 anthropic-claude-haiku-4.5 | Imported | 2026-05-06 |
| 21 | Qwen3-Next-80B | 29.90 | — | Imported | 2026-05-06 |
| 22 | Gemini 3 Pro | 29.76 | Gemini 3 google-gemini-3 | Imported | 2026-05-06 |
| 23 | Mistral Large 3 | 28.64 | — | Imported | 2026-05-06 |
| 24 | Qwen3.5-4B | 28.49 | — | Imported | 2026-05-06 |
| 25 | Llama 4 Scout | 27.51 | Llama 4 Scout meta-llama-llama-4-scout | Imported | 2026-05-06 |
| 26 | Qwen3.5-27B | 25.86 | Qwen3.5-27B qwen-qwen3.5-27b | Imported | 2026-05-06 |
| 27 | Solar Open 100B | 25 | — | Imported | 2026-05-06 |
| 28 | K-EXAONE | 24.26 | — | Imported | 2026-05-06 |
| 29 | GPT-OSS-20B | 23.61 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-06 |
| 30 | GPT-5.1 | 21.43 | GPT-5.1 openai-gpt-5.1 | Imported | 2026-05-06 |
| 31 | Step-3.5-Flash | 20.61 | Step 3.5 Flash stepfun-step-3.5-flash | Imported | 2026-05-06 |
| 32 | Qwen3.5-35B | 18.71 | — | Imported | 2026-05-06 |
| 33 | GPT-5.4 | 18.39 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 34 | Grok 4.1 Fast | 17.52 | Grok 4.1 Fast x-ai-grok-4.1-fast | Imported | 2026-05-06 |
| 35 | Nanbeige4.1-3B | 15.32 | — | Imported | 2026-05-06 |
| 36 | Phi-4 | 15.21 | Phi 4 microsoft-phi-4 | Imported | 2026-05-06 |
| 37 | Qwen3.5-Flash | 5 | Qwen3.5-Flash qwen-qwen3.5-flash-02-23 | Imported | 2026-05-06 |
| 38 | A.X K1 | 4.13 | — | Imported | 2026-05-06 |
| 39 | Motif AI | 3.67 | — | Imported | 2026-05-06 |
| 40 | Gemini 2.5 FL-Lite | 0 | — | Imported | 2026-05-06 |
| 41 | GPT-5-Nano | 0 | GPT-5 Nano openai-gpt-5-nano | Imported | 2026-05-06 |
| 42 | Mi:dm 2.0 Base | 0 | — | Imported | 2026-05-06 |
| 1 | InternVL3-78B | 55.82 | — | Imported | 2026-05-06 |
| 2 | Kimi-VL-A3B-Thinking | 41.52 | — | Imported | 2026-05-06 |
| 3 | Qwen2.5-VL-72B | 21.58 | Qwen2.5 VL 72B Instruct qwen-qwen2.5-vl-72b-instruct | Imported | 2026-05-06 |
| 4 | Gemini 3 Flash | 16.76 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 5 | Gemini 3 Pro | 16.75 | Gemini 3 google-gemini-3 | Imported | 2026-05-06 |
| 6 | GPT-5.2 | 8.67 | GPT-5.2 openai-gpt-5.2 | Imported | 2026-05-06 |
| 7 | Claude Opus 4.6 | 8.51 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 8 | GPT-5 (original) | 8.42 | GPT-5 openai-gpt-5 | Imported | 2026-05-06 |
| 9 | Gemini 3.1 Pro | 8.20 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 10 | InternVL3.5-241B | 7.77 | — | Imported | 2026-05-06 |
| 11 | Grok 4 Heavy | 7.65 | — | Imported | 2026-05-06 |
| 1 | Qwen3.5-9B | 69.30 | Qwen3.5-9B qwen-qwen3.5-9b | Imported | 2026-05-06 |
| 2 | Qwen3.5-4B | 66.48 | — | Imported | 2026-05-06 |
| 3 | Qwen3-VL-30B-A3B | 62.70 | — | Imported | 2026-05-06 |
| 4 | Gemini-2.5-FL-Lite | 52.14 | — | Imported | 2026-05-06 |
| 5 | GPT-5-Nano | 50.88 | GPT-5 Nano openai-gpt-5-nano | Imported | 2026-05-06 |
| 1 | Claude Opus 4.6 | 52.61 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 2 | Gemini 3.1 Pro | 37.66 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 3 | GPT-5.2 | 32.88 | GPT-5.2 openai-gpt-5.2 | Imported | 2026-05-06 |
| 4 | GPT-5.4 | 30.09 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 5 | Qwen3.5-9B | 25.48 | Qwen3.5-9B qwen-qwen3.5-9b | Imported | 2026-05-06 |
| 6 | Qwen3.5-4B | 23.50 | — | Imported | 2026-05-06 |
| 7 | Claude Sonnet 4.6 | 17.93 | Claude Sonnet 4.6 anthropic-claude-sonnet-4.6 | Imported | 2026-05-06 |
| 8 | GPT-5.3 Codex | 16.79 | GPT-5.3-Codex openai-gpt-5.3-codex | Imported | 2026-05-06 |
| 9 | Gemini 3 Flash | 8.04 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 10 | MiniMax-M2.5 | 5.28 | MiniMax M2.5 minimax-minimax-m2.5 | Imported | 2026-05-06 |
| 1 | Imagen 4 | 91 | — | Imported | 2026-05-06 |
| 2 | Flux 2 Pro | 89 | — | Imported | 2026-05-06 |
| 3 | GPT Image 1.5 | 89 | — | Imported | 2026-05-06 |
| 4 | DALL-E 3.5 | 85 | — | Imported | 2026-05-06 |
| 5 | Flux 2 Dev | 85 | — | Imported | 2026-05-06 |
| 6 | Ideogram 3.0 | 85 | — | Imported | 2026-05-06 |
| 7 | Nano Banana 2 | 85 | Nano Banana 2 (Gemini 3.1 Flash Image Preview) google-gemini-3.1-flash-image-preview | Imported | 2026-05-06 |
| 8 | Midjourney v7 | 83 | — | Imported | 2026-05-06 |
| 9 | Seedream 4.5 | 83 | — | Imported | 2026-05-06 |
| 10 | SD 3.5 | 81 | — | Imported | 2026-05-06 |
| 1 | Sora 2 | 92.50 | — | Imported | 2026-05-06 |
| 2 | Veo 3.1 | 92.50 | — | Imported | 2026-05-06 |
| 3 | Runway Gen-4.5 | 90 | — | Imported | 2026-05-06 |
| 4 | Seedance 2.0 | 90 | — | Imported | 2026-05-06 |
| 5 | Kling 3.0 | 85 | — | Imported | 2026-05-06 |
| 6 | Wan 2.6 | 82.50 | — | Imported | 2026-05-06 |
| 7 | LTX-2 | 80 | — | Imported | 2026-05-06 |
| 8 | Luma Ray3 | 80 | — | Imported | 2026-05-06 |
| 9 | HaiLuo AI | 77.50 | — | Imported | 2026-05-06 |
| 10 | Pika 2.5 | 77.50 | — | Imported | 2026-05-06 |
| 1 | Suno v4.5 | 95 | — | Imported | 2026-05-06 |
| 2 | Udio v2 | 90 | — | Imported | 2026-05-06 |
| 3 | Gemini Music | 85 | — | Imported | 2026-05-06 |
| 4 | Loudme | 82.50 | — | Imported | 2026-05-06 |
| 5 | MusicGen Large | 80 | — | Imported | 2026-05-06 |
| 6 | Riffusion v2 | 77.50 | — | Imported | 2026-05-06 |
| 7 | Stable Audio 2.0 | 77.50 | — | Imported | 2026-05-06 |
| 8 | JASCO | 72.50 | — | Imported | 2026-05-06 |
No matching rows.