Video-MME v2
Video understanding benchmark with 800 videos and 3,200 multiple-choice QA items spanning retrieval, temporal understanding, and complex reasoning.
4rows
scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Video-MME v2 score
| Rank | Subject | Video-MME v2 score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | moonshotai/Kimi-K2.5 | 61.10 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 2 | Qwen/Qwen3.5-397B-A17B | 55.90 | Qwen3.5 397B A17B qwen-qwen3.5-397b-a17b | Imported | 2026-05-06 |
| 3 | moonshotai/Kimi-K2.5 | 54.40 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 4 | Qwen/Qwen3.5-397B-A17B | 48.90 | Qwen3.5 397B A17B qwen-qwen3.5-397b-a17b | Imported | 2026-05-06 |
No matching rows.