LongVideoBench

LongVideoBench is a question-answering benchmark featuring video-language interleaved inputs up to an hour long. It includes 3,763 varying-length web-collected videos with subtitles across diverse themes and 6,678 human-annotated multiple-choice questions in 17 fine-grained categories for comprehensive evaluation of long-term multimodal understanding.

Rows: 2
Primary metric: Score
Sampled: 2026-05-06

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank  Subject                 Score  Model Match                                   Provenance     Sampled
1     Kimi K2.5               0.80   MoonshotAI: Kimi K2.5 (moonshotai-kimi-k2.5)  Self-reported  2026-05-06
2     Qwen2.5 VL 7B Instruct  0.55   -                                             Self-reported  2026-05-06