Qwen3 VL 8B Thinking
Qwen / Qwen
27scores
27benchmarks
$0.117 / $1.365 per 1M tokenscost in/out
Metadata
Qwen Open source
Aliases: qwen-qwen3-vl-8b-thinking, qwen/qwen3-vl-8b-thinking, qwen3-vl-8b-thinking
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| OpenUGI | Alignment | 1126 | 16.80 | 2026-05-06 |
| From Perception to Action | Embodied AI | 13 | 8.3% | 2026-05-28 |
| Arena-Hard v2 | General Knowledge | 14 | 0.51 | 2026-05-06 |
| MMLU-ProX | General Knowledge | 19 | 0.71 | 2026-05-06 |
| MMLU-Redux | General Knowledge | 25 | 0.89 | 2026-05-06 |
| Multi-IF | Instruction Following | 9 | 0.75 | 2026-05-06 |
| MathVision | Intelligence | 41 | 59.60 | 2026-05-06 |
| PolyMATH | Mathematics | 13 | 0.47 | 2026-05-06 |
| BLINK | Multimodal | 3 | 0.69 | 2026-05-06 |
| CC-OCR | Multimodal | 16 | 0.76 | 2026-05-06 |
| CharadesSTA | Multimodal | 7 | 0.60 | 2026-05-06 |
| CharXiv-D | Multimodal | 7 | 0.86 | 2026-05-06 |
| CharXiv-R | Multimodal | 29 | 0.53 | 2026-05-06 |
| InfoVQAtest | Multimodal | 6 | 0.86 | 2026-05-06 |
| LVBench | Multimodal | 14 | 0.56 | 2026-05-06 |
| MM-MT-Bench | Multimodal | 7 | 8 | 2026-05-06 |
| MuirBench | Multimodal | 4 | 0.77 | 2026-05-06 |
| Physical AI Bench Understanding | Multimodal | 12 | 57.30 | 2026-05-06 |
| VideoMMMU | Multimodal | 19 | 0.73 | 2026-05-06 |
| OCRBench-V2 (en) | OCR | 6 | 0.64 | 2026-05-06 |
| OCRBench-V2 (zh) | OCR | 6 | 0.59 | 2026-05-06 |
| ERQA | Reasoning | 14 | 0.47 | 2026-05-06 |
| BFCL-v3 | Tool Use | 18 | 0.63 | 2026-05-06 |
| ODinW | Vision | 14 | 0.40 | 2026-05-06 |
| K-MetBench | Weather | 19 | 71.7% accuracy | 2026-05-28 |
| Creative Writing v3 | Writing | 12 | 0.82 | 2026-05-06 |
| WritingBench | Writing | 5 | 0.85 | 2026-05-06 |
No matching rows.