Qwen3.6 Plus
Qwen / Qwen
119scores
100benchmarks
$0.325 / $1.95 per 1M tokenscost in/out
Metadata
Qwen Open source
Aliases: qwen-qwen3.6-plus, qwen-qwen3.6-plus-04-02, qwen/qwen3.6-plus, qwen/qwen3.6-plus-04-02, qwen3.6-plus, qwen3.6-plus-04-02, Qwen3.6-Plus, Qwen3.6 Plus, Qwen 3.6 Plus, alibaba/qwen3.6-plus
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| AutoBench | Agentic | 8 | 3.07 | 2026-05-06 |
| Claw-Eval-Live | Agentic | 10 | 50.5 | 2026-05-27 |
| CoWorkBench | Agentic | 5 | 64.5% | 2026-05-28 |
| DeepPlanning | Agentic | 1 | 0.41 | 2026-05-06 |
| Gert Labs Rankings | Agentic | 17 | 0.53 | 2026-05-11 |
| MCP Atlas | Agentic | 3 | 74.1% | 2026-05-28 |
| MCPMark | Agentic | 6 | 48.2% | 2026-05-28 |
| OSWorld-Verified | Agentic | 8 | 0.63 | 2026-05-06 |
| PinchBench | Agentic | 61 | 0.64 | 2026-05-06 |
| QwenClawBench | Agentic | 5 | 57.2% | 2026-05-28 |
| QwenWorldBench | Agentic | 6 | 47.6% | 2026-05-28 |
| Tau2-Bench Telecom | Agentic | 7 | 97.7% | 2026-05-11 |
| TAU3-Bench | Agentic | 1 | 0.71 | 2026-05-06 |
| Terminal-Bench Hard | Agentic | 24 | 43.9% | 2026-05-11 |
| TERMS-Bench | Agentic | 8 | 60.4% SE+ | 2026-05-28 |
| TIR-Bench | Agentic | 1 | 0.62 | 2026-05-06 |
| Toolathlon | Agentic | 12 | 0.40 | 2026-05-06 |
| Vending-Bench 2 | Agentic | 12 | 5114.87 | 2026-05-28 |
| VitaBench | Agentic | 4 | 42.8% | 2026-05-28 |
| OpenUGI | Alignment | 268 | 43.51 | 2026-05-06 |
| OpenUGI | Alignment | 1023 | 22.94 | 2026-05-06 |
| ALE-Bench | Coding | 52 | 670.15 | 2026-05-06 |
| Arena AI Code | Coding | 13 | 1465 | 2026-05-06 |
| Claw-Eval | Coding | 6 | 57.1% | 2026-05-28 |
| Claw-Eval | Coding | 5 | 0.59 | 2026-05-06 |
| DeepSWE | Coding | 14 | 2.65 | 2026-05-26 |
| Kernel Bench L3 | Coding | 6 | 1.03/48% | 2026-05-28 |
| LiveCodeBench | Coding | 5 | 87.1% | 2026-05-28 |
| LiveCodeBench | Coding | 12 | 85.952% | 2026-05-28 |
| LMArena WebDev Arena | Coding | 13 | 1463.97 | 2026-05-06 |
| NL2Repo | Coding | 6 | 34.4% | 2026-05-28 |
| NL2Repo | Coding | 3 | 0.38 | 2026-05-06 |
| QwenSVG | Coding | 5 | 1432 | 2026-05-28 |
| QwenWebDev | Coding | 5 | 1500 | 2026-05-28 |
| SciCode | Coding | 5 | 41.4% | 2026-05-28 |
| SciCode | Coding | 85 | 40.7% | 2026-05-11 |
| SkillsBench | Coding | 5 | 45.7% | 2026-05-28 |
| SkillsBench | Coding | 2 | 0.46 | 2026-05-06 |
| SWE-bench Verified | Coding | 21 | 73.4% | 2026-05-28 |
| Terminal-Bench 2.0 | Coding | 24 | 44.944% | 2026-05-28 |
| Terminal-Bench 2.0 | Coding | 6 | 61.6% | 2026-05-28 |
| Terminal-Bench 2.1 | Coding | 10 | 53.184% | 2026-05-28 |
| Vibe Code Bench v1.1 | Coding | 18 | 25.565% | 2026-05-28 |
| OmniDocBench 1.5 | Document Understanding | 1 | 0.91 | 2026-05-06 |
| SAGE | Education | 19 | 44.86% | 2026-05-28 |
| CorpFin v2 | Finance | 32 | 61.927% | 2026-05-28 |
| Finance Agent v1.1 | Finance | 15 | 54.627% | 2026-05-04 |
| Finance Agent v2 | Finance | 13 | 40.846% | 2026-05-28 |
| MortgageTax | Finance | 15 | 67.965% | 2026-05-28 |
| TaxEval v2 | Finance | 16 | 74.734% | 2026-05-28 |
| InfiniteBM Heads-Up No-Limit Hold'em | Game | 7 | 1330.72 Elo / 14 games | 2026-05-28 |
| InfiniteBM Heads-Up No-Limit Hold'em | Game | 19 | 1143.23 Elo / 114 games | 2026-05-28 |
| InfiniteBM Liar's Dice | Game | 21 | 1185.82 Elo / 1714 games | 2026-05-28 |
| InfiniteBM Liar's Dice | Game | 34 | 877.72 Elo / 27 games | 2026-05-28 |
| BenchLM | General Knowledge | 29 | 73 | 2026-05-06 |
| MAXIFE | General Knowledge | 3 | 88.2% | 2026-05-28 |
| MAXIFE | General Knowledge | 1 | 0.88 | 2026-05-06 |
| MMLU-ProX | General Knowledge | 3 | 84.7% | 2026-05-28 |
| MMLU-ProX | General Knowledge | 1 | 0.85 | 2026-05-06 |
| MMLU-Redux | General Knowledge | 5 | 94.5% | 2026-05-28 |
| MMLU-Redux | General Knowledge | 2 | 0.94 | 2026-05-06 |
| NOVA-63 | General Knowledge | 3 | 57.9% | 2026-05-28 |
| NOVA-63 | General Knowledge | 4 | 0.58 | 2026-05-06 |
| MedCode | Healthcare | 38 | 36.894% | 2026-05-28 |
| MedScribe | Healthcare | 32 | 76.963% | 2026-05-28 |
| PhysicianBench | Healthcare | 9 | 13.7 +/- 4.0 | 2026-05-27 |
| IFBench | Instruction Following | 5 | 74.2% | 2026-05-28 |
| IFBench | Instruction Following | 5 | 0.74 | 2026-05-06 |
| IFEval | Instruction Following | 3 | 94.3% | 2026-05-28 |
| Artificial Analysis Intelligence Index | Intelligence | 20 | 49.98 | 2026-05-11 |
| GPQA Diamond | Intelligence | 19 | 87.374% | 2026-05-28 |
| HLE w/ tools | Intelligence | 5 | 50.2% | 2026-05-28 |
| Humanity's Last Exam | Intelligence | 6 | 28.8% | 2026-05-28 |
| Humanity's Last Exam | Intelligence | 43 | 25.7% | 2026-05-11 |
| LiveBench | Intelligence | 28 | 70.77 | 2026-05-05 |
| MMLU Pro | Intelligence | 11 | 87.668% | 2026-05-28 |
| MMLU-Pro | Intelligence | 3 | 88.5% | 2026-05-28 |
| MMMU Pro | Intelligence | 13 | 84.162% | 2026-05-28 |
| SuperGPQA | Intelligence | 3 | 71.6% | 2026-05-28 |
| Vals Index | Intelligence | 13 | 48.039% | 2026-05-28 |
| Vals Multimodal Index | Intelligence | 10 | 50.737% | 2026-05-28 |
| CaseLaw v2 | Legal | 49 | 51.447% | 2026-05-04 |
| LegalBench | Legal | 18 | 84.233% | 2026-05-28 |
| AA-LCR | Long Context | 4 | 0.68 | 2026-05-06 |
| MRCR-v2 128k | Long Context | 2 | 85.9% | 2026-05-28 |
| AIME | Math | 13 | 94.583% | 2026-04-16 |
| DynaMath | Mathematics | 1 | 0.88 | 2026-05-06 |
| HMMT 2025 | Mathematics | 5 | 0.97 | 2026-05-06 |
| HMMT February 2026 | Mathematics | 6 | 87.8% | 2026-05-28 |
| IMO-AnswerBench | Mathematics | 5 | 83.8% | 2026-05-28 |
| IMO-AnswerBench | Mathematics | 5 | 0.84 | 2026-05-06 |
| MathArena Apex | Mathematics | 6 | 8.8% | 2026-05-28 |
| PolyMATH | Mathematics | 1 | 0.77 | 2026-05-06 |
| INCLUDE | Multilingual | 4 | 85.1% | 2026-05-28 |
| MMMLU | Multilingual | 3 | 89.5% | 2026-05-28 |
| CC-OCR | Multimodal | 1 | 0.83 | 2026-05-06 |
| CharXiv-R | Multimodal | 6 | 0.81 | 2026-05-06 |
| Design Arena | Multimodal | 21 | 1285 | 2026-05-06 |
| MLVU | Multimodal | 2 | 0.87 | 2026-05-06 |
| SimpleVQA | Multimodal | 4 | 0.67 | 2026-05-06 |
| VideoMMMU | Multimodal | 8 | 0.84 | 2026-05-06 |
| ERQA | Reasoning | 1 | 0.66 | 2026-05-06 |
| Global PIQA | Reasoning | 4 | 89.8% | 2026-05-28 |
| Global PIQA | Reasoning | 3 | 0.90 | 2026-05-06 |
| GPQA Diamond | Reasoning | 4 | 90.4% | 2026-05-28 |
| GPQA Diamond | Reasoning | 26 | 88.2% | 2026-05-11 |
| CritPt | Science | 6 | 2.9% | 2026-05-28 |
| CritPt | Science | 52 | 2.9% | 2026-05-11 |
| WideSearch | Search | 3 | 0.74 | 2026-05-06 |
| SWE-bench Multilingual | Software Engineering | 5 | 73.8% | 2026-05-28 |
| SWE-bench Pro | Software Engineering | 6 | 56.6% | 2026-05-28 |
| SWE-bench Verified | Software Engineering | 5 | 78.8% | 2026-05-28 |
| CountBench | Spatial Reasoning | 4 | 0.98 | 2026-05-06 |
| RefCOCO-avg | Spatial Reasoning | 1 | 0.94 | 2026-05-06 |
| SpreadsheetBench | Spreadsheets | 6 | 80.2% | 2026-05-28 |
| BFCL-V4 | Tool Use | 6 | 68.9% | 2026-05-28 |
| WMT24++ | Translation | 2 | 84.3% | 2026-05-28 |
| WMT24++ | Translation | 3 | 0.84 | 2026-05-06 |
| ODinW | Vision | 1 | 0.52 | 2026-05-06 |
No matching rows.