Qwen3.6 Plus

Qwen / Qwen

119 scores
100 benchmarks
$0.325 / $1.95 per 1M tokens (cost in/out)
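
As a rough illustration of what the listed prices mean for one request, the sketch below turns token counts into a dollar estimate. It assumes that "in/out" refers to input and output tokens and that billing is linear per token, with no tiering, caching discounts, or minimum charges; the listing does not confirm any of this. The function name `estimate_cost_usd` and the token counts are hypothetical.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.325
OUTPUT_USD_PER_MILLION = 1.95


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical request: 200k input tokens and 50k output tokens.
cost = estimate_cost_usd(200_000, 50_000)
print(f"${cost:.4f}")  # prints $0.1625
```

With these prices an output token costs six times as much as an input token, so long generations dominate the estimate.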

Metadata

Qwen Open source

Aliases: qwen-qwen3.6-plus, qwen-qwen3.6-plus-04-02, qwen/qwen3.6-plus, qwen/qwen3.6-plus-04-02, qwen3.6-plus, qwen3.6-plus-04-02, Qwen3.6-Plus, Qwen3.6 Plus, Qwen 3.6 Plus, alibaba/qwen3.6-plus

Benchmark Results

Benchmark | Category | Rank | Score | Sampled
AutoBench | Agentic | 8 | 3.07 | 2026-05-06
Claw-Eval-Live | Agentic | 10 | 50.5 | 2026-05-27
CoWorkBench | Agentic | 5 | 64.5% | 2026-05-28
DeepPlanning | Agentic | 1 | 0.41 | 2026-05-06
Gert Labs Rankings | Agentic | 17 | 0.53 | 2026-05-11
MCP Atlas | Agentic | 3 | 74.1% | 2026-05-28
MCPMark | Agentic | 6 | 48.2% | 2026-05-28
OSWorld-Verified | Agentic | 8 | 0.63 | 2026-05-06
PinchBench | Agentic | 61 | 0.64 | 2026-05-06
QwenClawBench | Agentic | 5 | 57.2% | 2026-05-28
QwenWorldBench | Agentic | 6 | 47.6% | 2026-05-28
Tau2-Bench Telecom | Agentic | 7 | 97.7% | 2026-05-11
TAU3-Bench | Agentic | 1 | 0.71 | 2026-05-06
Terminal-Bench Hard | Agentic | 24 | 43.9% | 2026-05-11
TERMS-Bench | Agentic | 8 | 60.4% SE+ | 2026-05-28
TIR-Bench | Agentic | 1 | 0.62 | 2026-05-06
Toolathlon | Agentic | 12 | 0.40 | 2026-05-06
Vending-Bench 2 | Agentic | 12 | 5114.87 | 2026-05-28
VitaBench | Agentic | 4 | 42.8% | 2026-05-28
OpenUGI | Alignment | 268 | 43.51 | 2026-05-06
OpenUGI | Alignment | 1023 | 22.94 | 2026-05-06
ALE-Bench | Coding | 52 | 670.15 | 2026-05-06
Arena AI Code | Coding | 13 | 1465 | 2026-05-06
Claw-Eval | Coding | 6 | 57.1% | 2026-05-28
Claw-Eval | Coding | 5 | 0.59 | 2026-05-06
DeepSWE | Coding | 14 | 2.65 | 2026-05-26
Kernel Bench L3 | Coding | 6 | 1.03/48% | 2026-05-28
LiveCodeBench | Coding | 5 | 87.1% | 2026-05-28
LiveCodeBench | Coding | 12 | 85.952% | 2026-05-28
LMArena WebDev Arena | Coding | 13 | 1463.97 | 2026-05-06
NL2Repo | Coding | 6 | 34.4% | 2026-05-28
NL2Repo | Coding | 3 | 0.38 | 2026-05-06
QwenSVG | Coding | 5 | 1432 | 2026-05-28
QwenWebDev | Coding | 5 | 1500 | 2026-05-28
SciCode | Coding | 5 | 41.4% | 2026-05-28
SciCode | Coding | 85 | 40.7% | 2026-05-11
SkillsBench | Coding | 5 | 45.7% | 2026-05-28
SkillsBench | Coding | 2 | 0.46 | 2026-05-06
SWE-bench Verified | Coding | 21 | 73.4% | 2026-05-28
Terminal-Bench 2.0 | Coding | 24 | 44.944% | 2026-05-28
Terminal-Bench 2.0 | Coding | 6 | 61.6% | 2026-05-28
Terminal-Bench 2.1 | Coding | 10 | 53.184% | 2026-05-28
Vibe Code Bench v1.1 | Coding | 18 | 25.565% | 2026-05-28
OmniDocBench 1.5 | Document Understanding | 1 | 0.91 | 2026-05-06
SAGE | Education | 19 | 44.86% | 2026-05-28
CorpFin v2 | Finance | 32 | 61.927% | 2026-05-28
Finance Agent v1.1 | Finance | 15 | 54.627% | 2026-05-04
Finance Agent v2 | Finance | 13 | 40.846% | 2026-05-28
MortgageTax | Finance | 15 | 67.965% | 2026-05-28
TaxEval v2 | Finance | 16 | 74.734% | 2026-05-28
InfiniteBM Heads-Up No-Limit Hold'em | Game | 7 | 1330.72 Elo / 14 games | 2026-05-28
InfiniteBM Heads-Up No-Limit Hold'em | Game | 19 | 1143.23 Elo / 114 games | 2026-05-28
InfiniteBM Liar's Dice | Game | 21 | 1185.82 Elo / 1714 games | 2026-05-28
InfiniteBM Liar's Dice | Game | 34 | 877.72 Elo / 27 games | 2026-05-28
BenchLM | General Knowledge | 29 | 73 | 2026-05-06
MAXIFE | General Knowledge | 3 | 88.2% | 2026-05-28
MAXIFE | General Knowledge | 1 | 0.88 | 2026-05-06
MMLU-ProX | General Knowledge | 3 | 84.7% | 2026-05-28
MMLU-ProX | General Knowledge | 1 | 0.85 | 2026-05-06
MMLU-Redux | General Knowledge | 5 | 94.5% | 2026-05-28
MMLU-Redux | General Knowledge | 2 | 0.94 | 2026-05-06
NOVA-63 | General Knowledge | 3 | 57.9% | 2026-05-28
NOVA-63 | General Knowledge | 4 | 0.58 | 2026-05-06
MedCode | Healthcare | 38 | 36.894% | 2026-05-28
MedScribe | Healthcare | 32 | 76.963% | 2026-05-28
PhysicianBench | Healthcare | 9 | 13.7 +/- 4.0 | 2026-05-27
IFBench | Instruction Following | 5 | 74.2% | 2026-05-28
IFBench | Instruction Following | 5 | 0.74 | 2026-05-06
IFEval | Instruction Following | 3 | 94.3% | 2026-05-28
Artificial Analysis Intelligence Index | Intelligence | 20 | 49.98 | 2026-05-11
GPQA Diamond | Intelligence | 19 | 87.374% | 2026-05-28
HLE w/ tools | Intelligence | 5 | 50.2% | 2026-05-28
Humanity's Last Exam | Intelligence | 6 | 28.8% | 2026-05-28
Humanity's Last Exam | Intelligence | 43 | 25.7% | 2026-05-11
LiveBench | Intelligence | 28 | 70.77 | 2026-05-05
MMLU Pro | Intelligence | 11 | 87.668% | 2026-05-28
MMLU-Pro | Intelligence | 3 | 88.5% | 2026-05-28
MMMU Pro | Intelligence | 13 | 84.162% | 2026-05-28
SuperGPQA | Intelligence | 3 | 71.6% | 2026-05-28
Vals Index | Intelligence | 13 | 48.039% | 2026-05-28
Vals Multimodal Index | Intelligence | 10 | 50.737% | 2026-05-28
CaseLaw v2 | Legal | 49 | 51.447% | 2026-05-04
LegalBench | Legal | 18 | 84.233% | 2026-05-28
AA-LCR | Long Context | 4 | 0.68 | 2026-05-06
MRCR-v2 128k | Long Context | 2 | 85.9% | 2026-05-28
AIME | Math | 13 | 94.583% | 2026-04-16
DynaMath | Mathematics | 1 | 0.88 | 2026-05-06
HMMT 2025 | Mathematics | 5 | 0.97 | 2026-05-06
HMMT February 2026 | Mathematics | 6 | 87.8% | 2026-05-28
IMO-AnswerBench | Mathematics | 5 | 83.8% | 2026-05-28
IMO-AnswerBench | Mathematics | 5 | 0.84 | 2026-05-06
MathArena Apex | Mathematics | 6 | 8.8% | 2026-05-28
PolyMATH | Mathematics | 1 | 0.77 | 2026-05-06
INCLUDE | Multilingual | 4 | 85.1% | 2026-05-28
MMMLU | Multilingual | 3 | 89.5% | 2026-05-28
CC-OCR | Multimodal | 1 | 0.83 | 2026-05-06
CharXiv-R | Multimodal | 6 | 0.81 | 2026-05-06
Design Arena | Multimodal | 21 | 1285 | 2026-05-06
MLVU | Multimodal | 2 | 0.87 | 2026-05-06
SimpleVQA | Multimodal | 4 | 0.67 | 2026-05-06
VideoMMMU | Multimodal | 8 | 0.84 | 2026-05-06
ERQA | Reasoning | 1 | 0.66 | 2026-05-06
Global PIQA | Reasoning | 4 | 89.8% | 2026-05-28
Global PIQA | Reasoning | 3 | 0.90 | 2026-05-06
GPQA Diamond | Reasoning | 4 | 90.4% | 2026-05-28
GPQA Diamond | Reasoning | 26 | 88.2% | 2026-05-11
CritPt | Science | 6 | 2.9% | 2026-05-28
CritPt | Science | 52 | 2.9% | 2026-05-11
WideSearch | Search | 3 | 0.74 | 2026-05-06
SWE-bench Multilingual | Software Engineering | 5 | 73.8% | 2026-05-28
SWE-bench Pro | Software Engineering | 6 | 56.6% | 2026-05-28
SWE-bench Verified | Software Engineering | 5 | 78.8% | 2026-05-28
CountBench | Spatial Reasoning | 4 | 0.98 | 2026-05-06
RefCOCO-avg | Spatial Reasoning | 1 | 0.94 | 2026-05-06
SpreadsheetBench | Spreadsheets | 6 | 80.2% | 2026-05-28
BFCL-V4 | Tool Use | 6 | 68.9% | 2026-05-28
WMT24++ | Translation | 2 | 84.3% | 2026-05-28
WMT24++ | Translation | 3 | 0.84 | 2026-05-06
ODinW | Vision | 1 | 0.52 | 2026-05-06
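
Several benchmarks in the table above appear twice with scores on different scales, for example MAXIFE at 88.2% and at 0.88. The sketch below shows one way to put such score cells on a common scale before comparing them. It assumes that a percentage and a 0-1 fraction for the same benchmark measure the same metric, which the listing does not confirm, and it keeps only the leading number of compound cells such as `1330.72 Elo / 14 games`. The helper name `leading_value` is hypothetical.

```python
import re
from typing import Optional


def leading_value(score: str) -> Optional[float]:
    """Return the first number in a score cell; percentages are scaled to 0-1.

    Anything after the leading number (units, error bars, game counts) is
    ignored. Non-percentage values such as Elo ratings are returned unchanged.
    """
    match = re.match(r"\s*(-?\d+(?:\.\d+)?)\s*(%?)", score)
    if match is None:
        return None
    value = float(match.group(1))
    return value / 100 if match.group(2) == "%" else value


# Score cells copied from the table above.
for cell in ["88.2%", "0.88", "60.4% SE+", "13.7 +/- 4.0", "1330.72 Elo / 14 games"]:
    print(cell, "->", leading_value(cell))
```

Unscaled values such as Elo ratings or index scores stay on their own scale, so they are only comparable with other rows of the same benchmark.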