Qwen3.7 Max
Qwen / Qwen
57 scores
56 benchmarks
Cost in/out: not listed
Metadata
Qwen Closed/API
Aliases: qwen-qwen3.7-max, qwen/qwen3.7-max, qwen3.7-max, qwen3.7 max, Qwen3.7-Max, Qwen3.7 Max, Qwen 3.7 Max, alibaba-qwen3.7-max, alibaba/qwen3.7-max
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| CoWorkBench | Agentic | 2 | 67.2% | 2026-05-28 |
| ITBench-AA | Agentic | 3 | 42.5% | 2026-05-28 |
| MCP Atlas | Agentic | 1 | 76.4% | 2026-05-28 |
| MCPMark | Agentic | 1 | 60.8% | 2026-05-28 |
| QwenClawBench | Agentic | 2 | 64.3% | 2026-05-28 |
| QwenWorldBench | Agentic | 1 | 57.3% | 2026-05-28 |
| VitaBench | Agentic | 2 | 47.9% | 2026-05-28 |
| Claw-Eval | Coding | 2 | 65.2% | 2026-05-28 |
| IOI | Coding | 4 | 46.75% | 2026-05-26 |
| Kernel Bench L3 | Coding | 3 | 1.98/96% | 2026-05-28 |
| LiveCodeBench | Coding | 2 | 91.6% | 2026-05-28 |
| LiveCodeBench | Coding | 7 | 87.057% | 2026-05-28 |
| NL2Repo | Coding | 2 | 47.2% | 2026-05-28 |
| QwenSVG | Coding | 1 | 1608 | 2026-05-28 |
| QwenWebDev | Coding | 3 | 1568 | 2026-05-28 |
| SciCode | Coding | 1 | 53.5% | 2026-05-28 |
| SkillsBench | Coding | 1 | 59.2% | 2026-05-28 |
| SWE-bench Verified | Coding | 36 | 68.8% | 2026-05-28 |
| Terminal-Bench 2.0 | Coding | 9 | 59.176% | 2026-05-28 |
| Terminal-Bench 2.0 | Coding | 1 | 69.7% | 2026-05-28 |
| Terminal-Bench 2.1 | Coding | 6 | 61.049% | 2026-05-28 |
| Vibe Code Bench v1.1 | Coding | 36 | 11.418% | 2026-05-28 |
| CorpFin v2 | Finance | 24 | 63.714% | 2026-05-28 |
| Finance Agent v2 | Finance | 6 | 48.353% | 2026-05-28 |
| TaxEval v2 | Finance | 8 | 75.306% | 2026-05-28 |
| MAXIFE | General Knowledge | 1 | 89.2% | 2026-05-28 |
| MMLU-ProX | General Knowledge | 1 | 87% | 2026-05-28 |
| MMLU-Redux | General Knowledge | 3 | 95% | 2026-05-28 |
| NOVA-63 | General Knowledge | 2 | 59% | 2026-05-28 |
| MedCode | Healthcare | 33 | 38.751% | 2026-05-28 |
| MedScribe | Healthcare | 22 | 79.396% | 2026-05-28 |
| IFBench | Instruction Following | 1 | 79.1% | 2026-05-28 |
| IFEval | Instruction Following | 4 | 94.3% | 2026-05-28 |
| GPQA Diamond | Intelligence | 9 | 90.152% | 2026-05-28 |
| HLE w/ tools | Intelligence | 2 | 53.5% | 2026-05-28 |
| Humanity's Last Exam | Intelligence | 1 | 41.4% | 2026-05-28 |
| MMLU Pro | Intelligence | 6 | 89.311% | 2026-05-28 |
| MMLU-Pro | Intelligence | 2 | 89.6% | 2026-05-28 |
| SuperGPQA | Intelligence | 1 | 73.6% | 2026-05-28 |
| Vals Index | Intelligence | 6 | 57.294% | 2026-05-28 |
| LegalBench | Legal | 11 | 84.913% | 2026-05-28 |
| MRCR-v2 128k | Long Context | 1 | 90.4% | 2026-05-28 |
| ProofBench | Math | 10 | 26% | 2026-05-28 |
| HMMT February 2026 | Mathematics | 1 | 97.1% | 2026-05-28 |
| IMO-AnswerBench | Mathematics | 1 | 90% | 2026-05-28 |
| MathArena Apex | Mathematics | 1 | 44.5% | 2026-05-28 |
| INCLUDE | Multilingual | 2 | 86.2% | 2026-05-28 |
| MMMLU | Multilingual | 2 | 90.3% | 2026-05-28 |
| Global PIQA | Reasoning | 1 | 91.4% | 2026-05-28 |
| GPQA Diamond | Reasoning | 1 | 92.4% | 2026-05-28 |
| CritPt | Science | 3 | 11.4% | 2026-05-28 |
| SWE-bench Multilingual | Software Engineering | 1 | 78.3% | 2026-05-28 |
| SWE-bench Pro | Software Engineering | 1 | 60.6% | 2026-05-28 |
| SWE-bench Verified | Software Engineering | 3 | 80.4% | 2026-05-28 |
| SpreadsheetBench | Spreadsheets | 2 | 87% | 2026-05-28 |
| BFCL-V4 | Tool Use | 2 | 75% | 2026-05-28 |
| WMT24++ | Translation | 1 | 85.8% | 2026-05-28 |