Qwen3 235B A22B Thinking 2507
Qwen / Qwen
27 scores
24 benchmarks
$0.1495 / $1.495 per 1M tokens (cost in/out)
Metadata
Qwen Open source
Aliases: qwen-qwen3-235b-a22b-thinking-2507, qwen/qwen3-235b-a22b-thinking-2507, qwen3-235b-a22b-thinking-2507
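
The listed input/output prices translate into a per-request cost with simple arithmetic. The sketch below is a minimal illustration, not an official pricing calculator: the function name `estimate_cost` and the example token counts are hypothetical, and it assumes that reasoning ("thinking") tokens are billed at the output rate, which this page does not state.

```python
# Listed prices in USD per 1M tokens, copied from the header above.
INPUT_PRICE_PER_M = 0.1495
OUTPUT_PRICE_PER_M = 1.495


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed per-1M-token prices.

    Assumption: any reasoning tokens are counted in output_tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 8,000 generated tokens.
print(f"{estimate_cost(2_000, 8_000):.6f}")  # prints 0.012259
```

Because the output price is ten times the input price, long generations dominate the bill for a thinking model, so the output token count is the number to watch when budgeting.
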
| Benchmark | Category | Rank | Score | Sampled |
|---|---|---|---|---|
| Galileo Agent Leaderboard | Agentic | 15 | 0.34 | 2026-05-06 |
| Tau2 Airline | Agentic | 10 | 0.58 | 2026-05-06 |
| Vending-Bench 2 | Agentic | 38 | -11.34 | 2026-05-28 |
| VitaBench | Agentic | 12 | 14.50 | 2026-05-06 |
| WildAgtEval | Agentic | 3 | 62.6% | 2026-05-28 |
| OpenUGI | Alignment | 871 | 28.75 | 2026-05-06 |
| ArtifactsBench | Coding | 6 | 55.01 | 2026-05-06 |
| CFEval | Coding | 1 | 2134 | 2026-05-06 |
| MMTU | Data | 11 | 0.53 | 2026-05-06 |
| Arena-Hard v2 | General Knowledge | 3 | 0.80 | 2026-05-06 |
| MMLU-ProX | General Knowledge | 5 | 0.81 | 2026-05-06 |
| MMLU-Redux | General Knowledge | 5 | 0.94 | 2026-05-06 |
| Multi-IF | Instruction Following | 1 | 0.81 | 2026-05-06 |
| ConStory-Bench | Long Context | 9 | CED 0.559 | 2026-05-28 |
| PolyMATH | Mathematics | 6 | 0.60 | 2026-05-06 |
| BRIDGE Medical Leaderboard | Medical | 31 | 48.11 | 2026-05-27 |
| BRIDGE Medical Leaderboard | Medical | 79 | 41.63 | 2026-05-27 |
| BRIDGE Medical Leaderboard | Medical | 99 | 40.14 | 2026-05-27 |
| Medmarks | Medical | 5 | 0.5160613633727644 | 2026-05-27 |
| Medmarks | Medical | 7 | 0.6031541433378664 | 2026-05-27 |
| Design Arena | Multimodal | 98 | 1091 | 2026-05-06 |
| Humanity's Last Exam (Text Only) | Reasoning | 18 | 15.43 | 2026-05-06 |
| MultiNRC | Reasoning | 21 | 27.11 | 2026-05-06 |
| OJBench | Reasoning | 6 | 0.33 | 2026-05-06 |
| BFCL-v3 | Tool Use | 5 | 0.72 | 2026-05-06 |
| Creative Writing v3 | Writing | 5 | 0.86 | 2026-05-06 |
| WritingBench | Writing | 1 | 0.88 | 2026-05-06 |