Qwen3 235B A22B Thinking 2507

Qwen / Qwen

27 scores
24 benchmarks
$0.1495 / $1.495 per 1M tokens (cost in/out)

Metadata

Qwen Open source

Aliases: qwen-qwen3-235b-a22b-thinking-2507, qwen/qwen3-235b-a22b-thinking-2507, qwen3-235b-a22b-thinking-2507

Benchmark Results

Benchmark | Category | Rank | Score | Sampled
Galileo Agent Leaderboard | Agentic | 15 | 0.34 | 2026-05-06
Tau2 Airline | Agentic | 10 | 0.58 | 2026-05-06
Vending-Bench 2 | Agentic | 38 | -11.34 | 2026-05-28
VitaBench | Agentic | 12 | 14.50 | 2026-05-06
WildAgtEval | Agentic | 3 | 62.6% | 2026-05-28
OpenUGI | Alignment | 871 | 28.75 | 2026-05-06
ArtifactsBench | Coding | 6 | 55.01 | 2026-05-06
CFEval | Coding | 1 | 2134 | 2026-05-06
MMTU | Data | 11 | 0.53 | 2026-05-06
Arena-Hard v2 | General Knowledge | 3 | 0.80 | 2026-05-06
MMLU-ProX | General Knowledge | 5 | 0.81 | 2026-05-06
MMLU-Redux | General Knowledge | 5 | 0.94 | 2026-05-06
Multi-IF | Instruction Following | 1 | 0.81 | 2026-05-06
ConStory-Bench | Long Context | 9 | CED 0.559 | 2026-05-28
PolyMATH | Mathematics | 6 | 0.60 | 2026-05-06
BRIDGE Medical Leaderboard | Medical | 31 | 48.11 | 2026-05-27
BRIDGE Medical Leaderboard | Medical | 79 | 41.63 | 2026-05-27
BRIDGE Medical Leaderboard | Medical | 99 | 40.14 | 2026-05-27
Medmarks | Medical | 5 | 0.5160613633727644 | 2026-05-27
Medmarks | Medical | 7 | 0.6031541433378664 | 2026-05-27
Design Arena | Multimodal | 98 | 1091 | 2026-05-06
Humanity's Last Exam (Text Only) | Reasoning | 18 | 15.43 | 2026-05-06
MultiNRC | Reasoning | 21 | 27.11 | 2026-05-06
OJBench | Reasoning | 6 | 0.33 | 2026-05-06
BFCL-v3 | Tool Use | 5 | 0.72 | 2026-05-06
Creative Writing v3 | Writing | 5 | 0.86 | 2026-05-06
WritingBench | Writing | 1 | 0.88 | 2026-05-06