Qwen3.5-9B

Qwen / Qwen

50scores
37benchmarks
$0.1 / $0.15 per 1M tokenscost in/out

Metadata

Qwen Open source

Aliases: qwen-qwen3.5-9b, qwen-qwen3.5-9b-20260310, qwen/qwen3.5-9b, qwen/qwen3.5-9b-20260310, qwen3.5-9b, qwen3.5-9b-20260310

Benchmark Results

Benchmark Category Rank Score Sampled
DeepPlanning Agentic 7 0.18 2026-05-06
PinchBench Agentic 63 0.45 2026-05-06
ScreenSpot-Pro Agentic 11 65.20 2026-05-06
t2-bench Agentic 14 0.79 2026-05-06
Tau2-Bench Telecom Agentic 75 86.8% 2026-05-11
Tau2-Bench Telecom Agentic 84 85.1% 2026-05-11
Terminal-Bench Hard Agentic 129 24.2% 2026-05-11
Terminal-Bench Hard Agentic 158 18.2% 2026-05-11
OpenUGI Alignment 1117 17.27 2026-05-06
OpenUGI Alignment 1150 15.08 2026-05-06
SciCode Coding 288 27.7% 2026-05-11
SciCode Coding 289 27.5% 2026-05-11
ALL Bench LLM General Knowledge 10 39.74 2026-05-06
MAXIFE General Knowledge 6 0.83 2026-05-06
MMLU-ProX General Knowledge 14 0.76 2026-05-06
MMLU-Redux General Knowledge 19 0.91 2026-05-06
NOVA-63 General Knowledge 6 0.56 2026-05-06
IFBench Instruction Following 12 0.65 2026-05-06
AI2D Intelligence 1 90.2 2026-05-27
AIIQ Composite IQ Intelligence 38 87 2026-05-12
Artificial Analysis Intelligence Index Intelligence 128 32.43 2026-05-11
Artificial Analysis Intelligence Index Intelligence 170 27.33 2026-05-11
Humanity's Last Exam Intelligence 108 13.3% 2026-05-11
Humanity's Last Exam Intelligence 180 8.6% 2026-05-11
MMBench Intelligence 1 90.1 2026-05-27
OCRBench v2 Intelligence 10 58.70 2026-05-06
OCRBench v2 Intelligence 2 64.10 2026-05-06
RealWorldQA Intelligence 1 80.3 2026-05-27
AA-LCR Long Context 7 0.63 2026-05-06
AIME 2026 Mathematics 7 92.50 2026-05-06
HMMT 2025 Mathematics 23 0.83 2026-05-06
HMMT February 2026 Mathematics 10 71.21 2026-05-06
PolyMATH Mathematics 7 0.57 2026-05-06
ALL Bench Multimodal Multimodal 15 34.19 2026-05-06
ALL Bench Multimodal Multimodal 1 69.30 2026-05-06
ALL Bench Multimodal Multimodal 5 25.48 2026-05-06
IDP Leaderboard Multimodal 12 76.69 2026-05-06
MDPBench Multimodal 8 65.70 2026-05-06
ParseBench Multimodal 16 31.90 2026-05-06
Artificial Analysis Openness Index Openness 147 38.89 2026-05-11
Artificial Analysis Openness Index Openness 148 38.89 2026-05-11
Global PIQA Reasoning 8 0.83 2026-05-06
GPQA Diamond Reasoning 99 80.6% 2026-05-11
GPQA Diamond Reasoning 115 78.6% 2026-05-11
CritPt Science 119 0.6% 2026-05-11
CritPt Science 151 0.3% 2026-05-11
BFCL-V4 Tool Use 5 0.66 2026-05-06
WMT24++ Translation 8 0.73 2026-05-06
K-MetBench Weather 15 74.9% accuracy 2026-05-28
K-MetBench Weather 31 60.4% accuracy 2026-05-28