CritPt

Research-level physics reasoning benchmark with composite challenges designed by active physics researchers.

398rows
scoreprimary metric
2026-05-28sampled

Metadata

Metrics

Accuracy

Showing 2 latest source slices.

Latest Results

Provider-published Qwen3.7-Max comparison scores. Rows are marked self-reported and should be interpreted as source claims unless independently reproduced.

Rank Subject Accuracy Model Match Provenance Sampled
1 DeepSeek V4 Pro Max 12.9% DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Self-reported 2026-05-28
2 Claude Opus 4.6 Max 12.6% Claude Opus 4.6
anthropic-claude-opus-4.6
Self-reported 2026-05-28
3 Qwen3.7 Max 11.4% Qwen3.7 Max
qwen-qwen3.7-max
Self-reported 2026-05-28
4 Kimi K2.6 Thinking 8% KIMI MoonshotAI: Kimi K2.6
moonshotai-kimi-k2.6
Self-reported 2026-05-28
5 GLM-5.1 Thinking 4.6% GLM GLM 5.1
z-ai-glm-5.1
Self-reported 2026-05-28
6 Qwen3.6 Plus 2.9% Qwen3.6 Plus
qwen-qwen3.6-plus
Self-reported 2026-05-28
1 GPT-5.5 Pro (xhigh) 30.6% GPT-5.5 Pro
openai-gpt-5.5-pro
Imported 2026-05-11
2 GPT-5.4 Pro (xhigh) 30% GPT-5.4 Pro
openai-gpt-5.4-pro
Imported 2026-05-11
3 GPT-5.5 (xhigh) 27.1% GPT-5.5
openai-gpt-5.5
Imported 2026-05-11
4 Gemini 3 Deep Think 25.7% Imported 2026-05-11
5 GPT-5.5 (high) 25.4% GPT-5.5
openai-gpt-5.5
Imported 2026-05-11
6 GPT-5.4 (xhigh) 23.4% GPT-5.4
openai-gpt-5.4
Imported 2026-05-11
7 GPT-5.5 (medium) 18.6% GPT-5.5
openai-gpt-5.5
Imported 2026-05-11
8 Gemini 3.1 Pro Preview 17.7% Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-11
9 GPT-5.3 Codex (xhigh) 16.9% GPT-5.3-Codex
openai-gpt-5.3-codex
Imported 2026-05-11
10 DeepSeek V4 Pro (Reasoning, Max Effort) 12.9% DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Imported 2026-05-11
11 Claude Opus 4.6 (Adaptive Reasoning, Max Effort) 12.6% Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-11
12 Claude Opus 4.7 (Adaptive Reasoning, Max Effort) 12% Claude Opus 4.7
anthropic-claude-opus-4.7
Imported 2026-05-11
13 GPT-5.2 (xhigh) 11.6% GPT-5.2
openai-gpt-5.2
Imported 2026-05-11
14 Muse Spark 11.3% Imported 2026-05-11
15 DeepSeek V4 Pro (Reasoning, High Effort) 10% DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Imported 2026-05-11
16 GPT-5.4 mini (xhigh) 10% GPT-5.4 Mini
openai-gpt-5.4-mini
Imported 2026-05-11
17 GPT-5.4 nano (xhigh) 9.3% GPT-5.4 Nano
openai-gpt-5.4-nano
Imported 2026-05-11
18 Gemini 3 Pro Preview (high) 9.1% Gemini 3
google-gemini-3
Imported 2026-05-11
19 GPT-5.2 Codex (xhigh) 8.7% GPT-5.2-Codex
openai-gpt-5.2-codex
Imported 2026-05-11
20 Gemini 3 Flash Preview (Reasoning) 8.6% Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-11
21 GPT-5.5 (low) 8% GPT-5.5
openai-gpt-5.5
Imported 2026-05-11
22 Grok 4.3 8% GROK Grok 4.3
x-ai-grok-4.3
Imported 2026-05-11
23 Kimi K2.6 8% KIMI MoonshotAI: Kimi K2.6
moonshotai-kimi-k2.6
Imported 2026-05-11
24 GPT-5.2 (medium) 7.9% GPT-5.2
openai-gpt-5.2
Imported 2026-05-11
25 DeepSeek V3.2 Speciale 7.4% DeepSeek V3.2 Speciale
deepseek-deepseek-v3.2-speciale
Imported 2026-05-11
26 GPT-5.4 (low) 7.4% GPT-5.4
openai-gpt-5.4
Imported 2026-05-11
27 DeepSeek V4 Flash (Reasoning, Max Effort) 7.1% DeepSeek V4 Flash
deepseek-deepseek-v4-flash
Imported 2026-05-11
28 Grok 4.20 0309 v2 (Reasoning) 6.6% GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-11
29 Grok 4.20 0309 (Reasoning) 6% GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-11
30 GPT-5 (high) 5.7% GPT-5
openai-gpt-5
Imported 2026-05-11
31 GPT-5.1 Codex (high) 5.7% GPT-5.1-Codex
openai-gpt-5.1-codex
Imported 2026-05-11
32 Claude Opus 4.7 (Non-reasoning, High Effort) 5.1% Claude Opus 4.7
anthropic-claude-opus-4.7
Imported 2026-05-11
33 GPT-5 Codex (high) 5.1% GPT-5 Codex
openai-gpt-5-codex
Imported 2026-05-11
34 GPT-5.4 nano (medium) 5.1% GPT-5.4 Nano
openai-gpt-5.4-nano
Imported 2026-05-11
35 GPT-5.1 (high) 4.9% GPT-5.1
openai-gpt-5.1
Imported 2026-05-11
36 Claude Opus 4.5 (Reasoning) 4.6% Claude Opus 4.5
anthropic-claude-opus-4.5
Imported 2026-05-11
37 GLM-5.1 (Reasoning) 4.6% GLM GLM 5.1
z-ai-glm-5.1
Imported 2026-05-11
38 Hy3-preview (Reasoning) 4.6% T Hy3 preview
tencent-hy3-preview
Imported 2026-05-11
39 MiMo-V2-Flash (Reasoning) 4.3% MiMo-V2-Flash
xiaomi-mimo-v2-flash
Imported 2026-05-11
40 MiMo-V2.5-Pro 4% MiMo-V2.5-Pro
xiaomi-mimo-v2.5-pro
Imported 2026-05-11
41 MiMo-V2.5 3.7% MiMo-V2.5
xiaomi-mimo-v2.5
Imported 2026-05-11
42 Qwen3.6 Max Preview 3.7% Qwen3.6 Max Preview
qwen-qwen3.6-max-preview
Imported 2026-05-11
43 DeepSeek V4 Flash (Reasoning, High Effort) 3.4% DeepSeek V4 Flash
deepseek-deepseek-v4-flash
Imported 2026-05-11
44 Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) 3.1% Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-11
45 Kimi K2.5 (Reasoning) 3.1% KIMI MoonshotAI: Kimi K2.5
moonshotai-kimi-k2.5
Imported 2026-05-11
46 NVIDIA Nemotron 3 Super 120B A12B (Reasoning) 3.1% Nemotron 3 Super
nvidia-nemotron-3-super-120b-a12b
Imported 2026-05-11
47 DeepSeek V3.2 (Reasoning) 2.9% DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-11
48 GPT-5.4 mini (medium) 2.9% GPT-5.4 Mini
openai-gpt-5.4-mini
Imported 2026-05-11
49 Grok 4 Fast (Reasoning) 2.9% GROK Grok 4 Fast
x-ai-grok-4-fast
Imported 2026-05-11
50 Grok 4.1 Fast (Reasoning) 2.9% GROK Grok 4.1 Fast
x-ai-grok-4.1-fast
Imported 2026-05-11
51 MiMo-V2-Flash (Feb 2026) 2.9% MiMo-V2-Flash
xiaomi-mimo-v2-flash
Imported 2026-05-11
52 Qwen3.6 Plus 2.9% Qwen3.6 Plus
qwen-qwen3.6-plus
Imported 2026-05-11
53 Claude Opus 4.6 (Non-reasoning, High Effort) 2.8% Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-11
54 Gemini 2.5 Pro 2.6% Gemini 2.5 Pro
google-gemini-2.5-pro
Imported 2026-05-11
55 Kimi K2 Thinking 2.6% KIMI MoonshotAI: Kimi K2 Thinking
moonshotai-kimi-k2-thinking
Imported 2026-05-11
56 Step 3.5 Flash 2.5% S Step 3.5 Flash
stepfun-step-3.5-flash
Imported 2026-05-11
57 Step 3.5 Flash 2603 2.3% S Step 3.5 Flash
stepfun-step-3.5-flash
Imported 2026-05-11
58 DeepSeek V3.1 (Reasoning) 2% DeepSeek V3.1
deepseek-deepseek-chat-v3.1
Imported 2026-05-11
59 GLM-5 (Reasoning) 2% GLM GLM 5
z-ai-glm-5
Imported 2026-05-11
60 Grok 4 2% GROK Grok 4
x-ai-grok-4
Imported 2026-05-11
61 DeepSeek V3.1 Terminus (Reasoning) 1.7% DeepSeek V3.1 Terminus
deepseek-deepseek-v3.1-terminus
Imported 2026-05-11
62 GLM-4.7 (Reasoning) 1.7% GLM GLM 4.7
z-ai-glm-4.7
Imported 2026-05-11
63 Qwen3 Max Thinking 1.7% Qwen3 Max Thinking
qwen-qwen3-max-thinking
Imported 2026-05-11
64 Qwen3.5 397B A17B (Reasoning) 1.7% Qwen3.5 397B A17B
qwen-qwen3.5-397b-a17b
Imported 2026-05-11
65 DeepSeek R1 0528 (May '25) 1.4% R1
deepseek-r1
Imported 2026-05-11
66 DeepSeek V3.2 Exp (Non-reasoning) 1.4% DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-11
67 DeepSeek V3.2 Exp (Reasoning) 1.4% DeepSeek V3.2 Exp
deepseek-deepseek-v3.2-exp
Imported 2026-05-11
68 ERNIE 5.0 Thinking Preview 1.4% Imported 2026-05-11
69 Gemini 2.5 Flash (Non-reasoning) 1.4% Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-11
70 Gemini 3 Flash Preview (Non-reasoning) 1.4% Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-11
71 Gemma 4 31B (Reasoning) 1.4% Gemma 4 31B
google-gemma-4-31b-it
Imported 2026-05-11
72 GPT-5 mini (medium) 1.4% GPT-5 Mini
openai-gpt-5-mini
Imported 2026-05-11
73 GPT-5.5 (Non-reasoning) 1.4% GPT-5.5
openai-gpt-5.5
Imported 2026-05-11
74 gpt-oss-20B (high) 1.4% gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-11
75 Kimi K2.6 (Non-reasoning) 1.4% KIMI MoonshotAI: Kimi K2.6
moonshotai-kimi-k2.6
Imported 2026-05-11
76 Apriel-v1.5-15B-Thinker 1.1% Imported 2026-05-11
77 Claude 4 Sonnet (Non-reasoning) 1.1% Imported 2026-05-11
78 Claude 4.5 Sonnet (Reasoning) 1.1% Imported 2026-05-11
79 Gemini 2.5 Flash (Reasoning) 1.1% Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-11
80 Gemini 3.1 Flash-Lite Preview 1.1% Gemini 3.1 Flash Lite Preview
google-gemini-3.1-flash-lite-preview
Imported 2026-05-11
81 GLM-4.6 (Reasoning) 1.1% GLM GLM 4.6
z-ai-glm-4.6
Imported 2026-05-11
82 GPT-5 (low) 1.1% GPT-5
openai-gpt-5
Imported 2026-05-11
83 gpt-oss-120B (high) 1.1% gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-11
84 K-EXAONE (Reasoning) 1.1% Imported 2026-05-11
85 MiMo-V2-Omni 1.1% MiMo-V2-Omni
xiaomi-mimo-v2-omni
Imported 2026-05-11
86 MiMo-V2.5-Pro (Non-reasoning) 1.1% MiMo-V2.5-Pro
xiaomi-mimo-v2.5-pro
Imported 2026-05-11
87 MiniMax-M2.5 1.1% MiniMax M2.5
minimax-minimax-m2.5
Imported 2026-05-11
88 o3 1.1% o3
openai-o3
Imported 2026-05-11
89 Qwen3.6 27B (Reasoning) 1.1% Qwen3.6 27B
qwen-qwen3.6-27b
Imported 2026-05-11
90 Claude 3.7 Sonnet (Reasoning) 0.9% Claude 3.7 Sonnet (thinking)
anthropic-claude-3.7-sonnet-thinking
Imported 2026-05-11
91 Claude Sonnet 4.6 (Non-reasoning, High Effort) 0.9% Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-11
92 Claude Sonnet 4.6 (Non-reasoning, Low Effort) 0.9% Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-11
93 DeepSeek V3.2 (Non-reasoning) 0.9% DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-11
94 DeepSeek V4 Pro (Non-reasoning) 0.9% DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Imported 2026-05-11
95 MiMo-V2-Omni-0327 0.9% Imported 2026-05-11
96 MiniMax-M2 0.9% MiniMax M2
minimax-minimax-m2
Imported 2026-05-11
97 NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) 0.9% Nemotron 3 Nano 30B A3B
nvidia-nemotron-3-nano-30b-a3b
Imported 2026-05-11
98 Qwen3 Max (Preview) 0.9% Qwen3 Max
qwen-qwen3-max
Imported 2026-05-11
99 Qwen3.5 122B A10B (Non-reasoning) 0.9% Qwen3.5-122B-A10B
qwen-qwen3.5-122b-a10b
Imported 2026-05-11
100 Qwen3.5 27B (Reasoning) 0.9% Qwen3.5-27B
qwen-qwen3.5-27b
Imported 2026-05-11
101 Qwen3.5 35B A3B (Reasoning) 0.9% Qwen3.5-35B-A3B
qwen-qwen3.5-35b-a3b
Imported 2026-05-11
102 Qwen3.5 397B A17B (Non-reasoning) 0.9% Qwen3.5 397B A17B
qwen-qwen3.5-397b-a17b
Imported 2026-05-11
103 Qwen3.6 27B (Non-reasoning) 0.9% Qwen3.6 27B
qwen-qwen3.6-27b
Imported 2026-05-11
104 Trinity Large Thinking 0.9% A Trinity Large Thinking
arcee-ai-trinity-large-thinking
Imported 2026-05-11
105 Mercury 2 0.8% I Mercury 2
inception-mercury-2
Imported 2026-05-11
106 DeepSeek R1 (Jan '25) 0.6% R1
deepseek-r1
Imported 2026-05-11
107 Gemma 4 E4B (Reasoning) 0.6% Imported 2026-05-11
108 GLM 5V Turbo (Reasoning) 0.6% GLM GLM 5V Turbo
z-ai-glm-5v-turbo
Imported 2026-05-11
109 GPT-5.2 (Non-reasoning) 0.6% GPT-5.2
openai-gpt-5.2
Imported 2026-05-11
110 GPT-5.4 (Non-reasoning) 0.6% GPT-5.4
openai-gpt-5.4
Imported 2026-05-11
111 Grok 3 mini Reasoning (high) 0.6% Imported 2026-05-11
112 Kimi K2.5 (Non-reasoning) 0.6% KIMI MoonshotAI: Kimi K2.5
moonshotai-kimi-k2.5
Imported 2026-05-11
113 Ling-1T 0.6% Imported 2026-05-11
114 MiniMax-M2.7 0.6% MiniMax M2.7
minimax-minimax-m2.7
Imported 2026-05-11
115 Nemotron Cascade 2 30B A3B 0.6% Imported 2026-05-11
116 o4-mini (high) 0.6% o4 Mini
openai-o4-mini
Imported 2026-05-11
117 Qwen3.5 122B A10B (Reasoning) 0.6% Qwen3.5-122B-A10B
qwen-qwen3.5-122b-a10b
Imported 2026-05-11
118 Qwen3.5 35B A3B (Non-reasoning) 0.6% Qwen3.5-35B-A3B
qwen-qwen3.5-35b-a3b
Imported 2026-05-11
119 Qwen3.5 9B (Non-reasoning) 0.6% Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-11
120 Qwen3.5 Omni Plus 0.6% Imported 2026-05-11
121 Ring-1T 0.6% Imported 2026-05-11
122 Apriel-v1.6-15B-Thinker 0.3% Imported 2026-05-11
123 Claude 4 Sonnet (Reasoning) 0.3% Imported 2026-05-11
124 Claude Opus 4.5 (Non-reasoning) 0.3% Claude Opus 4.5
anthropic-claude-opus-4.5
Imported 2026-05-11
125 DeepSeek V4 Flash (Non-reasoning) 0.3% DeepSeek V4 Flash
deepseek-deepseek-v4-flash
Imported 2026-05-11
126 Doubao Seed Code 0.3% Imported 2026-05-11
127 EXAONE 4.5 33B 0.3% Imported 2026-05-11
128 Falcon-H1R-7B 0.3% Imported 2026-05-11
129 Gemini 2.5 Flash Preview (Sep '25) (Reasoning) 0.3% Imported 2026-05-11
130 Gemma 4 E4B (Non-reasoning) 0.3% Imported 2026-05-11
131 GLM-4.7-Flash (Reasoning) 0.3% GLM GLM 4.7 Flash
z-ai-glm-4.7-flash
Imported 2026-05-11
132 GLM-5-Turbo 0.3% GLM GLM 5 Turbo
z-ai-glm-5-turbo
Imported 2026-05-11
133 Grok 4.20 0309 v2 (Non-reasoning) 0.3% GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-11
134 Hermes 4 - Llama-3.1 405B (Reasoning) 0.3% Imported 2026-05-11
135 Hy3-preview (Non-reasoning) 0.3% T Hy3 preview
tencent-hy3-preview
Imported 2026-05-11
136 INTELLECT-3 0.3% PI INTELLECT-3
prime-intellect-intellect-3
Imported 2026-05-11
137 Ling-2.6-1T 0.3% I Ling-2.6-1T
inclusionai-ling-2.6-1t
Imported 2026-05-11
138 Magistral Medium 1 0.3% Imported 2026-05-11
139 Magistral Medium 1.2 0.3% Imported 2026-05-11
140 Magistral Small 1.2 0.3% Imported 2026-05-11
141 MiMo-V2-Pro 0.3% MiMo-V2-Pro
xiaomi-mimo-v2-pro
Imported 2026-05-11
142 MiniMax-M2.1 0.3% MiniMax M2.1
minimax-minimax-m2.1
Imported 2026-05-11
143 Mistral Small 4 (Non-reasoning) 0.3% Mistral: Mistral Small 4
mistralai-mistral-small-2603
Imported 2026-05-11
144 Mistral Small 4 (Reasoning) 0.3% Mistral: Mistral Small 4
mistralai-mistral-small-2603
Imported 2026-05-11
145 o1 0.3% o1
openai-o1
Imported 2026-05-11
146 o3-mini (high) 0.3% o3 Mini High
openai-o3-mini-high
Imported 2026-05-11
147 Qwen3 30B A3B 2507 (Reasoning) 0.3% Imported 2026-05-11
148 Qwen3 32B (Reasoning) 0.3% Qwen3 32B
qwen-qwen3-32b
Imported 2026-05-11
149 Qwen3 VL 8B (Reasoning) 0.3% Imported 2026-05-11
150 Qwen3.5 27B (Non-reasoning) 0.3% Qwen3.5-27B
qwen-qwen3.5-27b
Imported 2026-05-11
151 Qwen3.5 9B (Reasoning) 0.3% Qwen3.5-9B
qwen-qwen3.5-9b
Imported 2026-05-11
152 Qwen3.6 35B A3B (Reasoning) 0.3% Qwen3.6 35B A3B
qwen-qwen3.6-35b-a3b
Imported 2026-05-11
153 Ring-flash-2.0 0.3% Imported 2026-05-11
154 Sarvam 30B (high) 0.3% Imported 2026-05-11
155 Tri-21B-Think 0.3% Imported 2026-05-11
156 Apertus 70B Instruct 0% Imported 2026-05-11
157 Apertus 8B Instruct 0% Imported 2026-05-11
158 Claude 3 Haiku 0% Claude 3 Haiku
anthropic-claude-3-haiku
Imported 2026-05-11
159 Claude 3.5 Haiku 0% Claude 3.5 Haiku
anthropic-claude-3.5-haiku
Imported 2026-05-11
160 Claude 3.7 Sonnet (Non-reasoning) 0% Claude 3.7 Sonnet
anthropic-claude-3.7-sonnet
Imported 2026-05-11
161 Claude 4.1 Opus (Reasoning) 0% Imported 2026-05-11
162 Claude 4.5 Haiku (Non-reasoning) 0% Imported 2026-05-11
163 Claude 4.5 Haiku (Reasoning) 0% Imported 2026-05-11
164 Claude 4.5 Sonnet (Non-reasoning) 0% Imported 2026-05-11
165 Cogito v2.1 (Reasoning) 0% Imported 2026-05-11
166 Command A 0% C Command A
cohere-command-a
Imported 2026-05-11
167 DeepSeek R1 0528 Qwen3 8B 0% Imported 2026-05-11
168 DeepSeek R1 Distill Llama 70B 0% R1 Distill Llama 70B
deepseek-deepseek-r1-distill-llama-70b
Imported 2026-05-11
169 DeepSeek V3 (Dec '24) 0% DeepSeek V3
deepseek-deepseek-chat
Imported 2026-05-11
170 DeepSeek V3 0324 0% DeepSeek V3 0324
deepseek-deepseek-chat-v3-0324
Imported 2026-05-11
171 DeepSeek V3.1 (Non-reasoning) 0% DeepSeek V3.1
deepseek-deepseek-chat-v3.1
Imported 2026-05-11
172 DeepSeek V3.1 Terminus (Non-reasoning) 0% DeepSeek V3.1 Terminus
deepseek-deepseek-v3.1-terminus
Imported 2026-05-11
173 Devstral 2 0% Imported 2026-05-11
174 Devstral Medium 0% Mistral: Devstral Medium
mistralai-devstral-medium
Imported 2026-05-11
175 Devstral Small (Jul '25) 0% Mistral: Devstral Small 1.1
mistralai-devstral-small
Imported 2026-05-11
176 Devstral Small (May '25) 0% Mistral: Devstral Small 1.1
mistralai-devstral-small
Imported 2026-05-11
177 Devstral Small 2 0% Imported 2026-05-11
178 ERNIE 4.5 300B A47B 0% ERNIE 4.5 300B A47B
baidu-ernie-4.5-300b-a47b
Imported 2026-05-11
179 Exaone 4.0 1.2B (Non-reasoning) 0% Imported 2026-05-11
180 Exaone 4.0 1.2B (Reasoning) 0% Imported 2026-05-11
181 EXAONE 4.0 32B (Non-reasoning) 0% Imported 2026-05-11
182 EXAONE 4.0 32B (Reasoning) 0% Imported 2026-05-11
183 Gemini 2.0 Flash (Feb '25) 0% Gemini 2.0 Flash
google-gemini-2.0-flash
Imported 2026-05-11
184 Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) 0% Imported 2026-05-11
185 Gemini 2.5 Flash-Lite (Non-reasoning) 0% Gemini 2.5 Flash Lite
google-gemini-2.5-flash-lite
Imported 2026-05-11
186 Gemini 2.5 Flash-Lite (Reasoning) 0% Gemini 2.5 Flash Lite
google-gemini-2.5-flash-lite
Imported 2026-05-11
187 Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) 0% Gemini 2.5 Flash Lite Preview 09-2025
google-gemini-2.5-flash-lite-preview-09-2025
Imported 2026-05-11
188 Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) 0% Imported 2026-05-11
189 Gemini 3 Pro Preview (low) 0% Gemini 3
google-gemini-3
Imported 2026-05-11
190 Gemma 3 12B Instruct 0% Gemma 3 12B
google-gemma-3-12b-it
Imported 2026-05-11
191 Gemma 3 1B Instruct 0% Imported 2026-05-11
192 Gemma 3 270M 0% Imported 2026-05-11
193 Gemma 3 27B Instruct 0% Gemma 3 27B
google-gemma-3-27b-it
Imported 2026-05-11
194 Gemma 3 4B Instruct 0% Gemma 3 4B
google-gemma-3-4b-it
Imported 2026-05-11
195 Gemma 3n E2B Instruct 0% Imported 2026-05-11
196 Gemma 3n E4B Instruct 0% Imported 2026-05-11
197 Gemma 4 26B A4B (Non-reasoning) 0% Gemma 4 26B A4B
google-gemma-4-26b-a4b-it
Imported 2026-05-11
198 Gemma 4 26B A4B (Reasoning) 0% Gemma 4 26B A4B
google-gemma-4-26b-a4b-it
Imported 2026-05-11
199 Gemma 4 31B (Non-reasoning) 0% Gemma 4 31B
google-gemma-4-31b-it
Imported 2026-05-11
200 Gemma 4 E2B (Non-reasoning) 0% Imported 2026-05-11
201 Gemma 4 E2B (Reasoning) 0% Imported 2026-05-11
202 GLM-4.5 (Reasoning) 0% GLM GLM 4.5
z-ai-glm-4.5
Imported 2026-05-11
203 GLM-4.5-Air 0% GLM GLM 4.5 Air
z-ai-glm-4.5-air
Imported 2026-05-11
204 GLM-4.5V (Non-reasoning) 0% GLM GLM 4.5V
z-ai-glm-4.5v
Imported 2026-05-11
205 GLM-4.5V (Reasoning) 0% GLM GLM 4.5V
z-ai-glm-4.5v
Imported 2026-05-11
206 GLM-4.6 (Non-reasoning) 0% GLM GLM 4.6
z-ai-glm-4.6
Imported 2026-05-11
207 GLM-4.6V (Non-reasoning) 0% GLM GLM 4.6V
z-ai-glm-4.6v
Imported 2026-05-11
208 GLM-4.6V (Reasoning) 0% GLM GLM 4.6V
z-ai-glm-4.6v
Imported 2026-05-11
209 GLM-4.7 (Non-reasoning) 0% GLM GLM 4.7
z-ai-glm-4.7
Imported 2026-05-11
210 GLM-4.7-Flash (Non-reasoning) 0% GLM GLM 4.7 Flash
z-ai-glm-4.7-flash
Imported 2026-05-11
211 GLM-5 (Non-reasoning) 0% GLM GLM 5
z-ai-glm-5
Imported 2026-05-11
212 GLM-5.1 (Non-reasoning) 0% GLM GLM 5.1
z-ai-glm-5.1
Imported 2026-05-11
213 GPT-4.1 0% GPT-4.1
openai-gpt-4.1
Imported 2026-05-11
214 GPT-4.1 mini 0% GPT-4.1 Mini
openai-gpt-4.1-mini
Imported 2026-05-11
215 GPT-4.1 nano 0% GPT-4.1 Nano
openai-gpt-4.1-nano
Imported 2026-05-11
216 GPT-4o (Aug '24) 0% GPT-4o (2024-08-06)
openai-gpt-4o-2024-08-06
Imported 2026-05-11
217 GPT-4o (Nov '24) 0% GPT-4o
openai-gpt-4o
Imported 2026-05-11
218 GPT-5 (medium) 0% GPT-5
openai-gpt-5
Imported 2026-05-11
219 GPT-5 (minimal) 0% GPT-5
openai-gpt-5
Imported 2026-05-11
220 GPT-5 mini (high) 0% GPT-5 Mini
openai-gpt-5-mini
Imported 2026-05-11
221 GPT-5 mini (minimal) 0% GPT-5 Mini
openai-gpt-5-mini
Imported 2026-05-11
222 GPT-5 nano (high) 0% GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-11
223 GPT-5 nano (medium) 0% GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-11
224 GPT-5 nano (minimal) 0% GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-11
225 GPT-5.1 (Non-reasoning) 0% GPT-5.1
openai-gpt-5.1
Imported 2026-05-11
226 GPT-5.1 Codex mini (high) 0% GPT-5.1-Codex-Mini
openai-gpt-5.1-codex-mini
Imported 2026-05-11
227 GPT-5.4 mini (Non-Reasoning) 0% GPT-5.4 Mini
openai-gpt-5.4-mini
Imported 2026-05-11
228 GPT-5.4 nano (Non-Reasoning) 0% GPT-5.4 Nano
openai-gpt-5.4-nano
Imported 2026-05-11
229 gpt-oss-120B (low) 0% gpt-oss-120b
openai-gpt-oss-120b
Imported 2026-05-11
230 gpt-oss-20B (low) 0% gpt-oss-20b
openai-gpt-oss-20b
Imported 2026-05-11
231 Granite 3.3 8B (Non-reasoning) 0% Imported 2026-05-11
232 Granite 4.0 1B 0% Imported 2026-05-11
233 Granite 4.0 350M 0% Imported 2026-05-11
234 Granite 4.0 H 1B 0% Imported 2026-05-11
235 Granite 4.0 H 350M 0% Imported 2026-05-11
236 Granite 4.0 H Small 0% Imported 2026-05-11
237 Granite 4.0 Micro 0% Granite 4.0 Micro
ibm-granite-granite-4.0-h-micro
Imported 2026-05-11
238 Granite 4.1 30B 0% Imported 2026-05-11
239 Granite 4.1 3B 0% Imported 2026-05-11
240 Granite 4.1 8B 0% Granite 4.1 8B
ibm-granite-granite-4.1-8b
Imported 2026-05-11
241 Grok 3 0% GROK Grok 3
xaigrok-3
Imported 2026-05-11
242 Grok 4 Fast (Non-reasoning) 0% GROK Grok 4 Fast
x-ai-grok-4-fast
Imported 2026-05-11
243 Grok 4.1 Fast (Non-reasoning) 0% GROK Grok 4.1 Fast
x-ai-grok-4.1-fast
Imported 2026-05-11
244 Grok 4.20 0309 (Non-reasoning) 0% GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-11
245 Grok 4.3 (Non-reasoning) 0% GROK Grok 4.3
x-ai-grok-4.3
Imported 2026-05-11
246 Grok Code Fast 1 0% GROK Grok Code Fast 1
x-ai-grok-code-fast-1
Imported 2026-05-11
247 Hermes 4 - Llama-3.1 405B (Non-reasoning) 0% Imported 2026-05-11
248 Hermes 4 - Llama-3.1 70B (Non-reasoning) 0% Imported 2026-05-11
249 Hermes 4 - Llama-3.1 70B (Reasoning) 0% Imported 2026-05-11
250 HyperCLOVA X SEED Think (32B) 0% Imported 2026-05-11
251 Jamba 1.7 Large 0% Imported 2026-05-11
252 Jamba 1.7 Mini 0% Imported 2026-05-11
253 Jamba Reasoning 3B 0% Imported 2026-05-11
254 JT-MINI 0% Imported 2026-05-11
255 K-EXAONE (Non-reasoning) 0% Imported 2026-05-11
256 K2 Think V2 0% Imported 2026-05-11
257 K2-V2 (high) 0% Imported 2026-05-11
258 K2-V2 (low) 0% Imported 2026-05-11
259 K2-V2 (medium) 0% Imported 2026-05-11
260 KAT Coder Pro V2 0% K KAT-Coder-Pro V2
kwaipilot-kat-coder-pro-v2
Imported 2026-05-11
261 KAT-Coder-Pro V1 0% Imported 2026-05-11
262 Kimi K2 0% KIMI MoonshotAI: Kimi K2 0711
moonshotai-kimi-k2
Imported 2026-05-11
263 Kimi K2 0905 0% KIMI MoonshotAI: Kimi K2 0905
moonshotai-kimi-k2-0905
Imported 2026-05-11
264 LFM2 1.2B 0% Imported 2026-05-11
265 LFM2 2.6B 0% Imported 2026-05-11
266 LFM2 24B A2B 0% LFM LFM2-24B-A2B
liquid-lfm-2-24b-a2b
Imported 2026-05-11
267 LFM2 8B A1B 0% Imported 2026-05-11
268 LFM2.5-1.2B-Instruct 0% LFM LFM2.5-1.2B-Instruct
liquid-lfm-2.5-1.2b-instruct
Imported 2026-05-11
269 LFM2.5-1.2B-Thinking 0% LFM LFM2.5-1.2B-Thinking
liquid-lfm-2.5-1.2b-thinking
Imported 2026-05-11
270 LFM2.5-VL-1.6B 0% Imported 2026-05-11
271 Ling 2.6 Flash 0% I Ling-2.6-flash
inclusionai-ling-2.6-flash
Imported 2026-05-11
272 Ling-flash-2.0 0% Imported 2026-05-11
273 Ling-mini-2.0 0% Imported 2026-05-11
274 Llama 3 Instruct 70B 0% Imported 2026-05-11
275 Llama 3 Instruct 8B 0% Imported 2026-05-11
276 Llama 3.1 Instruct 405B 0% Imported 2026-05-11
277 Llama 3.1 Instruct 70B 0% Imported 2026-05-11
278 Llama 3.1 Instruct 8B 0% Imported 2026-05-11
279 Llama 3.1 Nemotron Instruct 70B 0% Imported 2026-05-11
280 Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) 0% Imported 2026-05-11
281 Llama 3.2 Instruct 11B (Vision) 0% Imported 2026-05-11
282 Llama 3.2 Instruct 1B 0% Imported 2026-05-11
283 Llama 3.3 Instruct 70B 0% Imported 2026-05-11
284 Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) 0% Imported 2026-05-11
285 Llama 3.3 Nemotron Super 49B v1 (Reasoning) 0% Imported 2026-05-11
286 Llama 4 Maverick 0% Llama 4 Maverick
meta-llama-4-maverick
Imported 2026-05-11
287 Llama 4 Scout 0% Llama 4 Scout
meta-llama-llama-4-scout
Imported 2026-05-11
288 Llama Nemotron Super 49B v1.5 (Non-reasoning) 0% Imported 2026-05-11
289 Llama Nemotron Super 49B v1.5 (Reasoning) 0% Imported 2026-05-11
290 LongCat Flash Lite 0% Imported 2026-05-11
291 Magistral Small 1 0% Imported 2026-05-11
292 Mi:dm K 2.5 Pro 0% Imported 2026-05-11
293 Mi:dm K 2.5 Pro Preview 0% Imported 2026-05-11
294 MiMo-V2-Flash (Non-reasoning) 0% MiMo-V2-Flash
xiaomi-mimo-v2-flash
Imported 2026-05-11
295 MiniCPM-V 4.6 1.3B 0% Imported 2026-05-11
296 MiniMax M1 80k 0% Imported 2026-05-11
297 Ministral 3 14B 0% Imported 2026-05-11
298 Ministral 3 3B 0% Imported 2026-05-11
299 Ministral 3 8B 0% Imported 2026-05-11
300 Mistral 7B Instruct 0% Imported 2026-05-11
301 Mistral Large 2 (Nov '24) 0% Imported 2026-05-11
302 Mistral Large 3 0% Imported 2026-05-11
303 Mistral Medium 3 0% Mistral: Mistral Medium 3
mistralai-mistral-medium-3
Imported 2026-05-11
304 Mistral Medium 3.1 0% Mistral: Mistral Medium 3.1
mistralai-mistral-medium-3.1
Imported 2026-05-11
305 Mistral Medium 3.5 0% Mistral: Mistral Medium 3.5
mistralai-mistral-medium-3-5
Imported 2026-05-11
306 Mistral Small 3.1 0% Imported 2026-05-11
307 Mistral Small 3.2 0% Imported 2026-05-11
308 Molmo2-8B 0% Imported 2026-05-11
309 Motif-2-12.7B-Reasoning 0% Imported 2026-05-11
310 Nanbeige4.1-3B 0% Imported 2026-05-11
311 Nemotron 3 Nano Omni 30B A3B Reasoning 0% Imported 2026-05-11
312 Nova 2.0 Lite (low) 0% Imported 2026-05-11
313 Nova 2.0 Lite (medium) 0% Imported 2026-05-11
314 Nova 2.0 Lite (Non-reasoning) 0% Imported 2026-05-11
315 Nova 2.0 Omni (low) 0% Imported 2026-05-11
316 Nova 2.0 Omni (medium) 0% Imported 2026-05-11
317 Nova 2.0 Omni (Non-reasoning) 0% Imported 2026-05-11
318 Nova 2.0 Pro Preview (low) 0% Imported 2026-05-11
319 Nova 2.0 Pro Preview (medium) 0% Imported 2026-05-11
320 Nova 2.0 Pro Preview (Non-reasoning) 0% Imported 2026-05-11
321 Nova Lite 0% Nova Lite 1.0
amazon-nova-lite-v1
Imported 2026-05-11
322 Nova Micro 0% Nova Micro 1.0
amazon-nova-micro-v1
Imported 2026-05-11
323 Nova Premier 0% Imported 2026-05-11
324 Nova Pro 0% Nova Pro 1.0
amazon-nova-pro-v1
Imported 2026-05-11
325 NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) 0% Nemotron 3 Nano 30B A3B
nvidia-nemotron-3-nano-30b-a3b
Imported 2026-05-11
326 NVIDIA Nemotron 3 Nano 4B 0% Imported 2026-05-11
327 NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) 0% Nemotron Nano 12B 2 VL
nvidia-nemotron-nano-12b-v2-vl
Imported 2026-05-11
328 NVIDIA Nemotron Nano 12B v2 VL (Reasoning) 0% Nemotron Nano 12B 2 VL
nvidia-nemotron-nano-12b-v2-vl
Imported 2026-05-11
329 NVIDIA Nemotron Nano 9B V2 (Non-reasoning) 0% Nemotron Nano 9B V2
nvidia-nemotron-nano-9b-v2
Imported 2026-05-11
330 NVIDIA Nemotron Nano 9B V2 (Reasoning) 0% Nemotron Nano 9B V2
nvidia-nemotron-nano-9b-v2
Imported 2026-05-11
331 Olmo 3 32B Think 0% OLMO Olmo 3 32B Think
allenai-olmo-3-32b-think
Imported 2026-05-11
332 Olmo 3 7B Instruct 0% Imported 2026-05-11
333 Olmo 3 7B Think 0% Imported 2026-05-11
334 Olmo 3.1 32B Instruct 0% OLMO Olmo 3.1 32B Instruct
allenai-olmo-3.1-32b-instruct
Imported 2026-05-11
335 Olmo 3.1 32B Think 0% Imported 2026-05-11
336 Phi-4 0% Phi 4
microsoft-phi-4
Imported 2026-05-11
337 Phi-4 Mini Instruct 0% Imported 2026-05-11
338 Qwen2.5 Instruct 72B 0% Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-11
339 Qwen3 0.6B (Non-reasoning) 0% Imported 2026-05-11
340 Qwen3 0.6B (Reasoning) 0% Imported 2026-05-11
341 Qwen3 1.7B (Non-reasoning) 0% Imported 2026-05-11
342 Qwen3 1.7B (Reasoning) 0% Imported 2026-05-11
343 Qwen3 14B (Non-reasoning) 0% Qwen3 14B
qwen-qwen3-14b
Imported 2026-05-11
344 Qwen3 14B (Reasoning) 0% Qwen3 14B
qwen-qwen3-14b
Imported 2026-05-11
345 Qwen3 235B A22B (Non-reasoning) 0% Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-11
346 Qwen3 235B A22B (Reasoning) 0% Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-11
347 Qwen3 235B A22B 2507 (Reasoning) 0% Qwen3 235B A22B Instruct 2507
qwen-qwen3-235b-a22b-2507
Imported 2026-05-11
348 Qwen3 235B A22B 2507 Instruct 0% Qwen3 235B A22B Instruct 2507
qwen-qwen3-235b-a22b-2507
Imported 2026-05-11
349 Qwen3 30B A3B (Non-reasoning) 0% Qwen3 30B A3B
qwen-qwen3-30b-a3b
Imported 2026-05-11
350 Qwen3 30B A3B (Reasoning) 0% Qwen3 30B A3B
qwen-qwen3-30b-a3b
Imported 2026-05-11
351 Qwen3 30B A3B 2507 Instruct 0% Imported 2026-05-11
352 Qwen3 4B 2507 (Reasoning) 0% Imported 2026-05-11
353 Qwen3 4B 2507 Instruct 0% Imported 2026-05-11
354 Qwen3 8B (Non-reasoning) 0% Qwen3 8B
qwen-qwen3-8b
Imported 2026-05-11
355 Qwen3 8B (Reasoning) 0% Qwen3 8B
qwen-qwen3-8b
Imported 2026-05-11
356 Qwen3 Coder 30B A3B Instruct 0% Qwen3 Coder 30B A3B Instruct
qwen-qwen3-coder-30b-a3b-instruct
Imported 2026-05-11
357 Qwen3 Coder 480B A35B Instruct 0% Qwen3 Coder 480B A35B
qwen-qwen3-coder
Imported 2026-05-11
358 Qwen3 Coder Next 0% Qwen3 Coder Next
qwen-qwen3-coder-next
Imported 2026-05-11
359 Qwen3 Max 0% Qwen3 Max
qwen-qwen3-max
Imported 2026-05-11
360 Qwen3 Max Thinking (Preview) 0% Qwen3 Max Thinking
qwen-qwen3-max-thinking
Imported 2026-05-11
361 Qwen3 Next 80B A3B (Reasoning) 0% Imported 2026-05-11
362 Qwen3 Next 80B A3B Instruct 0% Qwen3 Next 80B A3B Instruct
qwen-qwen3-next-80b-a3b-instruct
Imported 2026-05-11
363 Qwen3 Omni 30B A3B (Reasoning) 0% Imported 2026-05-11
364 Qwen3 Omni 30B A3B Instruct 0% Imported 2026-05-11
365 Qwen3 VL 235B A22B (Reasoning) 0% Imported 2026-05-11
366 Qwen3 VL 235B A22B Instruct 0% Qwen3 VL 235B A22B Instruct
qwen-qwen3-vl-235b-a22b-instruct
Imported 2026-05-11
367 Qwen3 VL 30B A3B (Reasoning) 0% Imported 2026-05-11
368 Qwen3 VL 30B A3B Instruct 0% Qwen3 VL 30B A3B Instruct
qwen-qwen3-vl-30b-a3b-instruct
Imported 2026-05-11
369 Qwen3 VL 32B (Reasoning) 0% Imported 2026-05-11
370 Qwen3 VL 32B Instruct 0% Qwen3 VL 32B Instruct
qwen-qwen3-vl-32b-instruct
Imported 2026-05-11
371 Qwen3 VL 4B (Reasoning) 0% Imported 2026-05-11
372 Qwen3 VL 4B Instruct 0% Imported 2026-05-11
373 Qwen3 VL 8B Instruct 0% Qwen3 VL 8B Instruct
qwen-qwen3-vl-8b-instruct
Imported 2026-05-11
374 Qwen3.5 0.8B (Non-reasoning) 0% Imported 2026-05-11
375 Qwen3.5 0.8B (Reasoning) 0% Imported 2026-05-11
376 Qwen3.5 2B (Non-reasoning) 0% Imported 2026-05-11
377 Qwen3.5 2B (Reasoning) 0% Imported 2026-05-11
378 Qwen3.5 4B (Non-reasoning) 0% Imported 2026-05-11
379 Qwen3.5 4B (Reasoning) 0% Imported 2026-05-11
380 Qwen3.5 Omni Flash 0% Imported 2026-05-11
381 Qwen3.6 35B A3B (Non-reasoning) 0% Qwen3.6 35B A3B
qwen-qwen3.6-35b-a3b
Imported 2026-05-11
382 Reka Flash 3 0% REKA Reka Flash 3
rekaai-reka-flash-3
Imported 2026-05-11
383 Sarvam 105B (high) 0% Imported 2026-05-11
384 Sarvam M (Reasoning) 0% Imported 2026-05-11
385 Seed-OSS-36B-Instruct 0% Imported 2026-05-11
386 Solar Open 100B (Reasoning) 0% Imported 2026-05-11
387 Solar Pro 2 (Non-reasoning) 0% Imported 2026-05-11
388 Solar Pro 2 (Reasoning) 0% Imported 2026-05-11
389 Solar Pro 3 0% U Solar Pro 3
upstage-solar-pro-3
Imported 2026-05-11
390 Step3 VL 10B 0% Imported 2026-05-11
391 Tiny Aya Global 0% Imported 2026-05-11
392 Tri-21B-think Preview 0% Imported 2026-05-11