Arena AI Code
Crowdsourced Arena AI pairwise human-preference leaderboard for code generation and coding-assistant models.
73rows
eloprimary metric
2026-05-06sampled
Metadata
Metrics
Arena ELO, 95% confidence interval (lower is better), Votes
| Rank | Subject | Arena ELO | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | claude-opus-4-7-thinking | 1570 | — | Imported | 2026-05-06 |
| 2 | claude-opus-4-7 | 1561 | Claude Opus 4.7 anthropic-claude-opus-4.7 | Imported | 2026-05-06 |
| 3 | claude-opus-4-6-thinking | 1548 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 4 | claude-opus-4-6 | 1543 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 5 | glm-5.1 | 1532 | GLM 5.1 z-ai-glm-5.1 | Imported | 2026-05-06 |
| 6 | claude-sonnet-4-6 | 1526 | Claude Sonnet 4.6 anthropic-claude-sonnet-4.6 | Imported | 2026-05-06 |
| 7 | kimi-k2.6 | 1525 | MoonshotAI: Kimi K2.6 moonshotai-kimi-k2.6 | Imported | 2026-05-06 |
| 8 | muse-spark | 1509 | — | Imported | 2026-05-06 |
| 9 | claude-opus-4-5-20251101-thinking-32k | 1491 | — | Imported | 2026-05-06 |
| 10 | gpt-5.5-high (codex-harness) | 1490 | GPT-5.5 openai-gpt-5.5 | Imported | 2026-05-06 |
| 11 | mimo-v2.5-pro | 1475 | MiMo-V2.5-Pro xiaomi-mimo-v2.5-pro | Imported | 2026-05-06 |
| 12 | claude-opus-4-5-20251101 | 1467 | Claude Opus 4.5 anthropic-claude-opus-4.5 | Imported | 2026-05-06 |
| 13 | qwen3.6-plus | 1465 | Qwen3.6 Plus qwen-qwen3.6-plus | Imported | 2026-05-06 |
| 14 | gpt-5.4-high (codex-harness) | 1457 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 15 | deepseek-v4-pro-thinking | 1455 | DeepSeek V4 Pro deepseek-deepseek-v4-pro | Imported | 2026-05-06 |
| 16 | gemini-3.1-pro-preview | 1454 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 17 | mimo-v2.5 | 1446 | MiMo-V2.5 xiaomi-mimo-v2.5 | Imported | 2026-05-06 |
| 18 | gpt-5.5 (codex-harness) | 1443 | GPT-5.5 openai-gpt-5.5 | Imported | 2026-05-06 |
| 19 | glm-4.7 | 1440 | GLM 4.7 z-ai-glm-4.7 | Imported | 2026-05-06 |
| 20 | gemini-3-pro | 1438 | Gemini 3 google-gemini-3 | Imported | 2026-05-06 |
| 21 | gpt-5.4-medium (codex-harness) | 1437 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 22 | gemini-3-flash | 1437 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 23 | glm-5 | 1436 | GLM 5 z-ai-glm-5 | Imported | 2026-05-06 |
| 24 | kimi-k2.5-thinking | 1430 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 25 | mimo-v2-pro | 1430 | MiMo-V2-Pro xiaomi-mimo-v2-pro | Imported | 2026-05-06 |
| 26 | minimax-m2.7 | 1408 | MiniMax M2.7 minimax-minimax-m2.7 | Imported | 2026-05-06 |
| 27 | kimi-k2.5-instant | 1408 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 28 | gpt-5.3-codex (codex-harness) | 1406 | GPT-5.3-Codex openai-gpt-5.3-codex | Imported | 2026-05-06 |
| 29 | gpt-5.2 | 1404 | GPT-5.2 openai-gpt-5.2 | Imported | 2026-05-06 |
| 30 | grok-4.3 | 1403 | Grok 4.3 x-ai-grok-4.3 | Imported | 2026-05-06 |
| 31 | gpt-5.4-mini-high | 1401 | GPT-5.4 Mini openai-gpt-5.4-mini | Imported | 2026-05-06 |
| 32 | grok-4.20-beta-0309-reasoning | 1399 | Grok 4.20 x-ai-grok-4.20 | Imported | 2026-05-06 |
| 33 | gpt-5-medium | 1393 | GPT-5 openai-gpt-5 | Imported | 2026-05-06 |
| 34 | minimax-m2.1-preview | 1392 | — | Imported | 2026-05-06 |
| 35 | gpt-5.1-medium | 1391 | GPT-5.1 openai-gpt-5.1 | Imported | 2026-05-06 |
| 36 | gemini-3-flash (thinking-minimal) | 1389 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 37 | claude-sonnet-4-5-20250929-thinking-32k | 1388 | — | Imported | 2026-05-06 |
| 38 | qwen3.5-397b-a17b | 1388 | Qwen3.5 397B A17B qwen-qwen3.5-397b-a17b | Imported | 2026-05-06 |
| 39 | gemma-4-31b | 1387 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-06 |
| 40 | claude-sonnet-4-5-20250929 | 1386 | Claude Sonnet 4.5 anthropic-claude-sonnet-4.5 | Imported | 2026-05-06 |
| 41 | claude-opus-4-1-20250805 | 1385 | Claude Opus 4.1 anthropic-claude-opus-4.1 | Imported | 2026-05-06 |
| 42 | minimax-m2.5 | 1383 | MiniMax M2.5 minimax-minimax-m2.5 | Imported | 2026-05-06 |
| 43 | deepseek-v3.2-thinking | 1368 | DeepSeek V3.2 deepseek-deepseek-v3.2 | Imported | 2026-05-06 |
| 44 | qwen3.5-122b-a10b | 1363 | Qwen3.5-122B-A10B qwen-qwen3.5-122b-a10b | Imported | 2026-05-06 |
| 45 | gemma-4-26b-a4b | 1360 | Gemma 4 26B A4B google-gemma-4-26b-a4b-it | Imported | 2026-05-06 |
| 46 | glm-4.6 | 1355 | GLM 4.6 z-ai-glm-4.6 | Imported | 2026-05-06 |
| 47 | qwen3.5-27b | 1352 | Qwen3.5-27B qwen-qwen3.5-27b | Imported | 2026-05-06 |
| 48 | gpt-5.1 | 1340 | GPT-5.1 openai-gpt-5.1 | Imported | 2026-05-06 |
| 49 | mimo-v2-flash (non-thinking) | 1337 | MiMo-V2-Flash xiaomi-mimo-v2-flash | Imported | 2026-05-06 |
| 50 | gpt-5.2-codex | 1335 | GPT-5.2-Codex openai-gpt-5.2-codex | Imported | 2026-05-06 |
| 51 | deepseek-v3.2 | 1332 | DeepSeek V3.2 deepseek-deepseek-v3.2 | Imported | 2026-05-06 |
| 52 | kimi-k2-thinking-turbo | 1329 | MoonshotAI: Kimi K2 Thinking moonshotai-kimi-k2-thinking | Imported | 2026-05-06 |
| 53 | gpt-5.1-codex | 1329 | GPT-5.1-Codex openai-gpt-5.1-codex | Imported | 2026-05-06 |
| 54 | claude-haiku-4-5-20251001 | 1317 | Claude Haiku 4.5 anthropic-claude-haiku-4.5 | Imported | 2026-05-06 |
| 55 | minimax-m2 | 1304 | MiniMax M2 minimax-minimax-m2 | Imported | 2026-05-06 |
| 56 | mimo-v2-flash (thinking) | 1301 | MiMo-V2-Flash xiaomi-mimo-v2-flash | Imported | 2026-05-06 |
| 57 | deepseek-v3.2-exp | 1286 | DeepSeek V3.2 Exp deepseek-deepseek-v3.2-exp | Imported | 2026-05-06 |
| 58 | qwen3-coder-480b-a35b-instruct | 1281 | Qwen3 Coder 480B A35B qwen-qwen3-coder | Imported | 2026-05-06 |
| 59 | KAT-Coder-Pro-V1 | 1258 | — | Imported | 2026-05-06 |
| 60 | qwen3.5-35b-a3b | 1249 | Qwen3.5-35B-A3B qwen-qwen3.5-35b-a3b | Imported | 2026-05-06 |
| 61 | trinity-large-thinking | 1245 | Trinity Large Thinking arcee-ai-trinity-large-thinking | Imported | 2026-05-06 |
| 62 | gemini-3.1-flash-lite-preview | 1240 | Gemini 3.1 Flash Lite Preview google-gemini-3.1-flash-lite-preview | Imported | 2026-05-06 |
| 63 | gpt-5.1-codex-mini | 1239 | GPT-5.1-Codex-Mini openai-gpt-5.1-codex-mini | Imported | 2026-05-06 |
| 64 | qwen3.5-flash | 1237 | Qwen3.5-Flash qwen-qwen3.5-flash-02-23 | Imported | 2026-05-06 |
| 65 | grok-4-1-fast-reasoning | 1234 | Grok 4.1 Fast x-ai-grok-4.1-fast | Imported | 2026-05-06 |
| 66 | mistral-large-3 | 1222 | — | Imported | 2026-05-06 |
| 67 | grok-4.1-thinking | 1208 | — | Imported | 2026-05-06 |
| 68 | gemini-2.5-pro | 1203 | Gemini 2.5 Pro google-gemini-2.5-pro | Imported | 2026-05-06 |
| 69 | devstral-2 | 1199 | — | Imported | 2026-05-06 |
| 70 | mercury-2 | 1165 | Mercury 2 inception-mercury-2 | Imported | 2026-05-06 |
| 71 | grok-4-fast-reasoning | 1149 | Grok 4 Fast x-ai-grok-4-fast | Imported | 2026-05-06 |
| 72 | grok-code-fast-1 | 1140 | Grok Code Fast 1 x-ai-grok-code-fast-1 | Imported | 2026-05-06 |
| 73 | devstral-medium-2507 | 1091 | Mistral: Devstral Medium mistralai-devstral-medium | Imported | 2026-05-06 |
No matching rows.