LMArena WebDev Arena
LMArena's WebDev Arena leaderboard for model performance on interactive web development tasks judged by human preference.
25rows
ratingprimary metric
2026-05-06sampled
Metadata
Metrics
Arena rating, Rating lower bound, Rating upper bound, Votes
| Rank | Subject | Arena rating | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | claude-opus-4-7-thinking | 1567.85 | — | Imported | 2026-05-06 |
| 2 | claude-opus-4-7 | 1561.91 | Claude Opus 4.7 anthropic-claude-opus-4.7 | Imported | 2026-05-06 |
| 3 | claude-opus-4-6-thinking | 1548.84 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 4 | claude-opus-4-6 | 1544.36 | Claude Opus 4.6 anthropic-claude-opus-4.6 | Imported | 2026-05-06 |
| 5 | glm-5.1 | 1531.70 | GLM 5.1 z-ai-glm-5.1 | Imported | 2026-05-06 |
| 6 | claude-sonnet-4-6 | 1526.17 | Claude Sonnet 4.6 anthropic-claude-sonnet-4.6 | Imported | 2026-05-06 |
| 7 | kimi-k2.6 | 1524.58 | MoonshotAI: Kimi K2.6 moonshotai-kimi-k2.6 | Imported | 2026-05-06 |
| 8 | muse-spark | 1509.38 | — | Imported | 2026-05-06 |
| 9 | claude-opus-4-5-20251101-thinking-32k | 1490.60 | — | Imported | 2026-05-06 |
| 10 | gpt-5.5-high (codex-harness) | 1490.28 | GPT-5.5 openai-gpt-5.5 | Imported | 2026-05-06 |
| 11 | mimo-v2.5-pro | 1476.42 | MiMo-V2.5-Pro xiaomi-mimo-v2.5-pro | Imported | 2026-05-06 |
| 12 | claude-opus-4-5-20251101 | 1467.21 | Claude Opus 4.5 anthropic-claude-opus-4.5 | Imported | 2026-05-06 |
| 13 | qwen3.6-plus | 1463.97 | Qwen3.6 Plus qwen-qwen3.6-plus | Imported | 2026-05-06 |
| 14 | gpt-5.4-high (codex-harness) | 1456.78 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 15 | gemini-3.1-pro-preview | 1454.71 | Gemini 3.1 Pro Preview google-gemini-3.1-pro-preview | Imported | 2026-05-06 |
| 16 | deepseek-v4-pro-thinking | 1454.67 | DeepSeek V4 Pro deepseek-deepseek-v4-pro | Imported | 2026-05-06 |
| 17 | mimo-v2.5 | 1445.56 | MiMo-V2.5 xiaomi-mimo-v2.5 | Imported | 2026-05-06 |
| 18 | gpt-5.5 (codex-harness) | 1441.00 | GPT-5.5 openai-gpt-5.5 | Imported | 2026-05-06 |
| 19 | glm-4.7 | 1439.90 | GLM 4.7 z-ai-glm-4.7 | Imported | 2026-05-06 |
| 20 | gemini-3-pro | 1438.22 | Gemini 3 google-gemini-3 | Imported | 2026-05-06 |
| 21 | gpt-5.4-medium (codex-harness) | 1437.09 | GPT-5.4 openai-gpt-5.4 | Imported | 2026-05-06 |
| 22 | gemini-3-flash | 1437.04 | Gemini 3 Flash Preview google-gemini-3-flash-preview | Imported | 2026-05-06 |
| 23 | glm-5 | 1435.92 | GLM 5 z-ai-glm-5 | Imported | 2026-05-06 |
| 24 | kimi-k2.5-thinking | 1429.73 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 25 | mimo-v2-pro | 1428.51 | MiMo-V2-Pro xiaomi-mimo-v2-pro | Imported | 2026-05-06 |
No matching rows.