LiveCodeBench
Official LiveCodeBench code-generation leaderboard for contamination-aware coding evaluation over problems collected from Codeforces, LeetCode, and AtCoder.
Rows: 33
Primary metric: pass_at_1
Sampled: 2026-05-28
Metadata
Metrics: Pass@1, Easy Pass@1, Medium Pass@1, Hard Pass@1, Problem count
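The page does not say how Pass@1 is computed. As a minimal sketch, assuming the widely used unbiased pass@k estimator (n samples per problem, c of them correct, averaged over problems), the metric could be reproduced as below. The function names and the example data are hypothetical and are not taken from the leaderboard's own pipeline.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: n samples drawn, c correct."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: every size-k subset contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_1(results: list[tuple[int, int]]) -> float:
    """Average pass@1 over problems, as a percentage.

    Each item is (n_samples, n_correct) for one problem.
    """
    return 100.0 * sum(pass_at_k(n, c, 1) for n, c in results) / len(results)


# Hypothetical per-problem results, for illustration only.
example = [(10, 10), (10, 5), (10, 0)]
print(f"{mean_pass_at_1(example):.2f}%")  # prints "50.00%"
```

For k = 1 the estimator reduces to c / n per problem, so the reported figure is the mean fraction of correct samples. Per-difficulty figures such as Easy Pass@1 would apply the same average to the matching subset of problems.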
Showing 2 latest source slices.
Slice 1: self-reported, sampled 2026-05-28

| Rank | Subject | Pass@1 | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | DeepSeek V4 Pro Max | 93.5% | DeepSeek V4 Pro (`deepseek-deepseek-v4-pro`) | Self-reported | 2026-05-28 |
| 2 | Qwen3.7 Max | 91.6% | Qwen3.7 Max (`qwen-qwen3.7-max`) | Self-reported | 2026-05-28 |
| 3 | Kimi K2.6 Thinking | 89.6% | MoonshotAI: Kimi K2.6 (`moonshotai-kimi-k2.6`) | Self-reported | 2026-05-28 |
| 4 | Claude Opus 4.6 Max | 88.8% | Claude Opus 4.6 (`anthropic-claude-opus-4.6`) | Self-reported | 2026-05-28 |
| 5 | Qwen3.6 Plus | 87.1% | Qwen3.6 Plus (`qwen-qwen3.6-plus`) | Self-reported | 2026-05-28 |

Slice 2: imported, sampled 2026-05-06

| Rank | Subject | Pass@1 | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | O4-Mini (High) | 80.20% | o4 Mini High (`openai-o4-mini-high`) | Imported | 2026-05-06 |
| 2 | O3 (High) | 75.80% | o3 (`openai-o3`) | Imported | 2026-05-06 |
| 3 | O4-Mini (Medium) | 74.20% | o4 Mini (`openai-o4-mini`) | Imported | 2026-05-06 |
| 4 | Gemini-2.5-Pro-06-05 | 73.60% | — | Imported | 2026-05-06 |
| 5 | DeepSeek-R1-0528 | 73.10% | R1 0528 (`deepseek-deepseek-r1-0528`) | Imported | 2026-05-06 |
| 6 | Gemini-2.5-Pro-05-06 | 71.80% | — | Imported | 2026-05-06 |
| 7 | EXAONE-4.0-32B | 70.00% | — | Imported | 2026-05-06 |
| 8 | OpenReasoning-Nemotron-32B | 69.80% | — | Imported | 2026-05-06 |
| 9 | O3-Mini-2025-01-31 (High) | 67.40% | o3 Mini High (`openai-o3-mini-high`) | Imported | 2026-05-06 |
| 10 | OpenCodeReasoning-Nemotron-1.1-32B | 66.80% | — | Imported | 2026-05-06 |
| 11 | Grok-3-Mini (High) | 66.70% | Grok 3 Mini (`x-ai-grok-3-mini`) | Imported | 2026-05-06 |
| 12 | O4-Mini (Low) | 65.90% | o4 Mini (`openai-o4-mini`) | Imported | 2026-05-06 |
| 13 | Qwen3-235B-A22B | 65.90% | Qwen3 235B A22B (`qwen-qwen3-235b-a22b`) | Imported | 2026-05-06 |
| 14 | XBai-o4-medium | 65.00% | — | Imported | 2026-05-06 |
| 15 | O3-Mini-2025-01-31 (Med) | 63.00% | o3-mini (`openai-o3-mini`) | Imported | 2026-05-06 |
| 16 | Gemini-2.5-Flash-05-20 | 61.90% | — | Imported | 2026-05-06 |
| 17 | Gemini-2.5-Flash-04-17 | 60.60% | — | Imported | 2026-05-06 |
| 18 | O3-Mini-2025-01-31 (Low) | 57.00% | o3-mini (`openai-o3-mini`) | Imported | 2026-05-06 |
| 19 | Claude-Opus-4 (Thinking) | 56.60% | Claude Opus 4 (`anthropic-claude-opus-4`) | Imported | 2026-05-06 |
| 20 | Claude-Sonnet-4 (Thinking) | 55.90% | Claude Sonnet 4 (`anthropic-claude-sonnet-4`) | Imported | 2026-05-06 |
| 21 | Claude-Sonnet-4 | 47.10% | Claude Sonnet 4 (`anthropic-claude-sonnet-4`) | Imported | 2026-05-06 |
| 22 | Claude-Opus-4 | 46.90% | Claude Opus 4 (`anthropic-claude-opus-4`) | Imported | 2026-05-06 |
| 23 | Claude-3.5-Sonnet-20241022 | 36.40% | Claude 3.5 Sonnet (`anthropic-claude-3.5-sonnet`) | Imported | 2026-05-06 |
| 24 | GPT-4O-2024-08-06 | 29.50% | GPT-4o (2024-08-06) (`openai-gpt-4o-2024-08-06`) | Imported | 2026-05-06 |
| 25 | GPT-4-Turbo-2024-04-09 | 28.70% | GPT-4 Turbo (`openai-gpt-4-turbo`) | Imported | 2026-05-06 |
| 26 | GPT-4O-mini-2024-07-18 | 27.50% | GPT-4o-mini (2024-07-18) (`openai-gpt-4o-mini-2024-07-18`) | Imported | 2026-05-06 |
| 27 | DeepSeek-V3 | 27.20% | DeepSeek V3 (`deepseek-deepseek-chat`) | Imported | 2026-05-06 |
| 28 | Claude-3-Haiku | 20.20% | Claude 3 Haiku (`anthropic-claude-3-haiku`) | Imported | 2026-05-06 |