LiveCodeBench

Official LiveCodeBench code-generation leaderboard for contamination-aware coding evaluation over problems collected from Codeforces, LeetCode, and AtCoder.

33rows
pass_at_1primary metric
2026-05-28sampled

Metadata

Metrics

Pass@1, Easy Pass@1, Medium Pass@1, Hard Pass@1, Problem count

Showing 2 latest source slices.

Latest Results

Provider-published Qwen3.7-Max comparison scores. Rows are marked self-reported and should be interpreted as source claims unless independently reproduced.

Rank Subject Pass@1 Model Match Provenance Sampled
1 DeepSeek V4 Pro Max 93.5% DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Self-reported 2026-05-28
2 Qwen3.7 Max 91.6% Qwen3.7 Max
qwen-qwen3.7-max
Self-reported 2026-05-28
3 Kimi K2.6 Thinking 89.6% KIMI MoonshotAI: Kimi K2.6
moonshotai-kimi-k2.6
Self-reported 2026-05-28
4 Claude Opus 4.6 Max 88.8% Claude Opus 4.6
anthropic-claude-opus-4.6
Self-reported 2026-05-28
5 Qwen3.6 Plus 87.1% Qwen3.6 Plus
qwen-qwen3.6-plus
Self-reported 2026-05-28
1 O4-Mini (High) 80.20 o4 Mini High
openai-o4-mini-high
Imported 2026-05-06
2 O3 (High) 75.80 o3
openai-o3
Imported 2026-05-06
3 O4-Mini (Medium) 74.20 o4 Mini
openai-o4-mini
Imported 2026-05-06
4 Gemini-2.5-Pro-06-05 73.60 Imported 2026-05-06
5 DeepSeek-R1-0528 73.10 R1 0528
deepseek-deepseek-r1-0528
Imported 2026-05-06
6 Gemini-2.5-Pro-05-06 71.80 Imported 2026-05-06
7 EXAONE-4.0-32B 70 Imported 2026-05-06
8 OpenReasoning-Nemotron-32B 69.80 Imported 2026-05-06
9 O3-Mini-2025-01-31 (High) 67.40 o3 Mini High
openai-o3-mini-high
Imported 2026-05-06
10 OpenCodeReasoning-Nemotron-1.1-32B 66.80 Imported 2026-05-06
11 Grok-3-Mini (High) 66.70 GROK Grok 3 Mini
x-ai-grok-3-mini
Imported 2026-05-06
12 O4-Mini (Low) 65.90 o4 Mini
openai-o4-mini
Imported 2026-05-06
13 Qwen3-235B-A22B 65.90 Qwen3 235B A22B
qwen-qwen3-235b-a22b
Imported 2026-05-06
14 XBai-o4-medium 65 Imported 2026-05-06
15 O3-Mini-2025-01-31 (Med) 63 o3-mini
openai-o3-mini
Imported 2026-05-06
16 Gemini-2.5-Flash-05-20 61.90 Imported 2026-05-06
17 Gemini-2.5-Flash-04-17 60.60 Imported 2026-05-06
18 O3-Mini-2025-01-31 (Low) 57 o3-mini
openai-o3-mini
Imported 2026-05-06
19 Claude-Opus-4 (Thinking) 56.60 Claude Opus 4
anthropic-claude-opus-4
Imported 2026-05-06
20 Claude-Sonnet-4 (Thinking) 55.90 Claude Sonnet 4
anthropic-claude-sonnet-4
Imported 2026-05-06
21 Claude-Sonnet-4 47.10 Claude Sonnet 4
anthropic-claude-sonnet-4
Imported 2026-05-06
22 Claude-Opus-4 46.90 Claude Opus 4
anthropic-claude-opus-4
Imported 2026-05-06
23 Claude-3.5-Sonnet-20241022 36.40 Claude 3.5 Sonnet
anthropic-claude-3.5-sonnet
Imported 2026-05-06
24 GPT-4O-2024-08-06 29.50 GPT-4o (2024-08-06)
openai-gpt-4o-2024-08-06
Imported 2026-05-06
25 GPT-4-Turbo-2024-04-09 28.70 GPT-4 Turbo
openai-gpt-4-turbo
Imported 2026-05-06
26 GPT-4O-mini-2024-07-18 27.50 GPT-4o-mini (2024-07-18)
openai-gpt-4o-mini-2024-07-18
Imported 2026-05-06
27 DeepSeek-V3 27.20 DeepSeek V3
deepseek-deepseek-chat
Imported 2026-05-06
28 Claude-3-Haiku 20.20 Claude 3 Haiku
anthropic-claude-3-haiku
Imported 2026-05-06