HUMAINE

Prolific human-centered LLM benchmark measuring real-world user success, satisfaction, engagement, and preference patterns across demographically broad feedback.

45rows
overall_scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Overall score, Average success, Average satisfaction, Average engagement, Success rate, Conversation count

Latest Results

Model-level statistics over 105282 conversations. Preference aggregates sourced from https://huggingface.co/spaces/ProlificAI/humaine-leaderboard/raw/main/leaderboard-app/public/preference_analysis_frontend.json.

Rank Subject Overall score Model Match Provenance Sampled
1 deepseek/deepseek-r1-0528 3.79 R1 0528
deepseek-deepseek-r1-0528
Imported 2026-05-06
2 openai/o3 3.79 o3
openai-o3
Imported 2026-05-06
3 google/gemini-2.5-pro 3.76 Gemini 2.5 Pro
google-gemini-2.5-pro
Imported 2026-05-06
4 google/gemini-3.1-pro-preview 3.73 Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-06
5 x-ai/grok-4 3.72 GROK Grok 4
x-ai-grok-4
Imported 2026-05-06
6 moonshotai/kimi-k2 3.71 KIMI MoonshotAI: Kimi K2 0711
moonshotai-kimi-k2
Imported 2026-05-06
7 openai/gpt-5.4 3.70 GPT-5.4
openai-gpt-5.4
Imported 2026-05-06
8 openai/gpt-5.5 3.70 GPT-5.5
openai-gpt-5.5
Imported 2026-05-06
9 mistralai/magistral-medium-2506 3.69 Imported 2026-05-06
10 anthropic/claude-opus-4.7 3.68 Claude Opus 4.7
anthropic-claude-opus-4.7
Imported 2026-05-06
11 x-ai/grok-3 3.67 GROK Grok 3
xaigrok-3
Imported 2026-05-06
12 deepseek/deepseek-v4-flash 3.66 DeepSeek V4 Flash
deepseek-deepseek-v4-flash
Imported 2026-05-06
13 google/gemini-2.5-flash 3.66 Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-06
14 deepseek/deepseek-chat-v3-0324 3.64 DeepSeek V3 0324
deepseek-deepseek-chat-v3-0324
Imported 2026-05-06
15 openai/gpt-5-mini 3.63 GPT-5 Mini
openai-gpt-5-mini
Imported 2026-05-06
16 openai/gpt-5 3.61 GPT-5
openai-gpt-5
Imported 2026-05-06
17 x-ai/grok-4.20-beta 3.60 GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-06
18 openai/o4-mini 3.58 o4 Mini
openai-o4-mini
Imported 2026-05-06
19 google/gemma-3-27b-it 3.57 Gemma 3 27B
google-gemma-3-27b-it
Imported 2026-05-06
20 openai/gpt-5.2-chat 3.57 GPT-5.2 Chat
openai-gpt-5.2-chat
Imported 2026-05-06
21 openai/o1-mini 3.57 Imported 2026-05-06
22 google/gemini-2.0-flash-001 3.55 Gemini 2.0 Flash
google-gemini-2.0-flash
Imported 2026-05-06
23 moonshotai/kimi-k2.5 3.55 KIMI MoonshotAI: Kimi K2.5
moonshotai-kimi-k2.5
Imported 2026-05-06
24 openai/gpt-4.1 3.53 GPT-4.1
openai-gpt-4.1
Imported 2026-05-06
25 anthropic/claude-opus-4 3.50 Claude Opus 4
anthropic-claude-opus-4
Imported 2026-05-06
26 anthropic/claude-sonnet-4 3.50 Claude Sonnet 4
anthropic-claude-sonnet-4
Imported 2026-05-06
27 anthropic/claude-sonnet-4.5 3.49 Claude Sonnet 4.5
anthropic-claude-sonnet-4.5
Imported 2026-05-06
28 anthropic/claude-opus-4.6 3.48 Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-06
29 z-ai/glm-4.7 3.47 GLM GLM 4.7
z-ai-glm-4.7
Imported 2026-05-06
30 openai/o1 3.44 o1
openai-o1
Imported 2026-05-06
31 anthropic/claude-3.7-sonnet 3.40 Claude 3.7 Sonnet
anthropic-claude-3.7-sonnet
Imported 2026-05-06
32 cohere/command-a 3.40 C Command A
cohere-command-a
Imported 2026-05-06
33 qwen/qwen3-235b-a22b-2507 3.40 Qwen3 235B A22B Instruct 2507
qwen-qwen3-235b-a22b-2507
Imported 2026-05-06
34 deepseek/deepseek-v3.2 3.39 DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-06
35 openai/o3-mini 3.38 o3-mini
openai-o3-mini
Imported 2026-05-06
36 allenai/olmo-3.1-32b-instruct 3.37 OLMO Olmo 3.1 32B Instruct
allenai-olmo-3.1-32b-instruct
Imported 2026-05-06
37 meta-llama/llama-3.3-70b-instruct 3.37 Llama 3.3 70B Instruct
meta-llama-llama-3.3-70b-instruct
Imported 2026-05-06
38 google/gemini-3-pro 3.34 Gemini 3
google-gemini-3
Imported 2026-05-06
39 minimax/minimax-m2.1 3.34 MiniMax M2.1
minimax-minimax-m2.1
Imported 2026-05-06
40 openai/gpt-4o 3.33 GPT-4o
openai-gpt-4o
Imported 2026-05-06
41 mistralai/mistral-large-3 3.31 Imported 2026-05-06
42 mistralai/mistral-nemo 3.30 Mistral: Mistral Nemo
mistralai-mistral-nemo
Imported 2026-05-06
43 meta-llama/llama-4-maverick 3.27 Llama 4 Maverick
meta-llama-4-maverick
Imported 2026-05-06
44 anthropic/claude-opus-4.5 3.25 Claude Opus 4.5
anthropic-claude-opus-4.5
Imported 2026-05-06
45 cohere/command-r7b-12-2024 3.13 C Command R7B (12-2024)
cohere-command-r7b-12-2024
Imported 2026-05-06